Gene Tmz1t_3699 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTmz1t_3699 
Symbol 
ID7873698 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameThauera sp. MZ1T 
KingdomBacteria 
Replicon accessionNC_011662 
Strand
Start bp4065246 
End bp4066523 
Gene Length1278 bp 
Protein Length425 aa 
Translation table11 
GC content71% 
IMG OID643700645 
Productdihydroorotase 
Protein accessionYP_002890669 
Protein GI237654355 
COG category[F] Nucleotide transport and metabolism 
COG ID[COG0044] Dihydroorotase and related cyclic amidohydrolases 
TIGRFAM ID[TIGR00857] dihydroorotase, multifunctional complex type 


Plasmid Coverage information

Num covering plasmid clones27 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAACATCC TGATTTCCAA CGGCCGCGTC GTCGATCCGG CCAATCGCAC CGATGCGGTG 
CAGAACGTCT ACGTCGCAGG AGGCAAGATC GTCGCCCTCG GCCAGGCGCC GGACGGCTTC
GTCGCCGAGC GCACGATCGA CGCCGGCGGT CTCGTGGTCG CCCCCGGCTT CATCGACCTC
GCCGCGCGCC TGCGCGAACC CGGCTACGAG TACCGCGCCA CGCTCGAATC CGAAATGGAG
GCGGCGATGG CCGGCGGCGT CACCAGCCTG GCGATCCCGC CCGACACCGA CCCCGTGCTC
GACGAGCCCG GCCTGGTCGA GATGCTGACC TATCGCGCCA AGAAGCTGAA CCGCGCCCAC
ATCTATCCGG TCGGCGCACT CACCATCGGC CTTCAGGGCG AGCGCCTGTC CGAGATGGCC
GAACTGGTCG AGGCCGGCTG CGTCGCCTTC TCGCAGGCCA ACGTGCCGCT GGTCGACAAC
ACCGTGCTGA TGCGCGCGCT GCAATACGCC GCCACTTTCG GCTTCCGCGT CTGGCTGCAG
CCGCTGGCGC CCTTCCTGTC GCAGGTGGGC CACGCCCACG ACGGCGAGGT GGCGACGCGG
CTGGGCCTGT CGGGCATCCC GGTCGCCGCC GAAACCGTGG CGCTCTACAC CTACCTCGAG
CTCGCGCGCA TCACCGGCGC CCGCCTGCAC ATCACCCGCC TGTCCTCGGC CGCCGGCCTC
GCGCTCATCG ACCAGGCGCG CGCAGAAGGC ATGGACGTGA CCTGCGACGT GTCGATCAAC
CATGTGCACC TGTGCGACAT GGACATCGGC TACTTCAACC CCAACTGCCA CCTCGTCCCG
CCGCTGCGCA GCCAGCGCGA CCGCGAGGCA CTCGCCCGGG GCCTGGCCGA GGGCCGCATC
GACGCGCTGT GCTCGGACCA CACCCCGGTG GACGACGACG CCAAGCAGAC GCCGTTCTCC
GAATCCGAAC CCGGCGCCAC CGGCCTCGAG CTGCTGCTGC CGCTGACGCT GAAGTGGGCC
GACCGCGCCG GGCTGGCGCT GCTGGACGGG CTGGCCCGCA TCACCTCGGA CGCGGCGAAG
ATCGTCGGCA TCACCAAGGC CGGCCACCTC TCGGTGGGCG CGCGCGCCGA CGTGTGCGTG
TTCGACCCCG CGACCCACGT CACCATCACC CGCGAGGGCC TCCGGAGCCA GGGCAAGAAC
ACGCCCTTCC TCGGCATGGA GCTGCCGGGC AAGGTGCGCT ACACGCTGGT CGAGGGGCAG
GTGATGTTCG AGGGCTGA
 
Protein sequence
MNILISNGRV VDPANRTDAV QNVYVAGGKI VALGQAPDGF VAERTIDAGG LVVAPGFIDL 
AARLREPGYE YRATLESEME AAMAGGVTSL AIPPDTDPVL DEPGLVEMLT YRAKKLNRAH
IYPVGALTIG LQGERLSEMA ELVEAGCVAF SQANVPLVDN TVLMRALQYA ATFGFRVWLQ
PLAPFLSQVG HAHDGEVATR LGLSGIPVAA ETVALYTYLE LARITGARLH ITRLSSAAGL
ALIDQARAEG MDVTCDVSIN HVHLCDMDIG YFNPNCHLVP PLRSQRDREA LARGLAEGRI
DALCSDHTPV DDDAKQTPFS ESEPGATGLE LLLPLTLKWA DRAGLALLDG LARITSDAAK
IVGITKAGHL SVGARADVCV FDPATHVTIT REGLRSQGKN TPFLGMELPG KVRYTLVEGQ
VMFEG