Gene Tmz1t_0161 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTmz1t_0161 
Symbol 
ID7085258 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameThauera sp. MZ1T 
KingdomBacteria 
Replicon accessionNC_011662 
Strand
Start bp186140 
End bp187246 
Gene Length1107 bp 
Protein Length368 aa 
Translation table11 
GC content73% 
IMG OID643697203 
Productextracellular solute-binding protein family 1 
Protein accessionYP_002353852 
Protein GI217968618 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG1840] ABC-type Fe3+ transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGGATCA GGGGTGGGAG GGCCGCGCTG AAGACGGTGC TGGCGGTCGT GGCGGCGGTG 
GCGATCGCCC CGGCGCGGGC CGAGCCGATC GCCGATCGCT TCGATCAGGC ACGCCTGGAG
GGCGTGGTGG TCGTCTATGC GGCGACCGAC CTCGCGGTGG TCAAGCCGGT CATCGACGAC
TTCGAGGCCC TCCATCCCGG CGTTCGGGTG CAGTACCACG ACATGCACTC GGCCGAACTC
CATGCGCGCG TGGTCGACGA GGCCCGGCGC GGGCTGGCCG GTGCCGACGT GGTGTGGAGC
TCGGCGATGG ACCTGCAGGT GAAGCTGGTC AACGACGGCC ACGCCCAGCC GCACCGCTCC
GCCGAGACCG CGGCGCTGCC GCGCTGGGCG GTGTGGAAGG ACGAGGCCTT CGGCACCACC
TACGAGCCGG CGGTGATCGT CTACAACAAG CATCTGCTCG GCACGACCGA GGTGCCCGAC
AGCCATGCGG AGCTGATCCG CCTGCTCGAT CGCGACCCGG CGCCGTTGCG CGGGCGCATC
GCCACCTACG ACCCCGAGCG CTCCGGCCTC GGCCTGCTGC TGCACACGCA GGACGCGCAG
GCCAACCCGA TCGTGTTCTG GCAGCTCGCG CGCGGCATGG GCCGGCAGGG CCTGGAGCAG
CACGCGGCGA GCAGCGAGAT GCTCGACCGC GTCGCCGCGG GCAAGCTGGT GCTCGCCTAC
AACGTGCTGG GCTCGTACGC GCACCGGCGG GCGCGCAGCG ATCCGGCGCT CGGGGTGGCG
CTGCCGCGGG ACTACACGCT GGTGCTGAGC CGGGTCGCCT TCATCGTGCG TGGCGCGCGT
CATCCGGCGG CGGCGCGCCT GTGGCTCGAT CATCTGCTGT CGACCCGAGG CCAGGCCCTG
CTCGCCGCCA ACCTCGGCCT GCTGCCGGTG CGCACCGACG CCGGCACCGC GGGCGCGGAC
AGCGCTGCCG CGCTGCTCCA CCACAACCTG CAGCATGCCT TCCGCCCGAT CCGCATCGGC
TCCGGGCTGC TCGCCTACCA GGACCAGGCC AAGAAGCAGG CCTTCCTGCG CCAGTGGGAT
GCGGCGACGC GGCCGGCTTC CGAGTGA
 
Protein sequence
MRIRGGRAAL KTVLAVVAAV AIAPARAEPI ADRFDQARLE GVVVVYAATD LAVVKPVIDD 
FEALHPGVRV QYHDMHSAEL HARVVDEARR GLAGADVVWS SAMDLQVKLV NDGHAQPHRS
AETAALPRWA VWKDEAFGTT YEPAVIVYNK HLLGTTEVPD SHAELIRLLD RDPAPLRGRI
ATYDPERSGL GLLLHTQDAQ ANPIVFWQLA RGMGRQGLEQ HAASSEMLDR VAAGKLVLAY
NVLGSYAHRR ARSDPALGVA LPRDYTLVLS RVAFIVRGAR HPAAARLWLD HLLSTRGQAL
LAANLGLLPV RTDAGTAGAD SAAALLHHNL QHAFRPIRIG SGLLAYQDQA KKQAFLRQWD
AATRPASE