Gene Tmz1t_1050 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTmz1t_1050 
Symbol 
ID7084034 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameThauera sp. MZ1T 
KingdomBacteria 
Replicon accessionNC_011662 
Strand
Start bp1151613 
End bp1153397 
Gene Length1785 bp 
Protein Length594 aa 
Translation table11 
GC content72% 
IMG OID643698068 
ProductNa/Pi-cotransporter II-related protein 
Protein accessionYP_002354708 
Protein GI217969474 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG1283] Na+/phosphate symporter 
TIGRFAM ID[TIGR00704] Na/Pi-cotransporter 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.231321 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
TTGCCATCCG GTCCAAAGCG CGATGGTCCC CGCCGCGAAC AATGCCCTCC CCGCCCGCGC 
GATCCGGCAT TACGCTGTCG TGACGCCCCC TTCCCGCACG AGAGGCCCGC CCTGGACACC
AGCCCGCTCG CCATCTATCC GATCGTCGCC GGCCTGCTCG GCGGCATCGG CCTGTTCCTG
CTCGGCATGC ACATGCTCAC CGAGGGCCTC AAGCTCGCCG CCGGCCGCGC GCTCGAGGGC
CTGCTCGAGC GCGGCACCGC CACGCCGGTG CGCGGCCTCG GCGCCGGCAT GACCATGACC
GCGCTGGTGC AGTCCTCCAC CGCGGTCACG GTGGCGAGCA TCGGCTTCGT CAACACCGGG
CTGCTGTCGC TGCAGAACGC GATGTGGGTG ATCTTCGGCT CCAACGTCGG CACCACGCTC
AACGCCTGGC TGGTGGCGGC GCTCGGCTTC AGCTTCCGGA TCGACGCCTT CGCGCTGCCC
TTCGTCGGCA TCGGCGCGAC CCTGATGCTG GCCGGACGCA CGGTCCGCCA GCGCGCGCTC
GGCCAGGCGC TGGCCGGCTT CGGCGTGCTC TTCCTCGGCA TCGACGCACT CAAGGACACC
TTCTCGGGTT TCGGCGCGAC CATGAACCTG CAGGACCACA TCGCACCCGG CTTCACCGGC
TGGCTGATCC TGGTCGGCAT CGGCACCGTG CTGACGGTGC TGATGCAGGC CTCGGGCGCG
GTGATCGCGA TCATCATCAC TGCCGCCCAG GGCGGGCTGA TGTCGATCGA GGCGGCGTGC
GCGATGGTGA TCGGCACCAA CATCGGCACG ACCTCGACGG CGATCCTGTC GGCGCTGGGC
GCGACCTCGA ACGCGCGCCG GGTCGCGGCC ACCCACGTGA TCTTCAACCT GGTGACCGGC
GCGGTGGCGA TCGCGCTGCT GCCGCTGCTG ATCGGCCTGC TCGGCGCGCT GCGCAACTGG
TTCGAGCAGC CGGCCACGCC GGCGGTGATG CTGGCAATGT TCCACACCGC CTTCAACGTG
CTCGGGGTGC TGCTGATGGT GCCGCTGGCA CGCCCCCTGC GCCGATTCCT GGCGAGCCAC
TTCCGCAGCC GCGAGGAGGA GATCGCCCGC CCGCGCTATC TCGACGCGCC CTCGCTGGCG
GTGCCCGATC TGGCGCTGCG CGCGCTGCGC CTGGAGTTGG GCCGCACGCA GTCCTTCGCC
CTCACCGCGC TCACCGCCGC CACGCGGGTG CCGCCGGACG AGCCCTGGAT CGAGCGTCAG
GCGAGCACGC TCGATGCGCT GGCGCCGGCG ATCGGCGACT ACGTACGCAA GCTCAACGCC
ATGCGCCTGC CGCCGGCGCT GGTCGAGGCG GTGGCGCACA GCCTGCGCGC GCTGCAGTAT
CAGGAGAGCG CGGTCACCGC GGTGCGCCAG GCGTGCGCGC TGGCCACCAC GCTCGGCGCG
CCGCCGGGCA GCGAGCTCGA GCCGATCGCC GCGGCCTTCC GCCAGGCCAC CGGGGCGCTG
GCCGCGAGCG CGGACGCGGC GCGCGAGGAC TTCTCCGCGC CGGCGGTCGA GGTGCGGCTG
GAGGAGGCGG AGCTGCGCTA CCAGGCCTTC AAGGAGGCGC TGCTGCTGGA GGGTGCGCAC
GCCCGCATCG ACATTCGCAC GCTGCAGGAC TGGCTGCGCC TGGCCAGCCT GGAGCGCCGC
GCGATCGAGC AGGTGGCCAA GGCGGCGCGC ATGCTGGCGG TACTGGACGG CGACCTGACG
CCGCAGCAGG CGGAGGAAAA GGCGGGAGAG AACGGGGTGG AGTGA
 
Protein sequence
MPSGPKRDGP RREQCPPRPR DPALRCRDAP FPHERPALDT SPLAIYPIVA GLLGGIGLFL 
LGMHMLTEGL KLAAGRALEG LLERGTATPV RGLGAGMTMT ALVQSSTAVT VASIGFVNTG
LLSLQNAMWV IFGSNVGTTL NAWLVAALGF SFRIDAFALP FVGIGATLML AGRTVRQRAL
GQALAGFGVL FLGIDALKDT FSGFGATMNL QDHIAPGFTG WLILVGIGTV LTVLMQASGA
VIAIIITAAQ GGLMSIEAAC AMVIGTNIGT TSTAILSALG ATSNARRVAA THVIFNLVTG
AVAIALLPLL IGLLGALRNW FEQPATPAVM LAMFHTAFNV LGVLLMVPLA RPLRRFLASH
FRSREEEIAR PRYLDAPSLA VPDLALRALR LELGRTQSFA LTALTAATRV PPDEPWIERQ
ASTLDALAPA IGDYVRKLNA MRLPPALVEA VAHSLRALQY QESAVTAVRQ ACALATTLGA
PPGSELEPIA AAFRQATGAL AASADAARED FSAPAVEVRL EEAELRYQAF KEALLLEGAH
ARIDIRTLQD WLRLASLERR AIEQVAKAAR MLAVLDGDLT PQQAEEKAGE NGVE