Gene Information |
Locus tag | Tmz1t_1050 |
Symbol | |
ID | 7084034 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thauera sp. MZ1T |
Kingdom | Bacteria |
Replicon accession | NC_011662 |
Strand | - |
Start bp | 1151613 |
End bp | 1153397 |
Gene Length | 1785 bp |
Protein Length | 594 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | 643698068 |
Product | Na/Pi-cotransporter II-related protein |
Protein accession | YP_002354708 |
Protein GI | 217969474 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG1283] Na+/phosphate symporter |
TIGRFAM ID | [TIGR00704] Na/Pi-cotransporter |
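
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below is illustrative only: the helper names `gene_length` and `protein_length` are hypothetical, and it assumes that Start bp and End bp are 1-based inclusive positions and that the CDS span includes the stop codon. Both assumptions agree with the values listed in this record.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residue count, assuming the CDS span includes one stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Values taken from the record for Tmz1t_1050.
length_bp = gene_length(1151613, 1153397)
print(length_bp)                  # 1785
print(protein_length(length_bp))  # 594
```

The results match the Gene Length (1785 bp) and Protein Length (594 aa) fields.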
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.231321 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | TTGCCATCCG GTCCAAAGCG CGATGGTCCC CGCCGCGAAC AATGCCCTCC CCGCCCGCGC GATCCGGCAT TACGCTGTCG TGACGCCCCC TTCCCGCACG AGAGGCCCGC CCTGGACACC AGCCCGCTCG CCATCTATCC GATCGTCGCC GGCCTGCTCG GCGGCATCGG CCTGTTCCTG CTCGGCATGC ACATGCTCAC CGAGGGCCTC AAGCTCGCCG CCGGCCGCGC GCTCGAGGGC CTGCTCGAGC GCGGCACCGC CACGCCGGTG CGCGGCCTCG GCGCCGGCAT GACCATGACC GCGCTGGTGC AGTCCTCCAC CGCGGTCACG GTGGCGAGCA TCGGCTTCGT CAACACCGGG CTGCTGTCGC TGCAGAACGC GATGTGGGTG ATCTTCGGCT CCAACGTCGG CACCACGCTC AACGCCTGGC TGGTGGCGGC GCTCGGCTTC AGCTTCCGGA TCGACGCCTT CGCGCTGCCC TTCGTCGGCA TCGGCGCGAC CCTGATGCTG GCCGGACGCA CGGTCCGCCA GCGCGCGCTC GGCCAGGCGC TGGCCGGCTT CGGCGTGCTC TTCCTCGGCA TCGACGCACT CAAGGACACC TTCTCGGGTT TCGGCGCGAC CATGAACCTG CAGGACCACA TCGCACCCGG CTTCACCGGC TGGCTGATCC TGGTCGGCAT CGGCACCGTG CTGACGGTGC TGATGCAGGC CTCGGGCGCG GTGATCGCGA TCATCATCAC TGCCGCCCAG GGCGGGCTGA TGTCGATCGA GGCGGCGTGC GCGATGGTGA TCGGCACCAA CATCGGCACG ACCTCGACGG CGATCCTGTC GGCGCTGGGC GCGACCTCGA ACGCGCGCCG GGTCGCGGCC ACCCACGTGA TCTTCAACCT GGTGACCGGC GCGGTGGCGA TCGCGCTGCT GCCGCTGCTG ATCGGCCTGC TCGGCGCGCT GCGCAACTGG TTCGAGCAGC CGGCCACGCC GGCGGTGATG CTGGCAATGT TCCACACCGC CTTCAACGTG CTCGGGGTGC TGCTGATGGT GCCGCTGGCA CGCCCCCTGC GCCGATTCCT GGCGAGCCAC TTCCGCAGCC GCGAGGAGGA GATCGCCCGC CCGCGCTATC TCGACGCGCC CTCGCTGGCG GTGCCCGATC TGGCGCTGCG CGCGCTGCGC CTGGAGTTGG GCCGCACGCA GTCCTTCGCC CTCACCGCGC TCACCGCCGC CACGCGGGTG CCGCCGGACG AGCCCTGGAT CGAGCGTCAG GCGAGCACGC TCGATGCGCT GGCGCCGGCG ATCGGCGACT ACGTACGCAA GCTCAACGCC ATGCGCCTGC CGCCGGCGCT GGTCGAGGCG GTGGCGCACA GCCTGCGCGC GCTGCAGTAT CAGGAGAGCG CGGTCACCGC GGTGCGCCAG GCGTGCGCGC TGGCCACCAC GCTCGGCGCG CCGCCGGGCA GCGAGCTCGA GCCGATCGCC GCGGCCTTCC GCCAGGCCAC CGGGGCGCTG GCCGCGAGCG CGGACGCGGC GCGCGAGGAC TTCTCCGCGC CGGCGGTCGA GGTGCGGCTG GAGGAGGCGG AGCTGCGCTA CCAGGCCTTC AAGGAGGCGC TGCTGCTGGA GGGTGCGCAC GCCCGCATCG ACATTCGCAC GCTGCAGGAC TGGCTGCGCC TGGCCAGCCT GGAGCGCCGC GCGATCGAGC AGGTGGCCAA GGCGGCGCGC ATGCTGGCGG TACTGGACGG CGACCTGACG CCGCAGCAGG CGGAGGAAAA GGCGGGAGAG AACGGGGTGG AGTGA
|
Protein sequence | MPSGPKRDGP RREQCPPRPR DPALRCRDAP FPHERPALDT SPLAIYPIVA GLLGGIGLFL LGMHMLTEGL KLAAGRALEG LLERGTATPV RGLGAGMTMT ALVQSSTAVT VASIGFVNTG LLSLQNAMWV IFGSNVGTTL NAWLVAALGF SFRIDAFALP FVGIGATLML AGRTVRQRAL GQALAGFGVL FLGIDALKDT FSGFGATMNL QDHIAPGFTG WLILVGIGTV LTVLMQASGA VIAIIITAAQ GGLMSIEAAC AMVIGTNIGT TSTAILSALG ATSNARRVAA THVIFNLVTG AVAIALLPLL IGLLGALRNW FEQPATPAVM LAMFHTAFNV LGVLLMVPLA RPLRRFLASH FRSREEEIAR PRYLDAPSLA VPDLALRALR LELGRTQSFA LTALTAATRV PPDEPWIERQ ASTLDALAPA IGDYVRKLNA MRLPPALVEA VAHSLRALQY QESAVTAVRQ ACALATTLGA PPGSELEPIA AAFRQATGAL AASADAARED FSAPAVEVRL EEAELRYQAF KEALLLEGAH ARIDIRTLQD WLRLASLERR AIEQVAKAAR MLAVLDGDLT PQQAEEKAGE NGVE
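
The gene sequence is listed in the coding direction: it begins with TTG and ends with the stop codon TGA, while the protein begins with M. Under translation table 11, TTG is an accepted alternative start codon and is read as methionine at the initiation position. The sketch below shows how the two sequence fields relate. It is a minimal illustration and not the pipeline that produced this record. The names `translate_cds` and `gc_fraction` are hypothetical, and the start codon set is the one commonly listed for table 11.

```python
BASES = "TCAG"
# Amino acid assignments in TCAG order; table 11 shares these with the
# standard code and differs only in which codons may initiate translation.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}
# Assumed set of table 11 initiation codons.
START_CODONS_TABLE_11 = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def translate_cds(dna: str) -> str:
    """Translate a coding-strand CDS, reading an initial start codon as M."""
    dna = "".join(dna.split()).upper()  # drop the 10-base block spacing
    usable = len(dna) - len(dna) % 3    # ignore a trailing partial codon
    protein = []
    for pos in range(0, usable, 3):
        codon = dna[pos:pos + 3]
        if pos == 0 and codon in START_CODONS_TABLE_11:
            residue = "M"
        else:
            residue = CODON_TABLE[codon]
        if residue == "*":              # stop codon ends the protein
            break
        protein.append(residue)
    return "".join(protein)


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in a sequence, ignoring whitespace."""
    dna = "".join(dna.split()).upper()
    return sum(base in "GC" for base in dna) / len(dna)


# First three 10-base blocks of the gene sequence above.
print(translate_cds("TTGCCATCCG GTCCAAAGCG CGATGGTCCC"))  # MPSGPKRDGP
```

Passing the full gene sequence to `translate_cds` and `gc_fraction` is one way to cross-check the Protein sequence and GC content fields.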