Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cthe_2810 |
Symbol | |
ID | 4809647 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | + |
Start bp | 3314844 |
End bp | 3316598 |
Gene Length | 1755 bp |
Protein Length | 584 aa |
Translation table | 11 |
GC content | 43% |
IMG OID | 640108230 |
Product | Na/Pi-cotransporter II-related protein |
Protein accession | YP_001039202 |
Protein GI | 125975292 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG1283] Na+/phosphate symporter |
TIGRFAM ID | [TIGR00704] Na/Pi-cotransporter |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGATCTGT TTGGTTTTTT AGAGCTTATA GGCGGCTTGT CCCTGTTCCT CTTTGGTATG AGCCTTATGG GAACAGGTCT TGAAAAAAGT GCGGGCAACA AGCTGAAGGT CTTGCTGGAG CGTTTGACGT CGAAAAAGCT GAATGGTTTT TTGATGGGTC TGGCGGTGAC CGCCATCATA CAAAGCTCTT CAGCTACTAC AGTTATGGTG GTTGGGTTCG TAAACTCCGG TATAATGACC CTAAAGCAAG CGATACACGT TATCATGGGA GCAAACGTCG GTACGACGGT TACAGCATGG ATTTTAAGCC TGGCGGGAAT TGAAGGCAGC AATTTGTTTT TAAAATTATT AAAGCCTTCT TCTTTTACGC CGGTGCTGGC TTTAGTTGGT ATTATATATT ATTTGTTTAT CAAAAATGAA AGAAAGAAAG ATATTGGATT AATTCTGCTT GGGTTTGCAA CGCTGATTTA CGGTATGGAA GGAATGTCGG CGGCAGTTAG GCCGTTAGGT GAAATGGAAG AATTTCGCAA TATACTGTTA ATTTTTTCAA ATCCAGTACT TGGGGTGCTG CTTGGTGCCA TTGTGACAGG TATTATCCAA AGTTCGTCCG CTTCGGTGGG AATCCTTCAG GCCTTGTCCG CTACGGGACA GGTCACGATG GGAACAGCTA TACCTATAAT AATGGGGCAG AATATCGGTA CTTGTGTTAC AGCGTTAATT TCATCGGTGG GAACAAATAA AAATGCCCGT CGTGCGGCAA TGGTTCACCT TTACTTCAAT CTTATCGGTA CAGTGGTATT TCTTACGTTG TTTACCGTAT TAGACAGTTT GTTTAATTTC GCTTTTCTTG ATTGGCCGTC AAATCACTTC TTTATTGCCG TTGTCCATTC GTTGTTCAAT ATTTTGTGTA CGGCTATGCT TCTTCCCTTT AGCGGACTTC TTGAAAAGCT GGCATACAAG ACTATAAAAG AGGACGATAA AAAGGACAAG GTGTCACTGC TTGATCCTCG TTTGTTTTCA ACACCTGCCA TAGCGATCAA CCGCAGCAGG GAGATTGCAA GAGAAATGGC GTATGCTTCA GTGGATGCTA TAAAGAAGGC AGTTGCTCTG GTTAACAATT ATGACGAAAA AGAAGCACAA AAGGTAAGGG ATGCGGAGCA GCAAACTGAC AAATACGAGG ATGCTCTGGG AACTTATCTT GTGAGGCTTT CAAGCCAGAA TCTTTCGGCC CGGGATACCA CGGAAGCTGC AAAACTGCTT TTCCTGATAG GAGAGTTTGA GCGCATTGCG GATTACGCAT TAAATATTGT GGATTCCGCC CAGGAAATGT ATGATAAAAA GATGCAGTTT TCCGAATCGG CCAAAAAAGA ACTGGGTGTT ATGATGTCGG CTGTGCAGGA AGTGGTTGAG ACAGCCGTCA GGGCATTTGA CGGCAACGAT GTTTTACTGG CGGCTAAGGT TGACCCTCTT GAGGAAGTAA TTGATGGCCT TAAGAGTACG CTTAAAAAGA AACATATTGA AAGATTGCAG TGTAACAAGT GTACAATTGA GATGGGATTT GTGTTTTCCG ATTTGATAAC GGCATTGGAG CGTATATCCG ACCACTGTTC CAACATCGCA GGGTGCTTAA TAGAGATGTC CCACGATAGT CTGAACATGC ACAGTTATCT GTACAAGCTC AAGCATGAGC CAAATGATGA ATTCCTCAGG CAATTTAATG AGTATTCGGC CAAATATGCC ATAGACGGAA ATTAA
|
Protein sequence | MDLFGFLELI GGLSLFLFGM SLMGTGLEKS AGNKLKVLLE RLTSKKLNGF LMGLAVTAII QSSSATTVMV VGFVNSGIMT LKQAIHVIMG ANVGTTVTAW ILSLAGIEGS NLFLKLLKPS SFTPVLALVG IIYYLFIKNE RKKDIGLILL GFATLIYGME GMSAAVRPLG EMEEFRNILL IFSNPVLGVL LGAIVTGIIQ SSSASVGILQ ALSATGQVTM GTAIPIIMGQ NIGTCVTALI SSVGTNKNAR RAAMVHLYFN LIGTVVFLTL FTVLDSLFNF AFLDWPSNHF FIAVVHSLFN ILCTAMLLPF SGLLEKLAYK TIKEDDKKDK VSLLDPRLFS TPAIAINRSR EIAREMAYAS VDAIKKAVAL VNNYDEKEAQ KVRDAEQQTD KYEDALGTYL VRLSSQNLSA RDTTEAAKLL FLIGEFERIA DYALNIVDSA QEMYDKKMQF SESAKKELGV MMSAVQEVVE TAVRAFDGND VLLAAKVDPL EEVIDGLKST LKKKHIERLQ CNKCTIEMGF VFSDLITALE RISDHCSNIA GCLIEMSHDS LNMHSYLYKL KHEPNDEFLR QFNEYSAKYA IDGN
|
| |