Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cthe_2676 |
Symbol | |
ID | 4808844 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | + |
Start bp | 3156409 |
End bp | 3157839 |
Gene Length | 1431 bp |
Protein Length | 476 aa |
Translation table | 11 |
GC content | 41% |
IMG OID | 640108091 |
Product | GumN |
Protein accession | YP_001039068 |
Protein GI | 125975158 |
COG category | [S] Function unknown |
COG ID | [COG3735] Uncharacterized protein conserved in bacteria |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 0 |
Plasmid unclonability p-value | 0.0000000184674 |
Plasmid hitchhiking | No |
Plasmid clonability | unclonable |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAAAAAA CAGCAATGCT GAAAACAATC GCAGCGGCAC TGATTATTGT ACTTTCGCTG CAAGTATTTG TGTTTGCCGA AGAGCAGCCG CAACTTCTTA TATCGCCTCC TGCTGAGCAG CCGGCCGCAT GGGCTGTGGA AGCCGTACAA TGGTCGTCAA TATATGGCCT TGCATCCGAT GAGATGTTTG CAAAATATTC ATCCAAAGTA ACTCAGGAAG AGCTGCATAA AGTTTGCGTA AACCTTTATG AAAAATTAAC AGGTAAAACT GCTACACCGG AAGAAGAAAA GATTTTTACG GATAACAGCA AACTTACAAC TCAAAAAGAA GCAACAAGGC TGGAAATGGT CACAAGCGTA TACAACGTGT TAAAAGCAGC ACAACCCGAG TTTGATTTCA GCGCCGATGT AAACCTCACT TTCAAAGATA TTGGGTCATT GTCCGAAGAA ACATTGAATA TTGTAAAGTA TTCCGTAGCA AAAGGGATAT TGCACGGAAG AAATAAAGAA ATCCTTGACC TTGAAAGCCA GTGTACAAGA CAGGAACTAT TAGTGTTTGT AAAAAATGCT TATGAATTTG CCATATATGA GTCGGGAAGA TATTCAAAAG GCGCCTTCTG GAAAGTAAGT GACGAGAACA ACACCGTTTA TCTTTTAGGT TCAATACATA TTGCGGACGC AACCCTGTAC CCGTTGTCAA AAGAGATACT GAACGCCTAT GAAAAATCCG ATGTTTTAGT TGTCGAAGCG GATATTTCAA AACAGCAAGA AGCCGCGAAT TATATGGCAC AAAGGGCTAT GTACGCAGAT GAAAACACTC TTGAGAAGAA TGTGCCTGAA GAGCTTTACA AAAAATTTGT GGAGTTTGTT ACTCCTTATG GTATTCAGGA GGAAGTATAC AGCAAGCTCA AGCCCTGGTA TGCAGCATTG CTGGTTCAAA ACCTGCAGCT TATGGACAAC TCATACAGCG GAAGCTTGGG AGTGGATATG TATTTCCTTT CAAAGGCAAT GGGCCAAAAA GACATATTGG AAATAGAAGG AATCAAGTTT CAGGTGGATA TGTTTGACTC CTTCTCAAAT GAGCTTCAAT GTCAATTTTT AGCTTCAGCT TTAGGCACCG GTGAAGGAAA TGAAAATACG GAAGCAAGTG TAGAGTTGGT TGCCTATATG TTAAAGTGCT GGAAAGAGGG CAATACAGAA GAACTTGCAA GGATAGTGAA AGCTGACGTT GAAGCTGAAG GAGAATTTAA AGAGTTTAAT GAAAAGATGT GGTCGTCAAG AGACAATAAC ATGGTTCAAA AGGTAAGAGA ATACCTTGCC GATCCGGAAA ACAAAACTTA TTTTGTAGTA GTCGGAGCGG GACATATGGT GGGAAGCACC GGAATTGTCA CACAATTGGA AGATGAATAC AAGGTGGAAC AAATCAAATG A
|
Protein sequence | MKKTAMLKTI AAALIIVLSL QVFVFAEEQP QLLISPPAEQ PAAWAVEAVQ WSSIYGLASD EMFAKYSSKV TQEELHKVCV NLYEKLTGKT ATPEEEKIFT DNSKLTTQKE ATRLEMVTSV YNVLKAAQPE FDFSADVNLT FKDIGSLSEE TLNIVKYSVA KGILHGRNKE ILDLESQCTR QELLVFVKNA YEFAIYESGR YSKGAFWKVS DENNTVYLLG SIHIADATLY PLSKEILNAY EKSDVLVVEA DISKQQEAAN YMAQRAMYAD ENTLEKNVPE ELYKKFVEFV TPYGIQEEVY SKLKPWYAAL LVQNLQLMDN SYSGSLGVDM YFLSKAMGQK DILEIEGIKF QVDMFDSFSN ELQCQFLASA LGTGEGNENT EASVELVAYM LKCWKEGNTE ELARIVKADV EAEGEFKEFN EKMWSSRDNN MVQKVREYLA DPENKTYFVV VGAGHMVGST GIVTQLEDEY KVEQIK
|
| |