Gene Information |
Locus tag | Teth514_1237 |
Symbol | |
ID | 5877886 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thermoanaerobacter sp. X514 |
Kingdom | Bacteria |
Replicon accession | NC_010320 |
Strand | + |
Start bp | 1278305 |
End bp | 1280650 |
Gene Length | 2346 bp |
Protein Length | 781 aa |
Translation table | 11 |
GC content | 41% |
IMG OID | 641541587 |
Product | virulence-associated E family protein |
Protein accession | YP_001662867 |
Protein GI | 167039882 |
COG category | [R] General function prediction only |
COG ID | [COG5545] Predicted P-loop ATPase and inactivated derivatives |
TIGRFAM ID | |
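The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the last codon of the CDS is a stop codon that is not counted in the protein length; the variable names are illustrative and are not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 1278305
end_bp = 1280650
gene_length_bp = 2346
protein_length_aa = 781

# Assumption: coordinates are 1-based and inclusive, so span = end - start + 1.
span_bp = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not part of the protein.
codons = gene_length_bp // 3
expected_protein_aa = codons - 1

print("span matches gene length:", span_bp == gene_length_bp)
print("length is a multiple of 3:", gene_length_bp % 3 == 0)
print("protein length matches:", expected_protein_aa == protein_length_aa)
```

Under these assumptions all three checks agree with the listed values: 1280650 - 1278305 + 1 = 2346 bp, and 2346 / 3 - 1 = 781 aa.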
|
|
Plasmid Coverage information |
Num covering plasmid clones | 2 |
Plasmid unclonability p-value | 0.0000496901 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAGATAG CGGTTGGTAA CAGCCGAATG GATAAAAAGT GGAAGAACAA TGATATCACA TGGGAGGACT TTATCTCACG AGTTAAATCT ACCATACGAA CAACAGAAAC GGTATCTGAA TTTCGGAAAA TGAGTCGCGC TCAACAGGAC TCAATAAAAG ATGTAGGAGG ATTTGTGGGA GGAGCCCTTC GTGAAGGAAA ACGCAGAAAT GGTTATGTCC TCTCCCGTTC CCTTCTGACT TTAGATATGG ATTATGCCAA ACCAGAGGTT TGGGAGCAAA TTGAAGCGTT ACATGATTTC AAATGCTGCA TCTATTCTAC CCATAAACAT ACACCAGATG CACCAAGATT AAGACTTATT ATTCCACTTA AAAGAGAAGT AACAGAGGAT GAATACCCAG CCCTTGGTCG GATGGTTGCA AAGGAGATTG GGATTGATTT ATTCGATGAC ACCACTTATG AACCTTCAAG ACTAATGTAT TGGCCTTCTA CACCGTCAGA TGGAGAGTTT GTATTTAAAG AGAAAGACGG TGAACTATTG GACCCAGATA TCTATCTATC AAAATATGTA GACTGGCGGG ACACTTCAAT GTGGCCAGTC TCAAGTCGAC AATCTGAGGT TGTACAAAGG AAAATAACTA AACAAGCAGA TCCATTAATT AAAGAAGGGG TTATAGGAGC ATTTTGCAGA GCCTATAACA TTGAAGAAGC CATCGAAGAA TTTCTACCTG ATGTATATGA ACCTAGTACC ATGAATGGAC GATTTGATTA TATTCCAGCA GATTCTTCAG CAGGCTTGGT AATCTATGAT GGTAAATTTG CTTATAGCCA TCATGCTACC GATCCAGCAT GTGGAATGTT GCTAAACGCA TTTGATTTGG TCCGAGTGCA TAAGTTTCGG GAACTAGATG AAAAGGTGAC AGAAAATACA CCGCCAAGTA AACTACCTTC ATTTAGAGCC ATGACAGATT TGGCATTAGA GGATGAACGG GTAAAAGAGC AGTTTGCTGA AGAACGAAAG GCTCAGGTTG AAAAAGAGTT TATTGATGAA GATTGGGAAA AGCAGCTGGA GATTGATAAG ACAGGAACGG TTAAAAATAC TCTTAGAAAT TTGATTTTGA TACTTGAAAA TGATCCCAAC CTAAAAAGCA TAGTGTTTAA TCAGCTATCT GATAGTCTCG AAATTAAAGG TGATGTTCCT TGGCCGCATC CATCGAAGTT CTGGAGAGAT GCAGATGATG CACAGCTAAT CAGCTACATT GACACCCACT ACGGAACCTT TTCTGCAAGA AACTATGATG TAGCGGTAGC GAAGGTCGCT GACGATAGAT CTTATCATCC GATTCGGGAA TTTATTGAGG CTCTTCCTGA GTGGGATAAG GTAGCTAGAG TAGATACCTT GCTAATCGAT TATCTAGGTG CATCAGATAA TCCATATGTT CGAGCTGTGA CAAGAAAAAC TTTATGTGCA GCGATTTCTC GTGTACTGAC TCCAGGCATC AAGTTTGATT CCATGTTGGT TTTAAATGGC CCGCAGGGAG TTGGAAAAAG TACTCTTATA GCCAAGTTAG GTGGAGACTG GTTTTCTGAT AGTTTGAGCT TGTCAGATAC CAAGGATAAG ACCGCCGCAG AAAAGTTACA AGGTTACTGG ATTTTAGAAA TTGGAGAGCT GGCTGGACTA AAAAAAGCAG AAGTAGAAAC ACTTAGAAGC TTCTTATCTC GCCAGAATGA TATTTATCGA GCTAGCTTTG GCAAGAGAGC TACTCCTCAC TTAAGACAAT GTGTCTTTTT TGGCACCACT 
AATGCTGAAA AAGGCTATTT ACGGGATACC ACAGGAAACC GTCGTTTTTG GCCGGTAAAG ACCCCGGGAA ATGGCACAAA AAAATCTTGG CAGCTAAAGC AGGATGAAAT TCTGCAGATA TGGGCCGAGG CTCTTACCTA TGTAAAGGCT GGAGAGAAAT TGTACCTTGA TGCTAGCCTT GAAAAGCTTG CAAAAGAAGA ACAGCGAGAA GCTATGGAGT CCGATGAGCG AGAAGGTTTG GTTAGAGAGT ACCTTGATAT GCTTTTACCT GAGGATTGGG ACACCATGGA TCTATATGAA CGTCGAGCCT ATATCAACGG GACTGAGTTT GGTGAAAGCC AAAGGGTTGG TGTTTGGAAA CGAAAATCTG TTTCTAATAT GGAAATTTGG TGTGAATGCT TTGGAAAGGA TCGAGCCAAC CTTCGAAGAG TAGATGGTAA TGAAATATCA GCTATTATGG CGAGTATTGG AGGATGGACA GGTCTCGTTA AAAAAGAACG TATCCCGCTT TATGGACCAC AGTGGGTTTA TGTTCCAAAA GAGTAA
|
Protein sequence | MKIAVGNSRM DKKWKNNDIT WEDFISRVKS TIRTTETVSE FRKMSRAQQD SIKDVGGFVG GALREGKRRN GYVLSRSLLT LDMDYAKPEV WEQIEALHDF KCCIYSTHKH TPDAPRLRLI IPLKREVTED EYPALGRMVA KEIGIDLFDD TTYEPSRLMY WPSTPSDGEF VFKEKDGELL DPDIYLSKYV DWRDTSMWPV SSRQSEVVQR KITKQADPLI KEGVIGAFCR AYNIEEAIEE FLPDVYEPST MNGRFDYIPA DSSAGLVIYD GKFAYSHHAT DPACGMLLNA FDLVRVHKFR ELDEKVTENT PPSKLPSFRA MTDLALEDER VKEQFAEERK AQVEKEFIDE DWEKQLEIDK TGTVKNTLRN LILILENDPN LKSIVFNQLS DSLEIKGDVP WPHPSKFWRD ADDAQLISYI DTHYGTFSAR NYDVAVAKVA DDRSYHPIRE FIEALPEWDK VARVDTLLID YLGASDNPYV RAVTRKTLCA AISRVLTPGI KFDSMLVLNG PQGVGKSTLI AKLGGDWFSD SLSLSDTKDK TAAEKLQGYW ILEIGELAGL KKAEVETLRS FLSRQNDIYR ASFGKRATPH LRQCVFFGTT NAEKGYLRDT TGNRRFWPVK TPGNGTKKSW QLKQDEILQI WAEALTYVKA GEKLYLDASL EKLAKEEQRE AMESDEREGL VREYLDMLLP EDWDTMDLYE RRAYINGTEF GESQRVGVWK RKSVSNMEIW CECFGKDRAN LRRVDGNEIS AIMASIGGWT GLVKKERIPL YGPQWVYVPK E
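The relationship between the gene sequence and the protein sequence can be illustrated with a short translation sketch. It assumes the standard codon assignments, which translation table 11 shares for ordinary (non-start) codons; the `translate` helper and the table construction are illustrative and are not how the database itself produced the protein sequence. Only the first and last few codons of the gene sequence above are used as input.

```python
from itertools import product

# Codon table in the conventional TCAG order (first base varies slowest).
# Assumption: standard codon assignments, as used by translation table 11
# for all internal codons.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}


def translate(dna: str) -> str:
    """Translate a DNA string codon by codon, stopping at the first stop codon.

    Whitespace (such as the 10-base grouping used in the record) is ignored.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        amino = CODON_TABLE[seq[i:i + 3]]
        if amino == "*":  # stop codon: end of the coding sequence
            break
        protein.append(amino)
    return "".join(protein)


# First 30 bases of the gene sequence, copied from the record.
first_30 = "ATGAAGATAG CGGTTGGTAA CAGCCGAATG"
print(translate(first_30))  # MKIAVGNSRM

# Last 6 bases of the gene sequence: one codon followed by the stop codon.
print(translate("GAGTAA"))  # E
```

The first ten residues and the final residue produced this way match the start (`MKIAVGNSRM`) and end (`E`) of the protein sequence listed above.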