Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Athe_1823 |
Symbol | |
ID | 7408937 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Anaerocellum thermophilum DSM 6725 |
Kingdom | Bacteria |
Replicon accession | NC_012034 |
Strand | + |
Start bp | 1896036 |
End bp | 1897793 |
Gene Length | 1758 bp |
Protein Length | 585 aa |
Translation table | 11 |
GC content | 32% |
IMG OID | 643716200 |
Product | Fibronectin-binding A domain protein |
Protein accession | YP_002573689 |
Protein GI | 222529807 |
COG category | [K] Transcription |
COG ID | [COG1293] Predicted RNA-binding protein homologous to eukaryotic snRNP |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 30 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCCATTTG ACGGAGTTGT TCTAAGTGCT CTTAAAAAAG AATTAATTTT GGAGCTTGTA GATGGTAAAG TTGAAAGAAT ATATCAGCCA AATCAGTTTG AAGTCAATCT TTATGTTTAT AAGCTCGGAA AAACAAAAAA ACTCATTATC TCCGCAAATC CCTCTTTGCC AAGGATATAC ATCACAGAAA GGCAAAAGAA AAACCCAGAA GTTGCTCCAA ATTTTTGCAT GATTTTGCGC AAGAATTTGC TCGGAGCAAG GCTTGTCGGA ATTTATCAGC AAGGTTTAGA AAGAATTTTG CAAATAGAAT TTGAAACAAA AAGTGAACTT GGTGACACAG AAGTAAAGTA TCTTATATTT GAAATGATGG GGAGACACAG CAATATATTC TTGGTAGATT CCAACTATAA AATTATTGAT GCTATAAGAA GATTGTCATT CGAAGATTCA CCAAGACCAA TTTTACCCGG AGTCAAATAT ACATTGCCGC CAGTTTTGAC AAAGAAAAAT CCTATTGAAG TTTCGTTTGA TGAATTTATA TCATTTTTTA AATCCTCAAA TAAAAGTCCA GAAAATATAC TGACCGACAA TCTTTCAGGA ATTAGCAAAC AATTTGCTAA TGAAGTTATC TTGCGTGCAC AAGTTTTTGA AAAAAGTCTT GAAAATAAGG ATACAATTAA AAGGATTTTT GATTCTTTAA AAGAATTATT ATATTGTATA GTCGAAAAAG GGGAGATACT TCCAACACTC TATACTGAAA AAGGAAATGT AGTTGATTTT TATGTGATTG ACCTGAAATG TTTTTCTTCT TTTCCCAAAA AACATTTTTC AAATTTAAAT TTGTGTATAG ACGAATACTA TTTTAAAAAA GAGCAACATA CAGTATTTAT TGAAAAACGT CAACACCTTC AGAAGATTAT AGAACAAAAT GTAAAAAAGC TGAGTCAAAA ATATGATCAG AACATTCAAA AAATACAAGA GGCTAAAAAT GCTGAGGTGT ACAGAAAATA TGGTGACCTA ATTTTAGCAA ATCTTTACCA GCTCAGAGAA ACAAATGAGG ATTTTGTTGA GGTTATTGAT TATTACAGTG AAGATTTATC TACTATGAAG ATTCCGCTTG AAAAAGACAA AGATTTGAAA CAAAATGCCG AGAGGTATTA TAAGCTTTAC AATAAGCTCA AAAAAGCTGA AGAGTATGCT AAAAATGAAA TTGCTGAAAT TGAAAAAGAA ATTGAATTTC TGCAAAGTTT AGAAGCACTG CTTGAAAAAA GCCAAGAGAT AGAAGACCTT TTGAGTATAG AAGAAGAGTT AGAAAAAGAA GGTTATATCA AAACTCAGGT AGAAAACGTA GGTCAGCAAA AGAAAAAAGA AAATCAAAAA TCAAAACCTC ACCACTTTAT CAGCTCAGAT GGATTTGACA TATATGTGGG AAGAAACAAT CTGCAGAACG ATTTTCTCAC CATAAGATTT GCTTCAAGCC ATGACATCTG GCTTCACACC CAAAAGATTC CCGGCTCTCA TGTTATAATT CGAACAAACA ACAAAGAAGT CCCGCAAACA ACCTTGGTTG AAGCTGCACT TCTTGCAAGC TACTTTAGCA AAGCCAAGCA TTCAACAAAA GTGCCGGTTG ACTATACATT TGTAAAGTAT GTAAAAAAGC CACCTAAATC CAAGCCAGGT TTTGTTATAT ACGACAACTT TAAAACTATC ATTGTTGATT CACCTGAAAA TATTGATAAC TTCAACAAAG TTGAGTAA
|
Protein sequence | MPFDGVVLSA LKKELILELV DGKVERIYQP NQFEVNLYVY KLGKTKKLII SANPSLPRIY ITERQKKNPE VAPNFCMILR KNLLGARLVG IYQQGLERIL QIEFETKSEL GDTEVKYLIF EMMGRHSNIF LVDSNYKIID AIRRLSFEDS PRPILPGVKY TLPPVLTKKN PIEVSFDEFI SFFKSSNKSP ENILTDNLSG ISKQFANEVI LRAQVFEKSL ENKDTIKRIF DSLKELLYCI VEKGEILPTL YTEKGNVVDF YVIDLKCFSS FPKKHFSNLN LCIDEYYFKK EQHTVFIEKR QHLQKIIEQN VKKLSQKYDQ NIQKIQEAKN AEVYRKYGDL ILANLYQLRE TNEDFVEVID YYSEDLSTMK IPLEKDKDLK QNAERYYKLY NKLKKAEEYA KNEIAEIEKE IEFLQSLEAL LEKSQEIEDL LSIEEELEKE GYIKTQVENV GQQKKKENQK SKPHHFISSD GFDIYVGRNN LQNDFLTIRF ASSHDIWLHT QKIPGSHVII RTNNKEVPQT TLVEAALLAS YFSKAKHSTK VPVDYTFVKY VKKPPKSKPG FVIYDNFKTI IVDSPENIDN FNKVE
|
| |