Gene Athe_1823 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAthe_1823 
Symbol 
ID7408937 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAnaerocellum thermophilum DSM 6725 
KingdomBacteria 
Replicon accessionNC_012034 
Strand
Start bp1896036 
End bp1897793 
Gene Length1758 bp 
Protein Length585 aa 
Translation table11 
GC content32% 
IMG OID643716200 
ProductFibronectin-binding A domain protein 
Protein accessionYP_002573689 
Protein GI222529807 
COG category[K] Transcription 
COG ID[COG1293] Predicted RNA-binding protein homologous to eukaryotic snRNP 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones30 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCCATTTG ACGGAGTTGT TCTAAGTGCT CTTAAAAAAG AATTAATTTT GGAGCTTGTA 
GATGGTAAAG TTGAAAGAAT ATATCAGCCA AATCAGTTTG AAGTCAATCT TTATGTTTAT
AAGCTCGGAA AAACAAAAAA ACTCATTATC TCCGCAAATC CCTCTTTGCC AAGGATATAC
ATCACAGAAA GGCAAAAGAA AAACCCAGAA GTTGCTCCAA ATTTTTGCAT GATTTTGCGC
AAGAATTTGC TCGGAGCAAG GCTTGTCGGA ATTTATCAGC AAGGTTTAGA AAGAATTTTG
CAAATAGAAT TTGAAACAAA AAGTGAACTT GGTGACACAG AAGTAAAGTA TCTTATATTT
GAAATGATGG GGAGACACAG CAATATATTC TTGGTAGATT CCAACTATAA AATTATTGAT
GCTATAAGAA GATTGTCATT CGAAGATTCA CCAAGACCAA TTTTACCCGG AGTCAAATAT
ACATTGCCGC CAGTTTTGAC AAAGAAAAAT CCTATTGAAG TTTCGTTTGA TGAATTTATA
TCATTTTTTA AATCCTCAAA TAAAAGTCCA GAAAATATAC TGACCGACAA TCTTTCAGGA
ATTAGCAAAC AATTTGCTAA TGAAGTTATC TTGCGTGCAC AAGTTTTTGA AAAAAGTCTT
GAAAATAAGG ATACAATTAA AAGGATTTTT GATTCTTTAA AAGAATTATT ATATTGTATA
GTCGAAAAAG GGGAGATACT TCCAACACTC TATACTGAAA AAGGAAATGT AGTTGATTTT
TATGTGATTG ACCTGAAATG TTTTTCTTCT TTTCCCAAAA AACATTTTTC AAATTTAAAT
TTGTGTATAG ACGAATACTA TTTTAAAAAA GAGCAACATA CAGTATTTAT TGAAAAACGT
CAACACCTTC AGAAGATTAT AGAACAAAAT GTAAAAAAGC TGAGTCAAAA ATATGATCAG
AACATTCAAA AAATACAAGA GGCTAAAAAT GCTGAGGTGT ACAGAAAATA TGGTGACCTA
ATTTTAGCAA ATCTTTACCA GCTCAGAGAA ACAAATGAGG ATTTTGTTGA GGTTATTGAT
TATTACAGTG AAGATTTATC TACTATGAAG ATTCCGCTTG AAAAAGACAA AGATTTGAAA
CAAAATGCCG AGAGGTATTA TAAGCTTTAC AATAAGCTCA AAAAAGCTGA AGAGTATGCT
AAAAATGAAA TTGCTGAAAT TGAAAAAGAA ATTGAATTTC TGCAAAGTTT AGAAGCACTG
CTTGAAAAAA GCCAAGAGAT AGAAGACCTT TTGAGTATAG AAGAAGAGTT AGAAAAAGAA
GGTTATATCA AAACTCAGGT AGAAAACGTA GGTCAGCAAA AGAAAAAAGA AAATCAAAAA
TCAAAACCTC ACCACTTTAT CAGCTCAGAT GGATTTGACA TATATGTGGG AAGAAACAAT
CTGCAGAACG ATTTTCTCAC CATAAGATTT GCTTCAAGCC ATGACATCTG GCTTCACACC
CAAAAGATTC CCGGCTCTCA TGTTATAATT CGAACAAACA ACAAAGAAGT CCCGCAAACA
ACCTTGGTTG AAGCTGCACT TCTTGCAAGC TACTTTAGCA AAGCCAAGCA TTCAACAAAA
GTGCCGGTTG ACTATACATT TGTAAAGTAT GTAAAAAAGC CACCTAAATC CAAGCCAGGT
TTTGTTATAT ACGACAACTT TAAAACTATC ATTGTTGATT CACCTGAAAA TATTGATAAC
TTCAACAAAG TTGAGTAA
 
Protein sequence
MPFDGVVLSA LKKELILELV DGKVERIYQP NQFEVNLYVY KLGKTKKLII SANPSLPRIY 
ITERQKKNPE VAPNFCMILR KNLLGARLVG IYQQGLERIL QIEFETKSEL GDTEVKYLIF
EMMGRHSNIF LVDSNYKIID AIRRLSFEDS PRPILPGVKY TLPPVLTKKN PIEVSFDEFI
SFFKSSNKSP ENILTDNLSG ISKQFANEVI LRAQVFEKSL ENKDTIKRIF DSLKELLYCI
VEKGEILPTL YTEKGNVVDF YVIDLKCFSS FPKKHFSNLN LCIDEYYFKK EQHTVFIEKR
QHLQKIIEQN VKKLSQKYDQ NIQKIQEAKN AEVYRKYGDL ILANLYQLRE TNEDFVEVID
YYSEDLSTMK IPLEKDKDLK QNAERYYKLY NKLKKAEEYA KNEIAEIEKE IEFLQSLEAL
LEKSQEIEDL LSIEEELEKE GYIKTQVENV GQQKKKENQK SKPHHFISSD GFDIYVGRNN
LQNDFLTIRF ASSHDIWLHT QKIPGSHVII RTNNKEVPQT TLVEAALLAS YFSKAKHSTK
VPVDYTFVKY VKKPPKSKPG FVIYDNFKTI IVDSPENIDN FNKVE