Gene Athe_0517 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAthe_0517 
Symbol 
ID7408641 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAnaerocellum thermophilum DSM 6725 
KingdomBacteria 
Replicon accessionNC_012034 
Strand
Start bp584660 
End bp585898 
Gene Length1239 bp 
Protein Length412 aa 
Translation table11 
GC content37% 
IMG OID643714899 
Productprotein of unknown function DUF195 
Protein accessionYP_002572416 
Protein GI222528534 
COG category[S] Function unknown 
COG ID[COG1322] Uncharacterized protein conserved in bacteria 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.0973861 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
TTGGAAGTAG TTTTGCTAAT TGTTGCTATT GTTCTTGTCA TATCAAACTT GATTTTGTTA 
ATAAGGCTTA AAAATAATAT AAATTCTTCT TTGGATACTC AAAATAAACT GTTAGAGATT
GAAAAAGAAC TTGAACAAAT TCAAAATTCT ATCTCACAGC AATTTTCTCA GAATAAAAAT
GAAATGCAAA ATATAATAAG CTCATTTGGC AGCATTTTAA TGACAAGATT TTCAGATCTA
TCCAATCAGA TAATAAATTT TACATCATCA AGTCAGGAAA GGCTTGACAG TATCCGAAAA
GAGATAGATA GTAAGCTTGA GAAAATACGA GAGACTGTTG ACAGCCAGCT ACAAAGCACA
TTAGAGACAA AACTTTCGCA GTCTTTCAAG CTTGTATCAG AGCGTCTGGA GCTTGTCCAC
AGAGGGCTTG GTGAGATGCA GGCCCTGGCC GGAAGTGTTG GAGACCTTAA AAAGATTTTG
AGCAATGTAA AGGTTCGTGG AACACTTGGT GAGATTCAGC TTGGCAATAT CATAGACCAG
ATTTTGGATC AATCACAGTA CGAAAGAAAT GTCAGGATAA AACCGCACAC TCAAGAGCAA
GTTGAGTTTG CAATAAAGAT TCCTTCTAAA AATTCAAAAG ATAATGAATT TATATACCTT
CCAATAGACT CCAAATTCCC CATAGAAAGT TATCAGCGGC TTATTGAGGC GCAGGAGAAA
GCGGAGACAG AAGAAGTTGC AAGATTTTCG AAGGAGCTTG AAAATAGTAT AAGACAGAAT
GCAAAGACTA TAAAGGAAAA GTACATAGAC CCGCCTAAAA CAACAGATTT TGCTATCATG
TTTTTGCCCT CTGAAGGGCT TTATGCAGAG GTGCTGAAGA TACCCGGGCT GTTTGAGTCT
GTGCAAAGGG AATACAAGGT AATTATTGCA GGACCTACAA CAGTTGTTGC AATGCTCAAC
ACCATTTCGC TTGGATTTAA AACTTTTGCT ATTGAAAAGA GAACAAATGA GATCTGGGAG
CTTTTGTCTG CCGTCAAGAC TGAGTTTTCA AGGTTTGCTG AGATTCTTGA AAAGGTTAAA
AAGAAGCTTT CTGAAGCGCA GGATACAATT GACACTGCAA CAAGAAAGAC AAGAACTATA
GAAAGAAAGC TTAAAAGTGT TGAGACCCTC TCTTCAGAAA AAGATATAGA TATGATTCTT
TATGATGAGG AAGCTATTGA AGAAGGTTCA GGGAAATAA
 
Protein sequence
MEVVLLIVAI VLVISNLILL IRLKNNINSS LDTQNKLLEI EKELEQIQNS ISQQFSQNKN 
EMQNIISSFG SILMTRFSDL SNQIINFTSS SQERLDSIRK EIDSKLEKIR ETVDSQLQST
LETKLSQSFK LVSERLELVH RGLGEMQALA GSVGDLKKIL SNVKVRGTLG EIQLGNIIDQ
ILDQSQYERN VRIKPHTQEQ VEFAIKIPSK NSKDNEFIYL PIDSKFPIES YQRLIEAQEK
AETEEVARFS KELENSIRQN AKTIKEKYID PPKTTDFAIM FLPSEGLYAE VLKIPGLFES
VQREYKVIIA GPTTVVAMLN TISLGFKTFA IEKRTNEIWE LLSAVKTEFS RFAEILEKVK
KKLSEAQDTI DTATRKTRTI ERKLKSVETL SSEKDIDMIL YDEEAIEEGS GK