Gene Athe_0449 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAthe_0449 
Symbol 
ID7407526 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAnaerocellum thermophilum DSM 6725 
KingdomBacteria 
Replicon accessionNC_012034 
Strand
Start bp511043 
End bp512188 
Gene Length1146 bp 
Protein Length381 aa 
Translation table11 
GC content33% 
IMG OID643714836 
Productprotein of unknown function DUF58 
Protein accessionYP_002572354 
Protein GI222528472 
COG category[R] General function prediction only 
COG ID[COG1721] Uncharacterized conserved protein (some members contain a von Willebrand factor type A (vWA) domain) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones27 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGAAACCT TTTGGCTGTT TATAATATGC TCTTTATTAT TTGCTCTAAA TTTTTTCGTT 
ACTAAAAAAT TTGGACTAAA AAACGTTGAG TATGAAATCT ACTTTGAAGA AAATAAAAAA
ACTGAAGGTG ATGAGATTCA TATTGTTGAA AGAATTTACA ACGGTAAAGC TCTGCCACTT
CCATGGGTAA AATCCGAATT TGAAGTATCA GCATCATTTT TCATGGAAAA TGCAAAAAAT
TATGTAGTAG GAAATAAGCT AAGGTACATC AGCATTTTCT TTCTTCTACC TTACCAGCAA
ATAGTAAGGC GGCACAGATT TGTTGCAACA AAAAGAGGAT TTTATAAACT TGATAAAATA
TATCTTGTCA CTGGTGACCT TTTCGGTCTT TCAATGGATG ACAGGTGTTA CTATGTAAAT
TCCAATATTA CCATTTACCC AGCATTTTTG GACCTGAAAA AACACCTTCT GCCCCGTTCA
AGCCTCTCAG GCGAAGTTGT GATAAAAAGA CATTATTATG AAGATATATT TCACTTTGCA
GGAATAAGAG AGTATCAGTC TTTTGATTCT TTCAATAGAA TAAACTGGAA CGCAACTGCA
AAGTATAATA CTTTGATGGT AAACAAGTAC GAATACACCT CATCAGGTGA TGCTTTAATA
CTTTTGAATG TCCAAAGTTC AGAGTATGAA AGAAAAGAGG TTTTTAACAA AAACGCAATC
GAACTTGGAA TAAAGATTGC AGCAAGCCTG ACAAAAGAAT GCTTAGATAA TCACATTCCA
GTTGGTTTTG TTTGCAATGG CATAGACGAA GAAACTCTTG AGCCGCTTGA AATCTTGCTG
CCATCACAAG ATTCAAATCA GCTTTTAAAA ATTCTCGAAA CACTTGCACA CATTAAAATT
CAGGTAAACG AATACTTTGA AGCTTTGCTT TATCAAGTTT TAAGAAGTTA CAACTTTCGT
GAGCTTTTTA TAATAACTTC TTTTGTTAAC AAGGAGATGG AAGACTCTAT CCTTCTTTAT
TCCTCACTCG GAGTTAAGTT TACTATTATT CTTCTTGAAT ATGATGAAAA ACCTTTCAAA
TTAGAATCAG AAAATGTAAG AATTTTTCTG GCAAAACAGC ATCTTTTAGA AAACGTTAGA
ACTTGA
 
Protein sequence
METFWLFIIC SLLFALNFFV TKKFGLKNVE YEIYFEENKK TEGDEIHIVE RIYNGKALPL 
PWVKSEFEVS ASFFMENAKN YVVGNKLRYI SIFFLLPYQQ IVRRHRFVAT KRGFYKLDKI
YLVTGDLFGL SMDDRCYYVN SNITIYPAFL DLKKHLLPRS SLSGEVVIKR HYYEDIFHFA
GIREYQSFDS FNRINWNATA KYNTLMVNKY EYTSSGDALI LLNVQSSEYE RKEVFNKNAI
ELGIKIAASL TKECLDNHIP VGFVCNGIDE ETLEPLEILL PSQDSNQLLK ILETLAHIKI
QVNEYFEALL YQVLRSYNFR ELFIITSFVN KEMEDSILLY SSLGVKFTII LLEYDEKPFK
LESENVRIFL AKQHLLENVR T