Gene Athe_0661 details


Gene Information

Locus tag: Athe_0661
Symbol:
ID: 7407085
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Anaerocellum thermophilum DSM 6725
Kingdom: Bacteria
Replicon accession: NC_012034
Strand:
Start bp: 745774
End bp: 746925
Gene Length: 1152 bp
Protein Length: 383 aa
Translation table: 11
GC content: 38%
IMG OID: 643715042
Product: hypothetical protein
Protein accession: YP_002572558
Protein GI: 222528676
COG category: [S] Function unknown
COG ID: [COG3581] Uncharacterized protein conserved in bacteria
TIGRFAM ID:
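
The Start bp, End bp, Gene Length and Protein Length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below is illustrative only: it assumes the coordinates are 1-based and inclusive and that the final codon is a stop codon that is not counted in the protein length. The function names are hypothetical and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose last codon is a stop (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the terminal stop codon


# Values taken from the record above.
length_bp = gene_length(745774, 746925)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # 1152 383
```

The calculation does not depend on the strand, which is left blank in this record.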


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value: 0.030634
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAAAGTAT CATTTCCTTA CATGGGCTCT GCCATAGTGT ATAAAAAACT TTTTGAGTTT 
TTGGAACATG AAGTGATAAT GCCGCCAAAA CCAACACAGA GGACCATGGA CCTTGGCGTC
AAATACTCAC CAGAGTTTGC TTGCATTCCT CTTAAAATGG TCATGGGGAC ATATTTAGAG
GCCATTGAAA AAGGTGCAAA AGTGCTTGTG ACATCTGGTG GACATGGTCC ATGCAGAGCA
GGTTTTTATG GTGACACTCA CAGGAACATA TTAAAATCTC TTGGCTATGA CGATGTTGAG
CTTATAATCT TTGATGCTCC ACAGGATAAC TGGAGGGCAT TTTTAAGAAA CGTTCAAAAA
ATCAGAAATG GAGTTCCATG GCACAAGGTT ATAAACAGGA TGTACACCTT ATACAGATTT
GTCCAGAAGC TTGATGAGCT TGAAAAGATG GTTCAAAAAA TAAGACCATA TGAAGTCAAC
AAAGGTCAGA CAACTCAGGT TTGGAATCAA ATCCAAGAAA AGTTTGACAA AATAAAGACA
AGAAAAGAAC TGTATAGAGT TTATGAAGAG TGCAAGCAGA TGCTTCTTAG TATCCCAACA
AGAAAGGTTG ATGAAAAAGA CAGAATAAGA GTTGGGATTG TAGGCGAGAT TTATGTTGTG
ATGGAAAGCT CTATTAACTT TGGGATAGAA GAGATTTTGG GCAATCTTGG GGTTGAGGTA
GAAAGAAGCT TGTATCTTTT TGAGTGGATA AACGACAATC TGGTTCCATG GATTTTGAGA
CCAAAGAGGT TTAAAGAGAT TATAAAAAAG GGTCAAAGAT ATATCAAGAT TTTAATTGGT
GGTCATGCGG TTGAGACTGT GGGACATATT ATAGACTTTA AAGAGAGAGG ATTTGACGGA
ATTGTTCATC TTATGCCCTT TGCATGTTTG CCAGAACTTG TAACCCAGAG TTTAATTCCA
AAGATATCGA AAGAGATTGA TATTCCAATT CTGTCGCTTC CAATAGATGA GCAGACAGGA
AAGGCAAATA TGCTCACCAG GATAGAAGCT TTCATTGACC TTTTGAGAAA TAGGAAAAGA
GGAAAAACAA AAGAAGTCTT TATTGACAAC ATACAAGAAC ATGTTCAGGA AGAAAGGGTT
GTAATGGTAT GA
 
Protein sequence
MKVSFPYMGS AIVYKKLFEF LEHEVIMPPK PTQRTMDLGV KYSPEFACIP LKMVMGTYLE 
AIEKGAKVLV TSGGHGPCRA GFYGDTHRNI LKSLGYDDVE LIIFDAPQDN WRAFLRNVQK
IRNGVPWHKV INRMYTLYRF VQKLDELEKM VQKIRPYEVN KGQTTQVWNQ IQEKFDKIKT
RKELYRVYEE CKQMLLSIPT RKVDEKDRIR VGIVGEIYVV MESSINFGIE EILGNLGVEV
ERSLYLFEWI NDNLVPWILR PKRFKEIIKK GQRYIKILIG GHAVETVGHI IDFKERGFDG
IVHLMPFACL PELVTQSLIP KISKEIDIPI LSLPIDEQTG KANMLTRIEA FIDLLRNRKR
GKTKEVFIDN IQEHVQEERV VMV
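
The protein sequence corresponds to a codon-by-codon translation of the gene sequence. The following is a minimal sketch of that translation, not the tool that generated this page. It assumes the standard codon assignments, which translation table 11 shares with the standard code for elongation codons (the two tables differ in which start codons are permitted). `CODON_TABLE` and `translate` are hypothetical names chosen for illustration.

```python
from itertools import product

# Standard codon assignments in NCBI order (bases T, C, A, G; first base varies slowest).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(dna: str) -> str:
    """Translate a DNA string in frame 1, stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()  # remove the 10-base block spacing
    residues = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":  # stop codon ends the protein
            break
        residues.append(aa)
    return "".join(residues)


# First row of the gene sequence above (60 bases, 20 codons).
first_row = "ATGAAAGTAT CATTTCCTTA CATGGGCTCT GCCATAGTGT ATAAAAAACT TTTTGAGTTT"
print(translate(first_row))  # MKVSFPYMGSAIVYKKLFEF
```

The output matches the first 20 residues of the protein sequence, MKVSFPYMGS AIVYKKLFEF. Applied to the final 15 bases of the gene (GTTGTAATGGTATGA), the same function yields VVMV, the last four residues, and stops at the TGA codon.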