Gene Athe_0535 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAthe_0535 
Symbol 
ID7408660 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAnaerocellum thermophilum DSM 6725 
KingdomBacteria 
Replicon accessionNC_012034 
Strand
Start bp605538 
End bp606566 
Gene Length1029 bp 
Protein Length342 aa 
Translation table11 
GC content38% 
IMG OID643714917 
Product3D domain protein 
Protein accessionYP_002572434 
Protein GI222528552 
COG category[S] Function unknown 
COG ID[COG3583] Uncharacterized protein conserved in bacteria 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0000000418011 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAATAAAT TTAAGTGCCT TGTGGCAAAG CCAAGGGATA TGAAAAAGCT TATCCTGGCT 
TTTGTCATTG TATTTGTTTT GTCAGTCCTG CTTGGCGCCA TGACTGCACA GGCACTGGTG
AAAGAAGTTA GCATAACAAT TGACGGCAAG ACATTTTATT ATAAAACGAT TAAATCCACA
GTAAGAGAGG TTTTAGAAGA AAATCAAATT TACTTGACAA AAGATGACTA TGTCTCGCCT
TCTTTGGATT CAAAAATAAA TGAAAATACC CAGATAATAA TAAAAAGAGC TTTTGAAGTG
AAAATACTTG TTGGCGACGA GGAAAAAGTT GTATATATTC CAAGCGGTAC TGTTGAGGAT
GCTATCAAAA AAGCTGGAGT TGTTCTTGGA AAGTTGGACA AGATAAATCT TCCTCTCTCT
CAGCTTCTTG ATAAGTCAAC TGTCATTAAA ATTACTAAGG TGACAGAGAA GGTGGTCGTA
GAAAAACAAA AAATACCTTT CAGTACAGTG ACAAAAATAA ACTATAATAT GGACTACGGA
AAGCAAAAGG TTATCCAGCA AGGGCAGGAT GGTATTAAAG AAAGAAGATA CAAAGTTGTC
TTGGAAGATG GTAAAGAAGT TGAGAGAAAG TTGATTGAGG AAAGAGTTGT CAAAAATTCG
AAGCCGAGGA TTGTTGAAGT TGGAGCAATA AGGTGGTTCA AGACATCAAG AGGAGAAGTG
GTCAGATACA GAAAAGTTTA TACAATGATA GCAACTGCAT ATTCTTTGAC CCCAAGTGAT
ACAGGAAAAA GTCCATCTCA TCCTGATTAT GGCAGAACTG CAACAGGTCA CAAAGTAAAG
CGCGGGGTTG TTGCGGTTGA CCCGCGCGTG ATTCCGCTTG GAACAAGGCT TTATATAGAA
GGATATGGTT TTGCGAGAGC TCTTGATACA GGTTCTGCTA TCAAGGGAAA CAGGATAGAT
GTGTTTGTTG AAAAGGATGC GTATAAATTT GGTGTGCGGC GCGTAAAAGT TTATGTGCTT
GCAGACTAA
 
Protein sequence
MNKFKCLVAK PRDMKKLILA FVIVFVLSVL LGAMTAQALV KEVSITIDGK TFYYKTIKST 
VREVLEENQI YLTKDDYVSP SLDSKINENT QIIIKRAFEV KILVGDEEKV VYIPSGTVED
AIKKAGVVLG KLDKINLPLS QLLDKSTVIK ITKVTEKVVV EKQKIPFSTV TKINYNMDYG
KQKVIQQGQD GIKERRYKVV LEDGKEVERK LIEERVVKNS KPRIVEVGAI RWFKTSRGEV
VRYRKVYTMI ATAYSLTPSD TGKSPSHPDY GRTATGHKVK RGVVAVDPRV IPLGTRLYIE
GYGFARALDT GSAIKGNRID VFVEKDAYKF GVRRVKVYVL AD