Gene Athe_2215 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAthe_2215 
Symbol 
ID7408412 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAnaerocellum thermophilum DSM 6725 
KingdomBacteria 
Replicon accessionNC_012034 
Strand
Start bp2345364 
End bp2346356 
Gene Length993 bp 
Protein Length330 aa 
Translation table11 
GC content36% 
IMG OID643716583 
ProductApbE family lipoprotein 
Protein accessionYP_002574062 
Protein GI222530180 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG1477] Membrane-associated lipoprotein involved in thiamine biosynthesis 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.0332053 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGACACAG CCTTGATTTC AAATGAGACA AAGATTATAA AATCAATGTT TGCACTCGGC 
ACAGACATTC ATTTTATTTT TTACCAATCA AACTTTGAAA GTGCACTTGA CAGGGCCCAC
AGTCTCATTT TAGATATGGA AAATAAATTG TCGGTTTTCA AGCCAAAAAG TTTAGTAGCA
AAATTAAATA GATACGGAAA TTACATCCCC ATAAAGGTTT GCCCGGAGGT TTATGAGCTT
ATAAAAAAGT CGGTTGAGTA CAGCCTATTT TCAGAAGGTT ATTTTGATAT AACGGTAAAA
AGACTTATAG ATATGTGGAA AGAAGCAAAA CAAAAAAATA AGATGCCGTC AAAAGAAGAA
ATAGAACTTG CTCTCACTTT TTCAGGCTCA GAAAACATAC AGCTTTTATC AAACTATAGA
GTAAAGCTCA AAAACAAAGT CAAACTTGAC TTTGGAGCTA TTGCCAAAGG CTTTCTTGCA
GACAAAATAC GTGAGATTTT TGAACAGGAA GGTATAAATT CAGCAATTGT CGACCTTGGC
GGGCATATAC TGACAGTTGG GAAAAAACAT GATGAGAGCC TTTGGAAGGT GGGAATTCGG
CATCCTTTTA AAACAAGAGA AGATGTGCTG GGTTTTTTAG AGCTTGGTAG TACTTCAGTT
GTAACATCCG CAAGTTATGA AAGGTATTTT ACAATTGATG GCAAAAAACT TTCACACATA
ATCAATCCAA AAACAGGATT TCCTGTAAAA GATGACATTG CAAGTATAAC CGTTGTTGAC
ACAAACTCAA CATTTGCAGA TGCGATGTCA ACTGCCCTTT TTGCAATGGG ATTTAAAAAG
GCCATAAATT TCATACAGGA CAGCAAGACT ATTGAAGCAG TGGTTGCTAC TTCTTTTCGA
GAAATATATA TAACACCAGG GCTTGCACAA AGGTTTACCC TGTGTGATAG CTCTTTCAGA
ATTATTAAGA CAAATGAGGT GATTGTTCTG TGA
 
Protein sequence
MDTALISNET KIIKSMFALG TDIHFIFYQS NFESALDRAH SLILDMENKL SVFKPKSLVA 
KLNRYGNYIP IKVCPEVYEL IKKSVEYSLF SEGYFDITVK RLIDMWKEAK QKNKMPSKEE
IELALTFSGS ENIQLLSNYR VKLKNKVKLD FGAIAKGFLA DKIREIFEQE GINSAIVDLG
GHILTVGKKH DESLWKVGIR HPFKTREDVL GFLELGSTSV VTSASYERYF TIDGKKLSHI
INPKTGFPVK DDIASITVVD TNSTFADAMS TALFAMGFKK AINFIQDSKT IEAVVATSFR
EIYITPGLAQ RFTLCDSSFR IIKTNEVIVL