Gene Athe_0109 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAthe_0109 
Symbol 
ID7408471 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAnaerocellum thermophilum DSM 6725 
KingdomBacteria 
Replicon accessionNC_012034 
Strand
Start bp132624 
End bp133883 
Gene Length1260 bp 
Protein Length419 aa 
Translation table11 
GC content34% 
IMG OID643714517 
Productprotein of unknown function DUF214 
Protein accessionYP_002572040 
Protein GI222528158 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG4591] ABC-type transport system, involved in lipoprotein release, permease component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones25 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAGTTTT CAGACATACT TTCTTTGATT GTGACAAACA TAAAAAGAAG AAAACTTCGT 
ACTGCGCTTA CCGTTATGGG AATATTTATT GGAAGTTTGG GACTTTTTGT TGTTGTGTCA
ATCAGTACAT CTTTTAAGGA TTATATAGTA AAAGGCATTT CAAGCTTGGG AAATGCTGAT
GTCATATATG TTATGCCTAA CACCAATGCA GGGTATACTC TTGAAAAGTT GAAAACTGAA
ATTCACGACA AAGACATAAA AAAACTTGAA AAACTCAGAC ATGTAAAGTT TGTCATTCCT
TTTTATTTTA CTAATGGAAA CCTGAAATTT AAGAAATTTG AAGGCACAGT AACACTTGTT
GCAACATCTG TCAAAGAATT TTCAAAAAAA TACACTCTGC AGTTTGGCAG GTTCCCGAAA
GATGATAATG AAAGCGGATG CATACTTGGT TATGGGATTG CAAAACTGAT TGCCAATCCT
TCTAAAGGAG GTTTTGCAGA TGAAAATGAG GTTAAAAAGC TTGTGAACAA GGCTATAAAG
ATTGAGAGCA AAAGAATCAA TCAGGCAGGT GAAGAAGAAA CAAAAGAGTT TTCATTTAAA
ATAAGAGGAA TTGCAAAGAG TGATTTTAAT TTTGATTCTT CTATAATTCT GCCAATGAAA
GCTATGGATA AGATTGAAGA CTGGAGATAT TCTCAGCAGG ATTTTATCAA AAAGACTGGA
TATACCTATA CATTTTTAGT TGTAGACAGT CCTTCGCACA TACCTGAGGT GGAAAAATTC
TTAGAAAGAG AAAAATACTA CTATACCTCA ATCAAAGAAC AGCAAGAGGT TATCGAAAAG
TTTTTAAATG CGGTAAAAAT CATAGTTGGC GGAATTGGAG CAATATCACT GGTTGTTGCA
GCTTTTGGTA TTGCAAATAC AATGATAATG GCAATTTTAG AGAGGCGAAA AGAAATTGGG
ATATTTAAAG TATTAGGTGC AAGTTCTAAA AACATCTTGC TTTTGTTTCT TTTTGAATCA
GGCTTTCTGG GTTTTTTGGG CGGTGTTTTT TCTGTAATAG CTGGATTTGC ATTGAATTTT
TTGATAGGTC TTGTGCTAAG GGCACGCTTC CCAGCCATAA ACGACTTTAG TATCGGTTTT
AACATTCCAC TTGCCTTGTT TGTTTTATGC ATTTCAACCC TAGTTGGCAT TATTGCCGGG
ATTTACCCTG CTAAAAAAGC AGTCTCGATT GAAGTAATCT CTGCATTGAA AGAAGAATAA
 
Protein sequence
MKFSDILSLI VTNIKRRKLR TALTVMGIFI GSLGLFVVVS ISTSFKDYIV KGISSLGNAD 
VIYVMPNTNA GYTLEKLKTE IHDKDIKKLE KLRHVKFVIP FYFTNGNLKF KKFEGTVTLV
ATSVKEFSKK YTLQFGRFPK DDNESGCILG YGIAKLIANP SKGGFADENE VKKLVNKAIK
IESKRINQAG EEETKEFSFK IRGIAKSDFN FDSSIILPMK AMDKIEDWRY SQQDFIKKTG
YTYTFLVVDS PSHIPEVEKF LEREKYYYTS IKEQQEVIEK FLNAVKIIVG GIGAISLVVA
AFGIANTMIM AILERRKEIG IFKVLGASSK NILLLFLFES GFLGFLGGVF SVIAGFALNF
LIGLVLRARF PAINDFSIGF NIPLALFVLC ISTLVGIIAG IYPAKKAVSI EVISALKEE