Gene Athe_0427 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAthe_0427 
Symbol 
ID7407504 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAnaerocellum thermophilum DSM 6725 
KingdomBacteria 
Replicon accessionNC_012034 
Strand
Start bp487094 
End bp488209 
Gene Length1116 bp 
Protein Length371 aa 
Translation table11 
GC content35% 
IMG OID643714814 
Productsporulation integral membrane protein YtvI 
Protein accessionYP_002572332 
Protein GI222528450 
COG category[R] General function prediction only 
COG ID[COG0628] Predicted permease 
TIGRFAM ID[TIGR02872] sporulation integral membrane protein YtvI 


Plasmid Coverage information

Num covering plasmid clones37 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGATGAGGG AGTTTACAAA AAACCGGTTT GTGGTTGCTA TGATTTATGT TATCATAATT 
AGCGCTTTTA TCTTTTCATG TGCATATTTA ATCAAGACAT TTGACCTTTT TGTGAGATTT
ATGATAAAAG CATTCATTCC AGTTATAATA GGACTCTTTA TTGCAGTTGT GTTTGAACCG
CTTTTAAAAT ATATGGAAAA GAATAAAGTA AACAGGACAA TCTCTGCAAT ACTTATACTT
ATTGTCCTCA ACATAATACT TGGTTTTATG CTTGCAGAGG GCATATATAT ACTTGTGAAT
GAGTGCATGA GATTAGTTGC GAGTCTTCAA AATATTGATT ATGATAAAAT TTATCAGGCT
TTAGACAAGC TCTTTTCAAA TGTAAAAAAT ATATACTCAG GATTGCCGGG GCCTATTGTA
AATTTTATTC AATCAGGTGT TGACGAACTT ACAAATGTAC TTAACCAAAT TGCCACAATG
AGTTTAAAGG TAATAAAGGT TATACCAGCA ACCTTAAAAG GTGTCACGGT GTGGTTTTTT
TCTGTTCTGT CAGCCTTTTT CTTTATGCGC GACAGACACA AAATGAGGTC ATGGCTAATT
CAAAACTTTT CAGTCCAGAT TTACAAAGAA CTTTCATCAA TTGCTTTTAA AGTTATAGAC
TCTGTTGTAG ACTATGCAAA GTCTCAGATA ATTCTGGCAA TTTTAATGTT TCTCTCAGGG
CTTGTAGGGC TTTCTATAAT AAAGGCGCCT TACTTTTTGG TGGTAAGTCT TCTTCTGGGG
CTTATGAGCA TAATTCCAAT CATAGGTTCA GGCATAATAT TGCTTCCGTG GATTGCAGGC
AGTTTTATAG CTGGGGACAC TAATTTTGGA ATAAAACTTT TGATTGTATA TCTTATAATT
TTAGGCATTC GTGAATTTGC CTCTATCAAG ATTGTTGCGA ACCAGGTGGG GATTTCGACC
TTTACAACAC TTGTTTCTAT CTATGCAGGT GTTGAAGTGT TTGGAGCGTG GGGGTTTGTG
ATAGGTCCGC TTTTGGTTGT GTTTTTGAAA GCAGTGTATG AGACAGGTGC AATTAAAAAG
ATAAGAGAAA ATCTCTTTTT GACAAAAAAG GAGTGA
 
Protein sequence
MMREFTKNRF VVAMIYVIII SAFIFSCAYL IKTFDLFVRF MIKAFIPVII GLFIAVVFEP 
LLKYMEKNKV NRTISAILIL IVLNIILGFM LAEGIYILVN ECMRLVASLQ NIDYDKIYQA
LDKLFSNVKN IYSGLPGPIV NFIQSGVDEL TNVLNQIATM SLKVIKVIPA TLKGVTVWFF
SVLSAFFFMR DRHKMRSWLI QNFSVQIYKE LSSIAFKVID SVVDYAKSQI ILAILMFLSG
LVGLSIIKAP YFLVVSLLLG LMSIIPIIGS GIILLPWIAG SFIAGDTNFG IKLLIVYLII
LGIREFASIK IVANQVGIST FTTLVSIYAG VEVFGAWGFV IGPLLVVFLK AVYETGAIKK
IRENLFLTKK E