Gene Athe_0389 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAthe_0389 
Symbol 
ID7409319 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAnaerocellum thermophilum DSM 6725 
KingdomBacteria 
Replicon accessionNC_012034 
Strand
Start bp441119 
End bp442330 
Gene Length1212 bp 
Protein Length403 aa 
Translation table11 
GC content32% 
IMG OID643714773 
Producthypothetical protein 
Protein accessionYP_002572296 
Protein GI222528414 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG1807] 4-amino-4-deoxy-L-arabinose transferase and related glycosyltransferases of PMT family 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00000000244478 
Plasmid hitchhikingNo 
Plasmid clonabilityunclonable 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCAAAATA AAGTAGGAAG ACTTTACACA TTTTTTAGAT ACATTGTTGT AGTGCTTTTA 
GGATTAGCCA TTGTTTTGTC ATTTCAGTTT CGGATAGTTG CTCTTACACA TAAGAAAATA
AGTTTTGTTG TGATGGTTAT TCTTCTTTTG TCAGCTTTTT TAATCTTTAA TATTTGGCTT
TTTATATTTC AAAAACTGGC AAACAAAAAA CTCGCTATAC TCTTGTTAAT TCTTGTTATA
GCAGCTCCAA GACTTATATG GATTTTTTTG ATGCCAACAA AGCCAGTTTC GGACTATTTA
TGTTTTTATT CCTATGCCCA AAAAGCTTCT CAAGGCTTTT TAAAAGGGTA TGACATGACT
TTTACACTCT TTAGGTTCAG ATTTGGATAT TCATTATTTT TGGCACTTGT TTTCAAGATT
TTTGGAAGCA GCATAATTGT AGGAAAACTT TTCAATGTGT TTCTTTCAGT AGTTTTAGGA
CTTATCATAT ATTTTACTGT TGACTATCTT TTTGGCAAAG AAGCAGCCAC ATATTCAGCA
ATTTTGTATG CCTTTTGGCC ATCGCAGATA ATGTATAATT CAGTTTTGGC GTCTGAACAC
CCATTTATTG TGTTTTTTGT GCTGGGGCTG TATTTTTTGT TAAGGGCAAT AAAAGAGAAA
AAAGCCATTT TTGGCATATT TGCAGGAGTC CTTGTGGCAA TTTCAAATCA TATAAGACCT
GTTGCTGTTG TGATCATAAT TGCGATGGTT TTCTTGTTTG CACTAAAAGC TCTTTGTAAA
GATTTTAAAA TCTTGAAAAG TGCTATCTTA AGTATAATTT CATACGTTAT TACATTCTAT
ACAGTAGGAT ATCTAATTTT TTGTCTCACA GGCATTCCTG TGTGGAAAAC ATCAATGGGG
CTTAATCTCA TGATTGGTAC AGACTATACA ACATATGGTA TGAACAATCC TAAACATTCT
TTGTTTGTTA AAAAATATGC TTATGATTTT CAAAAGATGC ACGGTGAGGT TATGAAAATA
GGGTTAGAGA GACTTAAAAA AGAAACAAAA AAATTTATTG CTATTCTTCC TCGCAAACAT
GCTATTATCT GGGGCGATGA TAGCTTTGGG TATTTTTGGA GTACTTTTAA AGTTTACAAA
ACCACATATT TTGTTAATCT TGTAAAAATT CATCCAACCA TTTTCTATAT GTTCTCTCAG
CTATACTACT AA
 
Protein sequence
MQNKVGRLYT FFRYIVVVLL GLAIVLSFQF RIVALTHKKI SFVVMVILLL SAFLIFNIWL 
FIFQKLANKK LAILLLILVI AAPRLIWIFL MPTKPVSDYL CFYSYAQKAS QGFLKGYDMT
FTLFRFRFGY SLFLALVFKI FGSSIIVGKL FNVFLSVVLG LIIYFTVDYL FGKEAATYSA
ILYAFWPSQI MYNSVLASEH PFIVFFVLGL YFLLRAIKEK KAIFGIFAGV LVAISNHIRP
VAVVIIIAMV FLFALKALCK DFKILKSAIL SIISYVITFY TVGYLIFCLT GIPVWKTSMG
LNLMIGTDYT TYGMNNPKHS LFVKKYAYDF QKMHGEVMKI GLERLKKETK KFIAILPRKH
AIIWGDDSFG YFWSTFKVYK TTYFVNLVKI HPTIFYMFSQ LYY