Gene Athe_1556 details

Gene Information

Locus tag: Athe_1556
Symbol:
ID: 7409064
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Anaerocellum thermophilum DSM 6725
Kingdom: Bacteria
Replicon accession: NC_012034
Strand:
Start bp: 1646966
End bp: 1648213
Gene Length: 1248 bp
Protein Length: 415 aa
Translation table: 11
GC content: 33%
IMG OID: 643715928
Product: poly-gamma-glutamate biosynthesis protein
Protein accession: YP_002573427
Protein GI: 222529545
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG2843] Putative enzyme of poly-gamma-glutamate biosynthesis (capsule formation)
TIGRFAM ID:
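
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is a minimal illustration, not the database's own method: it assumes the start and end positions are 1-based and inclusive, and that the final codon of the CDS is a stop codon that is not counted in the protein length. The variable names are illustrative only.

```python
# Values copied from the Gene Information fields above.
start_bp = 1646966
end_bp = 1648213
gene_length_bp = 1248
protein_length_aa = 415

# Assumption: coordinates are 1-based and inclusive, so both ends count.
span_bp = end_bp - start_bp + 1

# A CDS of N bases holds N // 3 codons; the last codon is assumed to be
# the stop codon, which does not contribute a residue.
codons = span_bp // 3
residues = codons - 1

print(span_bp, span_bp % 3, residues)  # prints: 1248 0 415
```

Under these assumptions the span matches the listed gene length, the length is a multiple of three, and the residue count matches the listed protein length.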


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value: 0.0509053
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAGAAAGC TTTTATATGC ATTGGGGTCG TTTATCATAA TACTTTTGAT AGGCATAAAT 
GCTCTTTTTT ACTATGAAAA CATGTACATC AAAAAATACA TTCCTGTCTC AGAAGAGGTT
TCCAGAAAAC CTCAGCAAAA GACTTTGCCA AAGACACAGC AGAATAAGAA TACTTCTGCT
AAAAACAATT CTTTGTCTGT TAAGCTTAAG TCCCAACCAA AAGAATACAA GGCAAGTTTA
TTTATGGCAG GAGATGTGTT TTTCAACGGC TATCTTTTAA AATCTTATTA TGACAGGCAA
TCTCAGAATT ATGTATTCGG GGACATCTTG GAAAACGTTA AAGATATCAC TTATGCAGAC
CTTAGTATTT TCAAGTTTGA CAGCACAATT ACTGACAATA TTCCTGTTTC AACATATGGG
AAATACAACG CTCCAAAAGA GGCTTTAGAT GTGCTCAAGT CAGCTGGATT TAATCTTGCG
GTACTTTCAT CATCGCATAT ATTTGATGGA AAGGTGGAAA GTTTGAAGAA AACAATAAAT
AATTTGAAAG AGGCAAAGAT TGAAACAGTA GGCGTTAAGC TTTCTCAGGA AGATCATACC
TCAAAGTTTT TTGATATAAA CAACATAAGA ATTGGCGTTG CTGCGTTCAC AAAAGAGCTT
TCATCAGTCT ATTTGGGAGG AAGCTCTACT TATAAAGATT TTGTTAGTCT TCTTGACAAG
GATGAGATAC AAAATGAGAT TGAGTACTTG AAAGGACTCA ACTGTGACAT TATAATAGCA
TGTGCAAACT GGGGAGTTGA AAATTCAAAC TCAGTTAGTT TTGAGCAGAA AGAGTTTGCA
AAAGAGCTTA TAAAGAATGG TGTTGATATT GTAATTGGTA CTCATACTCA TACAATTCAG
CCGTTTGAAA AGGTTAAGGT TGAGGATGAG TCAGGCAATA TAAAAGAGGG GATAGTATTT
TATTCGCTTG GTAATTTCCT GTGCGACCAG ACAGTTATTT TTCCATACAA TAGATTTGGT
TTGACAGTGA GACTTGATCT TGTTAAAAAA GAAAATAAGC TTACTAAAAA GATATCAGTT
GAGCCAATAT ATATCTTTAG AAAGACAAGA AGGAATGCAA GTTACTATGA TTTTATTGTT
CTTAAAGCAA AAGACATTTT AAATAGAACT GATATAAAGA CATCCTACAT TAATTATGCC
AAAAAACTGC TTGAAGATGT TGATAAATGG CTTAAAAATG TGCAATAA
 
Protein sequence
MRKLLYALGS FIIILLIGIN ALFYYENMYI KKYIPVSEEV SRKPQQKTLP KTQQNKNTSA 
KNNSLSVKLK SQPKEYKASL FMAGDVFFNG YLLKSYYDRQ SQNYVFGDIL ENVKDITYAD
LSIFKFDSTI TDNIPVSTYG KYNAPKEALD VLKSAGFNLA VLSSSHIFDG KVESLKKTIN
NLKEAKIETV GVKLSQEDHT SKFFDINNIR IGVAAFTKEL SSVYLGGSST YKDFVSLLDK
DEIQNEIEYL KGLNCDIIIA CANWGVENSN SVSFEQKEFA KELIKNGVDI VIGTHTHTIQ
PFEKVKVEDE SGNIKEGIVF YSLGNFLCDQ TVIFPYNRFG LTVRLDLVKK ENKLTKKISV
EPIYIFRKTR RNASYYDFIV LKAKDILNRT DIKTSYINYA KKLLEDVDKW LKNVQ
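
The gene and protein sequences above can be compared by translating the nucleotide sequence codon by codon. The following is a minimal sketch under stated assumptions: it uses the standard amino-acid assignments (which translation table 11 shares for sense and stop codons; table 11 differs in its permitted start codons, which does not matter here because this gene starts with ATG), and the helper names `clean`, `translate`, and `gc_percent` are hypothetical, not part of any database tool. How the record's GC content figure was computed or rounded is not stated, so `gc_percent` is only one plausible way to compute it.

```python
# Codon table encoded compactly: index = 16*first + 4*second + third,
# with each base indexed by its position in BASES.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)


def clean(seq):
    """Remove the whitespace used to group bases in blocks of ten."""
    return "".join(seq.split()).upper()


def translate(dna):
    """Translate in frame from the first base, stopping at a stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        first, second, third = (BASES.index(base) for base in dna[i:i + 3])
        amino_acid = AMINO_ACIDS[16 * first + 4 * second + third]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


def gc_percent(dna):
    """Percentage of bases that are G or C."""
    dna = clean(dna)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First line of the gene sequence listed above (60 bases, 20 codons).
first_line = (
    "ATGAGAAAGC TTTTATATGC ATTGGGGTCG TTTATCATAA TACTTTTGAT AGGCATAAAT"
)
print(translate(first_line))  # prints: MRKLLYALGSFIIILLIGIN
```

The printed residues agree with the first twenty residues of the listed protein sequence. Passing the full gene sequence to `translate` and `gc_percent` would extend the same check to the whole record.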