Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Athe_1556 |
Symbol | |
ID | 7409064 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Anaerocellum thermophilum DSM 6725 |
Kingdom | Bacteria |
Replicon accession | NC_012034 |
Strand | - |
Start bp | 1646966 |
End bp | 1648213 |
Gene Length | 1248 bp |
Protein Length | 415 aa |
Translation table | 11 |
GC content | 33% |
IMG OID | 643715928 |
Product | poly-gamma-glutamate biosynthesis protein |
Protein accession | YP_002573427 |
Protein GI | 222529545 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG2843] Putative enzyme of poly-gamma-glutamate biosynthesis (capsule formation) |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.0509053 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGAAAGC TTTTATATGC ATTGGGGTCG TTTATCATAA TACTTTTGAT AGGCATAAAT GCTCTTTTTT ACTATGAAAA CATGTACATC AAAAAATACA TTCCTGTCTC AGAAGAGGTT TCCAGAAAAC CTCAGCAAAA GACTTTGCCA AAGACACAGC AGAATAAGAA TACTTCTGCT AAAAACAATT CTTTGTCTGT TAAGCTTAAG TCCCAACCAA AAGAATACAA GGCAAGTTTA TTTATGGCAG GAGATGTGTT TTTCAACGGC TATCTTTTAA AATCTTATTA TGACAGGCAA TCTCAGAATT ATGTATTCGG GGACATCTTG GAAAACGTTA AAGATATCAC TTATGCAGAC CTTAGTATTT TCAAGTTTGA CAGCACAATT ACTGACAATA TTCCTGTTTC AACATATGGG AAATACAACG CTCCAAAAGA GGCTTTAGAT GTGCTCAAGT CAGCTGGATT TAATCTTGCG GTACTTTCAT CATCGCATAT ATTTGATGGA AAGGTGGAAA GTTTGAAGAA AACAATAAAT AATTTGAAAG AGGCAAAGAT TGAAACAGTA GGCGTTAAGC TTTCTCAGGA AGATCATACC TCAAAGTTTT TTGATATAAA CAACATAAGA ATTGGCGTTG CTGCGTTCAC AAAAGAGCTT TCATCAGTCT ATTTGGGAGG AAGCTCTACT TATAAAGATT TTGTTAGTCT TCTTGACAAG GATGAGATAC AAAATGAGAT TGAGTACTTG AAAGGACTCA ACTGTGACAT TATAATAGCA TGTGCAAACT GGGGAGTTGA AAATTCAAAC TCAGTTAGTT TTGAGCAGAA AGAGTTTGCA AAAGAGCTTA TAAAGAATGG TGTTGATATT GTAATTGGTA CTCATACTCA TACAATTCAG CCGTTTGAAA AGGTTAAGGT TGAGGATGAG TCAGGCAATA TAAAAGAGGG GATAGTATTT TATTCGCTTG GTAATTTCCT GTGCGACCAG ACAGTTATTT TTCCATACAA TAGATTTGGT TTGACAGTGA GACTTGATCT TGTTAAAAAA GAAAATAAGC TTACTAAAAA GATATCAGTT GAGCCAATAT ATATCTTTAG AAAGACAAGA AGGAATGCAA GTTACTATGA TTTTATTGTT CTTAAAGCAA AAGACATTTT AAATAGAACT GATATAAAGA CATCCTACAT TAATTATGCC AAAAAACTGC TTGAAGATGT TGATAAATGG CTTAAAAATG TGCAATAA
|
Protein sequence | MRKLLYALGS FIIILLIGIN ALFYYENMYI KKYIPVSEEV SRKPQQKTLP KTQQNKNTSA KNNSLSVKLK SQPKEYKASL FMAGDVFFNG YLLKSYYDRQ SQNYVFGDIL ENVKDITYAD LSIFKFDSTI TDNIPVSTYG KYNAPKEALD VLKSAGFNLA VLSSSHIFDG KVESLKKTIN NLKEAKIETV GVKLSQEDHT SKFFDINNIR IGVAAFTKEL SSVYLGGSST YKDFVSLLDK DEIQNEIEYL KGLNCDIIIA CANWGVENSN SVSFEQKEFA KELIKNGVDI VIGTHTHTIQ PFEKVKVEDE SGNIKEGIVF YSLGNFLCDQ TVIFPYNRFG LTVRLDLVKK ENKLTKKISV EPIYIFRKTR RNASYYDFIV LKAKDILNRT DIKTSYINYA KKLLEDVDKW LKNVQ
|
| |