Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mlg_1734 |
Symbol | |
ID | 4270841 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Alkalilimnicola ehrlichii MLHE-1 |
Kingdom | Bacteria |
Replicon accession | NC_008340 |
Strand | - |
Start bp | 1983309 |
End bp | 1984436 |
Gene Length | 1128 bp |
Protein Length | 375 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 638126492 |
Product | putative capsule biosynthesis protein |
Protein accession | YP_742570 |
Protein GI | 114320887 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG2843] Putative enzyme of poly-gamma-glutamate biosynthesis (capsule formation) |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.720674 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 38 |
Fosmid unclonability p-value | 0.681474 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGGTCGAT GGAGTCGTCC GGTGGAGCAC AGCGCAACGC TTTTCCTCGC GGGGGATGTG ATGACCGGTC GTGGCATCGA CCAGGTGCTG CCGCACTCGG TGGATCCGCG GATTTACGAG CCGGTGATGG ACTCGGCCGA GGGGTATGTG GCGCTCGCGG AGCGGGCAAA CGGGCCGATA CCCAGACCGG TGGATTACGC CTATGTCTGG GGCGATGCCC TGGCGGAGCT GCGGGCGTGG GCGCCGGCGC CGCGCCTGAT CAACCTGGAG ACGGCGGTCA CCACCTGCGA CGACCCCGAG CCCAAGGGCA TCCATTACCG CATGCACCCG GGTAATGTGC CGCTGCTGCA GGCGGCTGGC GTGGACGGTT GCGTCCTCGC CAACAACCAC GTGCTGGACT GGGGCCGGGC CGGTTTGCTG GAGACGCTGG AGGCACTGGA CCGGGCGGGC ATCCGTTATG TCGGGGCCGG TCGGGACCGG CGGGAGGCGG CGGCGCCGAC GGTATTTGGG GTGCCCGGTG GGCGGCTGGT GCTGTACGCG CTGGGTATGC CCAGCAGCGG AGTGCCGGCG GCCTGGGCGG CCACCGAGGA GCAGCCGGGC GTCAATGTGT TGCCCCGGCC CTCGGATGAG GCCGTGACCC TGCTGGCCGA GGCCGTCCGC CGTGAGCGCC GACCCGGCGA CGTGGTGGCG GTCTCGCTGC ACTGGGGCTC CAACTGGGGC TACGCCATCC CCCGGGCACA TCGGGTGTTT GCCCACGCGC TGATCGATCG GGCCGGGGTC GATCTGGTCT GGGGTCACTC TTCCCATCAT GTGCGCGGTA TCGAGGTCTA CAAGGACCGC CTGATCCTCT ACGGTTGTGG TGATTTCCTC AATGACTACG AGGGGATCGG TGGCCATGAG GCCTTCCGCG GCGACTTGGT GCTGATGTAC CTCCCAGCAC TGGCGCTCGC CGACGGCCGG CTGCGGGCCC TTACCCTGGT ACCGCTGCAG ATCCGGAACT TCCGGCTGCA CCGGGCGAAG CGGGCGGACG CCGAATGGCT GGCCGCGGTG CTGGACCGCG AGGGACGGGC GCTGGGCACC TCGGTGACCG TGAGCGCGGA CGGTCGCCTG GCGCTGCACT GGCGGTGA
|
Protein sequence | MGRWSRPVEH SATLFLAGDV MTGRGIDQVL PHSVDPRIYE PVMDSAEGYV ALAERANGPI PRPVDYAYVW GDALAELRAW APAPRLINLE TAVTTCDDPE PKGIHYRMHP GNVPLLQAAG VDGCVLANNH VLDWGRAGLL ETLEALDRAG IRYVGAGRDR REAAAPTVFG VPGGRLVLYA LGMPSSGVPA AWAATEEQPG VNVLPRPSDE AVTLLAEAVR RERRPGDVVA VSLHWGSNWG YAIPRAHRVF AHALIDRAGV DLVWGHSSHH VRGIEVYKDR LILYGCGDFL NDYEGIGGHE AFRGDLVLMY LPALALADGR LRALTLVPLQ IRNFRLHRAK RADAEWLAAV LDREGRALGT SVTVSADGRL ALHWR
|
| |