Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Huta_2565 |
Symbol | |
ID | 8384870 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halorhabdus utahensis DSM 12940 |
Kingdom | Archaea |
Replicon accession | NC_013158 |
Strand | - |
Start bp | 2629614 |
End bp | 2630711 |
Gene Length | 1098 bp |
Protein Length | 365 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 644973642 |
Product | poly-gamma-glutamate synthesis protein (capsule biosynthesis protein) |
Protein accession | YP_003131462 |
Protein GI | 257053629 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG2843] Putative enzyme of poly-gamma-glutamate biosynthesis (capsule formation) |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCGCTGGA CTCGTCGCGG ATTGTTGGCG GCAGGGGCGA CGGCCCTGGG AGGCTGTGCG AGCGACACCG AAACGCAGAG TCGACGTGAC GAGGACAGGT CCAGAGCATC TGCCACCGTC GGCTTCGTCG GCGATGCCAT GCTTGGACGG GGTGTGAACG ACCGCTGGAC CGACAACGAG GCCGCGGGCG TCTGGGGATC CACCATCGAG CGACTACAAG CCCTTGACGG ACTCGTGGTG AACCTGGAGT GCTGTATCGC CAGCCCCGAG AAGGGCGAGC GATGGCCGGG CAAGACCTAC TACTTCCGGG CCGAGCCCGA CTTCGCGGTA CCAGCACTCG AAGCAGCGAA CGTCTCGGTC GCCGCACTCG CCAACAACCA CGTGCTTGAC TTCGGTGAGT CGGGCCTCCA GAGGACGCTC GCTCACCTCG ATAATGCAGG CATCGCCCAC ACTGGGGCAG GGCCAAACCG CGGTACTGCG CTCGAACCCG CCGTCTTCGA CCCGAGCCTG ACCATCGCGG TGATCTCGCT CACCGACCGG TGGGCCGCCT ACGCCGCAGG TGAGCACAGT CCCGGCACAG CCCACACGCC ACTCGATCGC TCGGCAGGTT CGACGCGTGC CATCGTTCAA AACACCCTCG AGCGCGTCGA AACGGCCGAT CCCGACCTCG TCGTCGCGTC ACTGCACTGG GGGTCCAACT GGGAGACGTC CCCGAGTCCA ACCCAGCAGG CCTTCGCCCG GTGGCTCGTC GAGCAGGGCG TGGACGTGGT CCATGGCCAT AGCGCTCACG TGCTCCAGGG CGTCGAAGTG TACCAGGGCC GCCCAATTAT CTACGACGCC GGCGACTTCG TTGACGACTA CATTCACAAA GACGGACTGC ACAACAAGCG CAGCGCGCTG TTCGAGCTGG TCGTGACCGA CGGTCGTCTC GACGAGCTCC GGCTCGTCCC AGTCGAGATC GAGAACAAAG CTGTCTCTCT GGCTGATGCG GACGTTTCTC GGTGGGTGCA CGAGACGATA GCCGAGCGTT CGGAGCCGTT CGGCACCAGA TTCGAACGGA CCGACGACGG CGCGGTCGTT CCACTGGGAT CGTGCTGA
|
Protein sequence | MRWTRRGLLA AGATALGGCA SDTETQSRRD EDRSRASATV GFVGDAMLGR GVNDRWTDNE AAGVWGSTIE RLQALDGLVV NLECCIASPE KGERWPGKTY YFRAEPDFAV PALEAANVSV AALANNHVLD FGESGLQRTL AHLDNAGIAH TGAGPNRGTA LEPAVFDPSL TIAVISLTDR WAAYAAGEHS PGTAHTPLDR SAGSTRAIVQ NTLERVETAD PDLVVASLHW GSNWETSPSP TQQAFARWLV EQGVDVVHGH SAHVLQGVEV YQGRPIIYDA GDFVDDYIHK DGLHNKRSAL FELVVTDGRL DELRLVPVEI ENKAVSLADA DVSRWVHETI AERSEPFGTR FERTDDGAVV PLGSC
|
| |