Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_2242 |
Symbol | |
ID | 5734129 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | + |
Start bp | 2858501 |
End bp | 2859625 |
Gene Length | 1125 bp |
Protein Length | 374 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 641279383 |
Product | putative poly-gamma-glutamate biosynthesis (capsule formation)-like |
Protein accession | YP_001545010 |
Protein GI | 159898763 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG2843] Putative enzyme of poly-gamma-glutamate biosynthesis (capsule formation) |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 3 |
Plasmid unclonability p-value | 0.00100002 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCGACGTT TGAGTTTTTG GATACTCTTA GGAATAATTT TAGCAGCCTG CGGGAGCAGC ACGCCCACCA CTGAGCCAAG CCAATTGGCA CTTGCCCCAA CCGCAACGAT TGCGGTAACT GTAACCGCCG CACCAACCAA CTCTCCCGAA CCAACCAGTA CAACCCTGCC AACCGTAACC ACTGAGCCAA GCCCAACCCC TGAACCTAAA ATCGAATTGG CAGTGGTTGG TGATATTATG CTAGCTCGCT CGATCGGCGA GCGCATTCTT AGCGATAGCC CTGAGCAGCC CTTTGCCGGA GTGCGCGATG AATTAGTTAA TGCCGACCTG ACGATTGGCA ATCTTGAAAC GGCAATTGCT GATGCTGGCG AACCTGCGCC CAAAGCCTAC CGTTTTTTAG CGCCCCCCGA AAGTGTTGAT AGCCTTAGCG ATGCAGGCTT TGATCTAGTT TCGCTGGCCA ATAATCATAG CCTCGATTGG GGTGAATCGG CTTTAAGCGA GACAATTGGC CTATTGAATG AGGCTGAGAT TGCCAATGTT GGTGCAGGCA TGAACGCCGA ACAGGCCTAT CGTCCAGTTA TTATCGAGAA ACATGGCTTG CGTTTGGCGT TTCTGGCCTA TGTGAATGTG CCAGTTGAGC GTGGCGGATT TGTAACCGAA TCGTGGACAG CCACTGCCGA ACAAGCAGGC TTGGCTTGGG CCGAACCAGC AGTGATCGCG GCTGATGTCG CGGCAATTCG GCCAAGCGTC GATCATGTGA TTATCTTGCT GCATAGCGGC TATGAAGGGA TTGATCAACC AAATGAGATT CAGCGAAGCA ATGCCTATGC GGCACTCGAC GCTGGCGCAA CCTTAGTTTT GGGTGCACAT CCGCACGTGT TGCAAGGCTA TGAAGCCCGC CCGAATGGCC AATTTATTGC TTGGAGTTTG GGCAATTTTG TATTTGATGG CTTCGATGGT ACACCTAGTC TTGATAGTGC CATTTTACAT TTGACCCTCG ATAAAACGAG AGTCATCGCC TCACGCTGGA CACCAGTCCG CTTGATCGAT GGCTATCCAC AGGCGCTCGA TCCCACAACT GATGGAGCCT ATATCATTGA AAAAATTGAG CAATTGAGCA ATTAA
|
Protein sequence | MRRLSFWILL GIILAACGSS TPTTEPSQLA LAPTATIAVT VTAAPTNSPE PTSTTLPTVT TEPSPTPEPK IELAVVGDIM LARSIGERIL SDSPEQPFAG VRDELVNADL TIGNLETAIA DAGEPAPKAY RFLAPPESVD SLSDAGFDLV SLANNHSLDW GESALSETIG LLNEAEIANV GAGMNAEQAY RPVIIEKHGL RLAFLAYVNV PVERGGFVTE SWTATAEQAG LAWAEPAVIA ADVAAIRPSV DHVIILLHSG YEGIDQPNEI QRSNAYAALD AGATLVLGAH PHVLQGYEAR PNGQFIAWSL GNFVFDGFDG TPSLDSAILH LTLDKTRVIA SRWTPVRLID GYPQALDPTT DGAYIIEKIE QLSN
|
| |