Gene Haur_2242 details

Gene Information

Locus tag: Haur_2242
Symbol:
ID: 5734129
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Herpetosiphon aurantiacus ATCC 23779
Kingdom: Bacteria
Replicon accession: NC_009972
Strand:
Start bp: 2858501
End bp: 2859625
Gene Length: 1125 bp
Protein Length: 374 aa
Translation table: 11
GC content: 51%
IMG OID: 641279383
Product: putative poly-gamma-glutamate biosynthesis (capsule formation)-like
Protein accession: YP_001545010
Protein GI: 159898763
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG2843] Putative enzyme of poly-gamma-glutamate biosynthesis (capsule formation)
TIGRFAM ID:
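
The coordinates and lengths in this record are mutually consistent, which a little arithmetic confirms. The following is a minimal Python sketch, assuming the start and end positions are 1-based and inclusive and that the protein length excludes the terminal stop codon. The function names are hypothetical and do not belong to any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Amino acid count of a CDS, assuming one uncounted terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record: Start bp 2858501, End bp 2859625.
length_bp = gene_length(2858501, 2859625)
print(length_bp, protein_length(length_bp))  # 1125 374
```

Under these assumptions the result matches the listed gene length of 1125 bp and protein length of 374 aa.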


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.00100002
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCGACGTT TGAGTTTTTG GATACTCTTA GGAATAATTT TAGCAGCCTG CGGGAGCAGC 
ACGCCCACCA CTGAGCCAAG CCAATTGGCA CTTGCCCCAA CCGCAACGAT TGCGGTAACT
GTAACCGCCG CACCAACCAA CTCTCCCGAA CCAACCAGTA CAACCCTGCC AACCGTAACC
ACTGAGCCAA GCCCAACCCC TGAACCTAAA ATCGAATTGG CAGTGGTTGG TGATATTATG
CTAGCTCGCT CGATCGGCGA GCGCATTCTT AGCGATAGCC CTGAGCAGCC CTTTGCCGGA
GTGCGCGATG AATTAGTTAA TGCCGACCTG ACGATTGGCA ATCTTGAAAC GGCAATTGCT
GATGCTGGCG AACCTGCGCC CAAAGCCTAC CGTTTTTTAG CGCCCCCCGA AAGTGTTGAT
AGCCTTAGCG ATGCAGGCTT TGATCTAGTT TCGCTGGCCA ATAATCATAG CCTCGATTGG
GGTGAATCGG CTTTAAGCGA GACAATTGGC CTATTGAATG AGGCTGAGAT TGCCAATGTT
GGTGCAGGCA TGAACGCCGA ACAGGCCTAT CGTCCAGTTA TTATCGAGAA ACATGGCTTG
CGTTTGGCGT TTCTGGCCTA TGTGAATGTG CCAGTTGAGC GTGGCGGATT TGTAACCGAA
TCGTGGACAG CCACTGCCGA ACAAGCAGGC TTGGCTTGGG CCGAACCAGC AGTGATCGCG
GCTGATGTCG CGGCAATTCG GCCAAGCGTC GATCATGTGA TTATCTTGCT GCATAGCGGC
TATGAAGGGA TTGATCAACC AAATGAGATT CAGCGAAGCA ATGCCTATGC GGCACTCGAC
GCTGGCGCAA CCTTAGTTTT GGGTGCACAT CCGCACGTGT TGCAAGGCTA TGAAGCCCGC
CCGAATGGCC AATTTATTGC TTGGAGTTTG GGCAATTTTG TATTTGATGG CTTCGATGGT
ACACCTAGTC TTGATAGTGC CATTTTACAT TTGACCCTCG ATAAAACGAG AGTCATCGCC
TCACGCTGGA CACCAGTCCG CTTGATCGAT GGCTATCCAC AGGCGCTCGA TCCCACAACT
GATGGAGCCT ATATCATTGA AAAAATTGAG CAATTGAGCA ATTAA
 
Protein sequence
MRRLSFWILL GIILAACGSS TPTTEPSQLA LAPTATIAVT VTAAPTNSPE PTSTTLPTVT 
TEPSPTPEPK IELAVVGDIM LARSIGERIL SDSPEQPFAG VRDELVNADL TIGNLETAIA
DAGEPAPKAY RFLAPPESVD SLSDAGFDLV SLANNHSLDW GESALSETIG LLNEAEIANV
GAGMNAEQAY RPVIIEKHGL RLAFLAYVNV PVERGGFVTE SWTATAEQAG LAWAEPAVIA
ADVAAIRPSV DHVIILLHSG YEGIDQPNEI QRSNAYAALD AGATLVLGAH PHVLQGYEAR
PNGQFIAWSL GNFVFDGFDG TPSLDSAILH LTLDKTRVIA SRWTPVRLID GYPQALDPTT
DGAYIIEKIE QLSN
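
The sequences are displayed in blocks of ten characters, so the spacing has to be removed before any analysis. The sketch below is one possible way to do that in Python and to translate the coding sequence. It assumes that translation table 11 assigns the same amino acids to codons as the standard genetic code (the tables differ in which codons may act as start codons, which does not matter here because this gene begins with ATG). All names (`clean`, `translate`, `gc_content`) are hypothetical helpers, not part of any existing library.

```python
BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order; '*' marks a stop codon.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove display spacing and line breaks, then upper-case the sequence."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


def gc_content(dna: str) -> float:
    """Fraction of G and C bases in the cleaned sequence."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First three blocks of the gene sequence listed above.
first_blocks = "ATGCGACGTT TGAGTTTTTG GATACTCTTA"
print(translate(first_blocks))  # MRRLSFWILL
```

The output agrees with the first ten residues of the protein sequence listed above. Passing the full gene sequence to `translate` and `gc_content` would allow the protein sequence and the GC content in the record to be checked in the same way.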