Gene Hlac_1059 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHlac_1059 
Symbol 
ID7400131 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalorubrum lacusprofundi ATCC 49239 
KingdomArchaea 
Replicon accessionNC_012029 
Strand
Start bp1055890 
End bp1057062 
Gene Length1173 bp 
Protein Length390 aa 
Translation table11 
GC content69% 
IMG OID643708127 
Productpoly-gamma-glutamate synthesis protein (capsule biosynthesis protein) 
Protein accessionYP_002565726 
Protein GI222479489 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG2843] Putative enzyme of poly-gamma-glutamate biosynthesis (capsule formation) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones24 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGAACAC GCCGGACTCT GCTGGCATCG GGCGTCGCCG GACTCGTGGG ACTCGCCGGG 
TGCGCCGCTA CGCCGCCGAC TGCGGACGAC GAACGTCGCA GAGCGACCGG CAACGCCTCC
GCCACGGGCG ACGACGACGC TGACGCGAGC GACGAGGACA CCACCGAAGG CGACGTGACC
CGGATCGGGT TCGTCGGCGA CCTGATGCTC GGCCGGAGCG TCAACGAGCG GTGGGTCGAC
GACGACAATC CTGAAAACGT CTGGGGATCG ACGCTCTCGC GGCTTCAGGA ACTCGACGGA
CTGGTCGGGA ACTTGGAGTG TTGCGTCTCC GATCGCGGGA CGCGCTGGCC GAACAAGGGG
TACTACTTCC GAGCGGCTCC CGCCTTCGCG GTGCCGGCCC TCGAAGCCGC AGGTGCCTCG
TTCGTCTCGC TCGCGAACAA TCACGTTCTC GACTACCGCG AGCCCGCGCT GCGCGACACC
GCCTCGCACC TGACCGACGC GGGAATCGCA CACGCCGGCG CCGGCACTAA CCGGGAGTCG
GCGCTCGAAC CCGCGGTGTT CGAGGCGGAC GACCTGACCG TCGCGGCGTT CGGCCTCACC
GACCAGTCCG AGGAGTTCGC GGCGGGAGCG TCGGAGCCGG GAACCGCCTT CGCGACGCTC
GATCCCGCCG TGTCCCCGAC GCGCTCGCTC GTCGAGGAGA TTCTCGACCG CGCGGAGACA
CACGACCCCG ATCTCGTCGT CGCCTCGCTC CACTGGGGAC CGAACTGGGA GACCGAACCC
CGAGCGGTCC ACGAGCGGTT CGGCCGGTGG CTCGTCGATC AGGGTGTCGA CGTGGTCCAC
GGCCACAGCG CGCACGTCCT CCAAGGGGTC GAGGTGTACC GAGGGCGCCC GATCATCTAC
GACGCGGGAG ACTTCGTCGA CGACTACGTC GACTACATCG ATCGGGAGGG CGTCCACAAC
AAGCGGAGCG CCCTCTTCGA GCTGGTCGTG CGCGACGGCG ACCTCGACGA GCTGGTCGTC
GAGCCGACCG CGATCGTCGA CGAGGCGGCG ACGCTGGCGG ACGACAATAT CGCCGAGTGG
GTGCGCGACA CCCTCGTAGA GCGGTCTGAG GCGTTCGGGA CCGAGGTCGA GCGGAGGGAC
GCCCGGTTGG CGTTCCCGCT GGGCGAGGAC TGA
 
Protein sequence
MRTRRTLLAS GVAGLVGLAG CAATPPTADD ERRRATGNAS ATGDDDADAS DEDTTEGDVT 
RIGFVGDLML GRSVNERWVD DDNPENVWGS TLSRLQELDG LVGNLECCVS DRGTRWPNKG
YYFRAAPAFA VPALEAAGAS FVSLANNHVL DYREPALRDT ASHLTDAGIA HAGAGTNRES
ALEPAVFEAD DLTVAAFGLT DQSEEFAAGA SEPGTAFATL DPAVSPTRSL VEEILDRAET
HDPDLVVASL HWGPNWETEP RAVHERFGRW LVDQGVDVVH GHSAHVLQGV EVYRGRPIIY
DAGDFVDDYV DYIDREGVHN KRSALFELVV RDGDLDELVV EPTAIVDEAA TLADDNIAEW
VRDTLVERSE AFGTEVERRD ARLAFPLGED