Gene Hhal_1999 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHhal_1999 
Symbol 
ID4710416 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalorhodospira halophila SL1 
KingdomBacteria 
Replicon accessionNC_008789 
Strand
Start bp2203460 
End bp2204749 
Gene Length1290 bp 
Protein Length429 aa 
Translation table11 
GC content71% 
IMG OID639856472 
Productphosphoribosylamine--glycine ligase 
Protein accessionYP_001003565 
Protein GI121998778 
COG category[F] Nucleotide transport and metabolism 
COG ID[COG0151] Phosphoribosylamine-glycine ligase 
TIGRFAM ID[TIGR00877] phosphoribosylamine--glycine ligase 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00638776 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAGGTAC TGGTGATCGG AGGGGGCGGC CGCGAGCACG CCATGGCCTG GGCCCTGGCG 
CGCTCCAAGC AGGTCGAGGA GGTGCTGGTC GCCCCGGGGA ATGCCGGCAC GGAGCGCGAG
CCCAAGGTGC GCAACATCCA GGTGGGGGCG GAGGATATCC CCGCGCTGGT CCAACAGGCC
CGCGAGCAGG AGGTGGCTTT GACCGTGGTC GGCCCCGAGG CGCCCCTGGT GGCCGGCGTG
GTGGACGCCT TCCAGCAGGC GGGTCTGGCG TGTCTGGGGC CGACGGCGGA CGCCGCCGAG
CTGGAGGGCT CGAAGGCGTT TGCCAAGGCG TTCATGGCGC GTCACGGGGT GCCAACGGCC
GCCTACCGCA CCTTCGACGA CCTGGGGGCG GCCAGCGACT ATATCCGCGA GCACTCGACG
CCGATGGTCA TCAAGGCGGA CGGCCTGGCC TCCGGCAAGG GGGTCGAGGT GGCGGCGACC
AAGGACGAGG CCCTATTGGC GGCCGAGCGC ATGCTCTCGG GGCAGGCCTT CGGGGATGCC
GGCGCGCGGG TCGTGGTCGA GGAGTGCCTG CAGGGCGAGG AGCTGAGCTT CATCGCGCTG
GTCGATGGCG AGCACGTGGT GGCGATGGCC AGCTCCCAGG ATCACAAGCC GCGGGACGAC
GGAGATCGGG GCCCCAACAC CGGTGGTATG GGGGCCTATT CGCCGGCGCC GCTGATGGAT
GAGCAGCTCT ACCAGCGGGT CATGGACGAG GTGATCCGCC CCACGGTCCA GGGGCTGGCC
GCCGAGGGGC GCCCCTATCA GGGCTTCCTC TACGCCGGAC TGATGATCGA CGCCGACGGC
AACCCCCGGG TGCTGGAGTA CAACTGCCGC CTGGGTGACC CGGAGGCGCA GCCGCTGTTG
ATGCGCCTGG ACGCGGATTT TGCCGAAGTC TGCCGGGCCG CCCTCGAGGG GCGACTGGGC
GAGGTGGATC TGGCCTGGGA CTCGCGTCCT GCTGTGGGCG TGGTGATGGC GGCGGCCGGC
TATCCAGGGT CGGTGGAGCG GGGCGATGTC ATCGAAGGGC TCGACGACGC CGAGGCCACC
GGCTGCAAGG TCTTCCACGG CGGCACGACC TTCGACGCCG ACGGCCGTGT GGTGACCAAC
GGGGGGCGGG TGCTGTGCTG TTGCGCGCTG GGTGAGCGTG TCTCTGCAGC GCAGCAGGCG
GCCTACCGGG GGGTGGCGGC CATCCATTGG GAAGGGGTGT TCTACCGGCG GGATATCGGT
GCGCGGGCCA TCGCCCGGGA GACCGGCTGA
 
Protein sequence
MKVLVIGGGG REHAMAWALA RSKQVEEVLV APGNAGTERE PKVRNIQVGA EDIPALVQQA 
REQEVALTVV GPEAPLVAGV VDAFQQAGLA CLGPTADAAE LEGSKAFAKA FMARHGVPTA
AYRTFDDLGA ASDYIREHST PMVIKADGLA SGKGVEVAAT KDEALLAAER MLSGQAFGDA
GARVVVEECL QGEELSFIAL VDGEHVVAMA SSQDHKPRDD GDRGPNTGGM GAYSPAPLMD
EQLYQRVMDE VIRPTVQGLA AEGRPYQGFL YAGLMIDADG NPRVLEYNCR LGDPEAQPLL
MRLDADFAEV CRAALEGRLG EVDLAWDSRP AVGVVMAAAG YPGSVERGDV IEGLDDAEAT
GCKVFHGGTT FDADGRVVTN GGRVLCCCAL GERVSAAQQA AYRGVAAIHW EGVFYRRDIG
ARAIARETG