Gene Hhal_2011 details


Gene Information

Locus tag: Hhal_2011
Symbol:
ID: 4710408
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Halorhodospira halophila SL1
Kingdom: Bacteria
Replicon accession: NC_008789
Strand:
Start bp: 2215256
End bp: 2216536
Gene Length: 1281 bp
Protein Length: 426 aa
Translation table: 11
GC content: 70%
IMG OID: 639856484
Product: glutamate-1-semialdehyde aminotransferase
Protein accession: YP_001003577
Protein GI: 121998790
COG category: [H] Coenzyme transport and metabolism
COG ID: [COG0001] Glutamate-1-semialdehyde aminotransferase
TIGRFAM ID: [TIGR00713] glutamate-1-semialdehyde-2,1-aminomutase
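
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive coordinates and that the final codon of the CDS is a stop codon; the function name `cds_lengths` is hypothetical and not part of any database tooling.

```python
def cds_lengths(start_bp, end_bp):
    """Return (gene length in bp, protein length in aa) for an unspliced CDS.

    Assumes 1-based inclusive coordinates and a terminal stop codon.
    """
    gene_len = end_bp - start_bp + 1       # inclusive span on the replicon
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein_len = gene_len // 3 - 1        # the last codon is the stop
    return gene_len, protein_len


print(cds_lengths(2215256, 2216536))  # (1281, 426)
```

Under those assumptions the result agrees with the Gene Length and Protein Length fields of this record.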


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value: 0.956002
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCAGCGCA CCCATCAGTT ATTCCAGCAG GCCCGGGAGC TGATCCCCGG GGGCGTCAAC 
TCCCCGGTGC GCGCGTTCAA GGGGGTCGGT GGCGAGCCGA TCTTCTTCGA GCGCGGCGAG
GGGCCGTATA TGTGGGACGT GGACGGCAAG CGCTACGTCG ATTACGTCTG TTCCTGGGGG
CCGCTGGTCG CCGGGCACGC CGACCCGGAG ATCGTCCGGC GCGTTCAGGA GACCGCCGCC
AAGGGGCTCT CCTTCGGCAC CCCGGTGGAG CTCGAGGTGG AGATGGCCCG CACCCTCTGC
CAGCACGTGC CGTCGCTGGA GATGGTGCGC CTGGTCAACT CCGGCACCGA GGCGACCATG
AGTGCCCTGC GCCTGGCCCG CGGCTTTACC GGTCGCGACA AGATCGTCAA GTTCCAGGGC
AACTATCACG GTCACGTCGA CGCCCTGCTG GCGCAGGCCG GCTCCGGCGC CCTGACCCTG
GGCGTGCCCG GTTGTCCCGG TGTCCCCGAG GCGGTGGTCG CCGAGACGCT CACTGTGCCC
TATAACGATA TCGAGGCGGT GGAGCAGTGC TTCAACGAAC ACGGCAGCGA GATCGCTGCG
GTCATCGTCG AGCCGGTGGC CGGCAACATG AACTGCGTGC CGCCAGTGCC CGGCTTCCTG
GAGAAGCTGC GCGAGGTCTG CGACCGCACC GACGCCCTGC TGATCTTCGA TGAGGTGATG
ACCGGCTTCC GCGTCGGGCC GCAGTGCGCC CAGGGGCGCT ACGGCATCAC CCCGGATCTG
ACGTGCCTGG GCAAGGTCGT CGGCGGCGGC ATGCCGGTCG GCGCCTTCGG CGGCCGGCGC
GAGATCATGG AGGGGCTGGC CCCGACCGGT GGGGTCTACC AGGCCGGGAC GCTCTCCGGA
AACCCGGTGA CCATGGCCGC CGGGCTGGCG ACCCTGGAGC GGATCACTGC GCCGGGTGCC
TTCGAGGGGC TGGAGCAGAC CACCACGCGG GTGGTCGACG GCATCAAGGA GCGCGCCGAC
GCCGCGGGCA TCCCGCTGGC CACCAACCAA GCGGGCAGCA TGTTCGGCCT GTTCTTCACC
GACGACGCGC CGGTGACCCG TTTCGAGCAG GTCAAGGCGT GCGACCTGGA TGCGTTCAAC
CGCTTCTTCC ACGCCATGCT CGACGAGGGG GTCTACCTGG CCCCGGCGGC CTTCGAGGCC
GGCTTCGTCT CGCTGGCCCA CGACGATAAC GCCGTGCAGG AGACCCTGGA CGCTGCCGAG
CGCGCCTTCG CCCGCGTCTG A
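
A GC percentage like the one listed for this gene can be recomputed from a sequence in the 10-base blocked layout used above. This is a minimal sketch, not the database's own method; the function name `gc_percent` is hypothetical, and the sketch assumes the input contains only A, C, G and T plus whitespace.

```python
def gc_percent(blocked_seq):
    """Percentage of G and C bases in a whitespace-blocked DNA string."""
    seq = "".join(blocked_seq.split()).upper()   # drop spaces and newlines
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First two blocks of the gene sequence above
print(gc_percent("ATGCAGCGCA CCCATCAGTT"))  # 55.0
```

Passing the full gene sequence as one multi-line string works the same way, since all whitespace is removed before counting.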
 
Protein sequence
MQRTHQLFQQ ARELIPGGVN SPVRAFKGVG GEPIFFERGE GPYMWDVDGK RYVDYVCSWG 
PLVAGHADPE IVRRVQETAA KGLSFGTPVE LEVEMARTLC QHVPSLEMVR LVNSGTEATM
SALRLARGFT GRDKIVKFQG NYHGHVDALL AQAGSGALTL GVPGCPGVPE AVVAETLTVP
YNDIEAVEQC FNEHGSEIAA VIVEPVAGNM NCVPPVPGFL EKLREVCDRT DALLIFDEVM
TGFRVGPQCA QGRYGITPDL TCLGKVVGGG MPVGAFGGRR EIMEGLAPTG GVYQAGTLSG
NPVTMAAGLA TLERITAPGA FEGLEQTTTR VVDGIKERAD AAGIPLATNQ AGSMFGLFFT
DDAPVTRFEQ VKACDLDAFN RFFHAMLDEG VYLAPAAFEA GFVSLAHDDN AVQETLDAAE
RAFARV
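
The protein sequence can be checked against the gene sequence by translating it codon by codon. The sketch below builds the codon table from the compact amino-acid string NCBI uses for translation table 11 (base order T, C, A, G), whose amino-acid assignments are the same as the standard code. The names `CODON_TABLE` and `translate` are hypothetical, and the sketch assumes the sequence is given 5' to 3' on the coding strand. It does not treat alternative start codons specially, which is not an issue here because the gene begins with ATG.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for codons TTT, TTC, TTA, TTG, TCT, ... GGG ('*' marks a stop)
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(blocked_dna):
    """Translate a whitespace-blocked coding sequence up to the first stop."""
    seq = "".join(blocked_dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE.get(seq[i:i + 3], "X")   # 'X' for unknown codons
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence above
print(translate("ATGCAGCGCA CCCATCAGTT ATTCCAGCAG"))  # MQRTHQLFQQ
```

The output matches the first ten residues of the protein sequence listed above, and the final blocks of the gene (`CGCGCCTTCG CCCGCGTCTG A`) translate to `RAFARV` followed by a stop.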