Gene Hhal_0467 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHhal_0467 
Symbol 
ID4711546 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalorhodospira halophila SL1 
KingdomBacteria 
Replicon accessionNC_008789 
Strand
Start bp535421 
End bp536530 
Gene Length1110 bp 
Protein Length369 aa 
Translation table11 
GC content70% 
IMG OID639854926 
Productribulose-bisphosphate carboxylase 
Protein accessionYP_001002057 
Protein GI121997270 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1850] Ribulose 1,5-bisphosphate carboxylase, large subunit 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones25 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
TTGTCTGCCG AAACCCTGCG CGTGACCTAC TACCTGACCT GCCGCCCCGG CGAAGATCCC 
CACGACAAGG CCAAGGGCAT CGCGCTCGAG CAGAGCGCCG AGCTGCCCTC GCGCTGCATC
CCGGAGCACG TCTACGACGA CGTGGTGCCG ACGATCCAGG AGCTAACAGC GCTGGAGGAC
GGCCGCCACC GCCTGGTTCT CGACTTCCCG GAGGCGATCA CCGGCCTCGA ACCGACCCAG
CTGATCAACA ACCTGTTCGG CAATATCTCG CTCAAGAGCG GGATCCGCCT GGCCGACGTG
GAGTGGACGC CCAACCTCCT GCGCGCCCTG GGCGGGCCGC GCTACGGGAC CGCCGGCGTA
CGCGAGATGC TCGGCATCGG CGAGCGGCCG ATCAGCTCCA CGGCGCTCAA ACCCCTGGGC
CTGGACACCG CCACGCTGGC GGGCTTCTGC GCCGACTTTG CCCGCGGCGG CATCGACCTG
ATCAAAGACG ACCACGGGCT CTGCGACCAG GACACCTCCC GCTTCGTCGA TCGCGTGCAG
GCCTGCCAGC GGGCGGTCAA CGAGGTCAAT GCCGAGACCG GCGGCCGCTC GCTCTACCTA
CCCAATGTCA CCGGCCCCCG CTGGGAGCTG GACAAGCGCC TCGACGCCGC GCAGGAGGCC
GGCTGCAAGG CGGTCCTCAT CTGCCCCTTC CTCACCGGTC TCGATGCGCT GATCTGGGCC
CGCGAACGCT ACGACATGGC CCTGATGGCC CACCCGGCCT TCGCCGGCGC GGTGGCCGGC
GCCGAGCACG GCATCGACCC CGCCCTGCTG CTCGGCGAGA TCACCCGCCT GTTCGGTGCG
GATATGGTGG TCTACACCAA CGCCGAGGGG CGCTTCCCCA CTTACGATCA GGCGCTGTGC
GACCGCATCA ACGACCGGCT GCGCCGCCCC CTGGGCGACA TCCGCCCGGC TCTGCCCACG
CCGGGCGGCG GTGTGGACGC CGCACGCGCG CCGTATTGGG CCGAGCGCTA CGGGCCCGAC
GTGGTACTGC TGATCGGTGG CAGCCTCTAC GCCCAGGGCG ATCGGGCCGC CGCCGCACGC
CGTCTGCAGG ATGTGGTAGA GGGTCAGTAA
 
Protein sequence
MSAETLRVTY YLTCRPGEDP HDKAKGIALE QSAELPSRCI PEHVYDDVVP TIQELTALED 
GRHRLVLDFP EAITGLEPTQ LINNLFGNIS LKSGIRLADV EWTPNLLRAL GGPRYGTAGV
REMLGIGERP ISSTALKPLG LDTATLAGFC ADFARGGIDL IKDDHGLCDQ DTSRFVDRVQ
ACQRAVNEVN AETGGRSLYL PNVTGPRWEL DKRLDAAQEA GCKAVLICPF LTGLDALIWA
RERYDMALMA HPAFAGAVAG AEHGIDPALL LGEITRLFGA DMVVYTNAEG RFPTYDQALC
DRINDRLRRP LGDIRPALPT PGGGVDAARA PYWAERYGPD VVLLIGGSLY AQGDRAAAAR
RLQDVVEGQ