Gene Rsph17029_3659 details


Gene Information

Locus tag: Rsph17029_3659
Symbol:
ID: 4898307
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodobacter sphaeroides ATCC 17029
Kingdom: Bacteria
Replicon accession: NC_009050
Strand:
Start bp: 760203
End bp: 761246
Gene Length: 1044 bp
Protein Length: 347 aa
Translation table: 11
GC content: 71%
IMG OID: 640114267
Product: carbonic anhydrase
Protein accession: YP_001045521
Protein GI: 126464408
COG category: [R] General function prediction only
COG ID: [COG0663] Carbonic anhydrases/acetyltransferases, isoleucine patch superfamily
TIGRFAM ID:
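
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the gene length counts the terminal stop codon. The record does not state either convention, although the reported numbers are consistent with both. The function names are illustrative and not part of any database API.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 760203
end_bp = 761246
gene_length_bp = 1044
protein_length_aa = 347


def span_length(start: int, end: int) -> int:
    """Length of a coordinate range, assuming 1-based inclusive positions."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Number of codons in the CDS minus one, assuming a terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Both comparisons hold for the values in this record.
print(span_length(start_bp, end_bp) == gene_length_bp)
print(expected_protein_length(gene_length_bp) == protein_length_aa)
```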


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0725502
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 12
Fosmid unclonability p-value: 0.181373
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCATGAGA AATGTCCGCC CCTGATCGCC GCCTACCACG GCATGTCGCC GGAGTTCGCG 
GGCGAGCCCG CCTTTGCGGG CGCGGGCGCC GCCGTCCTCG GGCGCGCGAG GCTCGGCCGG
GGCCTCTGGC TCGGCGCGCG CTCCGTGATC CGGGCCGACG GGCATCACAT CCATGTCGGC
GACGACTTCC ACCTCGGCGA GGGGGCGACC GTCCATATTG CGCACGATGT CTATCCGACG
CATGTCGGCC AGAATGTCAC CGCCGGCAAG GGTGCGGTGA TCCATGCCTG CACCATCGGC
GACAATTGCG TGATCGAGCG GGGGGCCGTC ATTCTCGACG GGTCCGAAGT GGCCGACGGA
GTGGTCGTGA CGGCGGGATC GGTGGTCTTT CCGCGCTCGA AGCTCGAGGC GGGCTGGCTC
TATTCCGGCA GCCCGGCGCA GCGCGTGGCC CGCGTCTCGG CCTCCGAGCT CGCCTCCTAC
CATCAGCAGA CGCGGAACGA CCTCTCCTCC GGAAAGGCCG GCCCGGCGGG CGACGGGGCA
GGTCGGGGCC ATGTCTTCGT GGCCCCCACG GCGACCTTGG CCGGCCGCGT CACCATGGAG
GAGGGGGTGG GCGTCTGGTA TGGCTGCCGG CTCGAGGCGG GCAGCCACGA GATCCGCATC
GGCGAGGGCA CCAACGTGCA GGACAACAGC ACGATCCTCT GCGAGACGCG GGACGTGGAG
ATCGGCCCCG ACGTCACCAT CGGCCACAAT GTCCTCCTCG TCGATTGCCG GGTCGAGCGG
GCGAGCCTCG TCGGCATCGG CTCGCGCATC GCGGCCGGCA CCGTGATCGA GAGCGACGTG
CTCGTCGCGG CCGGCACCGA GACCGAACCC GACCAGCGCC TCACCGGCGG CAAGGTCTGG
GCGGGGCGGC CCGCGCGCCC GATCGCCGAC ATGACCGACG CGCGCCGCGG CATGCTGGCC
GCGACCCTGC CCATGTATCG CGACTATGCC ACGCAGTTCG CCGGCACGTC GCACCAGCCG
ATGCTTCAGC CCGGGGAGGA GTGA
 
Protein sequence
MHEKCPPLIA AYHGMSPEFA GEPAFAGAGA AVLGRARLGR GLWLGARSVI RADGHHIHVG 
DDFHLGEGAT VHIAHDVYPT HVGQNVTAGK GAVIHACTIG DNCVIERGAV ILDGSEVADG
VVVTAGSVVF PRSKLEAGWL YSGSPAQRVA RVSASELASY HQQTRNDLSS GKAGPAGDGA
GRGHVFVAPT ATLAGRVTME EGVGVWYGCR LEAGSHEIRI GEGTNVQDNS TILCETRDVE
IGPDVTIGHN VLLVDCRVER ASLVGIGSRI AAGTVIESDV LVAAGTETEP DQRLTGGKVW
AGRPARPIAD MTDARRGMLA ATLPMYRDYA TQFAGTSHQP MLQPGEE
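
The sequences above are printed in space-separated groups of ten residues, so they need to be joined before any analysis. The following is a minimal sketch of that cleanup, together with a GC-fraction helper that could be used to compare against the reported GC content. The helper names `clean_sequence` and `gc_fraction` are hypothetical, and only the first line of the gene sequence is used as sample input.

```python
def clean_sequence(text: str) -> str:
    """Join a sequence printed in whitespace-separated blocks into one string."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence (0.0 for empty input)."""
    if not seq:
        return 0.0
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First line of the gene sequence from this record, as displayed.
first_line = (
    "ATGCATGAGA AATGTCCGCC CCTGATCGCC "
    "GCCTACCACG GCATGTCGCC GGAGTTCGCG"
)

joined = clean_sequence(first_line)
print(len(joined))               # six groups of ten bases
print(joined.startswith("ATG"))  # the CDS begins with a start codon
print(round(gc_fraction(joined), 3))
```

Passing the full gene sequence through `clean_sequence` should give a string whose length matches the Gene Length field, which is a quick way to detect a truncated copy.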