Gene Caul_3267 details


Gene Information

Locus tag: Caul_3267
Symbol:
ID: 5900722
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Caulobacter sp. K31
Kingdom: Bacteria
Replicon accession: NC_010338
Strand:
Start bp: 3530387
End bp: 3532045
Gene Length: 1659 bp
Protein Length: 552 aa
Translation table: 11
GC content: 67%
IMG OID: 641563772
Product: nitrite/sulfite reductase hemoprotein beta subunit
Protein accession: YP_001684892
Protein GI: 167647229
COG category: [P] Inorganic ion transport and metabolism
COG ID: [COG0155] Sulfite reductase, beta subunit (hemoprotein)
TIGRFAM ID:
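
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is only an illustration: it assumes the start and end positions are 1-based and inclusive, and that the gene length includes the stop codon. The record does not state either convention, and the variable names are made up for this example.

```python
# Values copied from the gene record; variable names are illustrative.
start_bp = 3530387
end_bp = 3532045
gene_length_bp = 1659
protein_length_aa = 552

# Assumption: coordinates are 1-based and inclusive, so span = end - start + 1.
span = end_bp - start_bp + 1

# Assumption: the gene length includes the stop codon, which is not translated.
codons = gene_length_bp // 3

print(span == gene_length_bp)            # True
print(gene_length_bp % 3 == 0)           # True
print(codons - 1 == protein_length_aa)   # True
```

Under these assumptions the three fields agree: 1659 bp is 553 codons, one of which is the stop codon, leaving 552 amino acids.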


Plasmid Coverage information

Num covering plasmid clones: 26
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 17
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTACCTGT ATGACGCCAT CGACCGCGAG TTCCTGACCG ACCGCTCCAA CGAGTTCCGC 
CACCAGGTGG CTCGCCGCCT GGCCGGCGAA CTGACCGAAG ACCAGTTCAA GCCGCTGCGG
CTGATGAACG GCCTCTACCT GCAGCTCCAC GCCTACATGC TGCGGGTCGC CGTGCCATAC
GGCGCGCTGA ACAGCCGCCA GGTTCGCCGC CTGGCCTGGG TGGCCAAGAC CTACGACAAG
GGCTACGGCC ACTTCACCAC CCGCCAAAAT CTTCAGCTGC ACTGGATCAA GCTGAAGGAC
GTGCCCGACA TCCTCGACGC CCTGGCCGAG GTCGACCTGC ACGCCATCCA GACCAGCGGG
AACTGCATTC GCAACGTCAC CGCCGATCCC TATGCCGGCG CCACGGCCGA GGAGATCGAC
GACCCGCGCA TCTGGTCCGA GGTGCTGCGC CAGTGGTCGA CCCTGCACCC CGAATTCTCG
TTCCTGCCGC GCAAGTTCAA GTTCGCCATC ACCGGAGCCT TGAAGGATCG CACGGCCGCC
AAGGTGCACG ACGTCGGCCT GATCCTGCGC AAGAACCGCG ATGGCGCGCT GGGCTTCGAG
GTGATCGTCG GCGGCGGCCA GGGGCGCACG CCCTATGTCG GCCCGACGAT CCGCCAGTTC
CTGCCGGCCG AGCACTTGCT CAGCTATGTC GAGGCGATCC TGCGCGTCTA CAACCGCCAC
GGCCGCCGCG ACAACATCTA CAAGGCCCGC ATCAAGATCC TGGTCGCCGC CCTGGGGGCC
GAGGCGTTCG CGCGCCAGGT CGAGGAGGAG TGGGGCAAGA TCGACCTGGC CCGCGCCGAC
CTGCCGGCCA CGGAACTGGC CCGGATCCGC GCCGCGTTCG CCGAGACGAA GTTCGAGACC
TTGCCGGAGA TTTCGGAAGT CCTGGAGGCC GCCCGCGCCG CCAGCCCGGC CCTGCGGCGC
TTCGTCCGCA ACAACGTCAA GCCGCACAAG CAAAGCGGCT ACGCCATCGT CGAGGTGTCG
TTGAAGGCCA TCGGCCAGAC GCCGGGCGAC GCCACGGCCG AGCAACTGGA GGTGGTCGCC
GATCTGGCCG AGCGCTACAG CCTGGACGAC CTGCGCGTGA CCCACGCCCA GAACCTGGTC
CTGCCGCACG TCAAGCTGGA CGACCTGCCG GTGGTCCATG CGATTCTGGA AAAGCACGGC
CTGGCCACCG CCAATATCGA CCTGGCCAGC GACATTATCG CTTGCCCGGG CCTGGACTAC
TGCGCCTTGG CGAATGCGCG CGCCATCCCG ATCGCCCAGG ACATCGCCCG CAAGTTCGCC
GACCCGGACC GGGCCGAGAA GGTGGGCGAG CTGAAGATCA AGATCAGCGG CTGCATCAAC
GCCTGCGGCC ACCACCATGT GGCTCACATC GGCATCCTGG GCGTCGACAA GAAGGGCGAG
GAATTCTACC AGCTCTCGCT CGGCGGATCG GGGGCGGAGG ACGCCAGCAT CGGCAAGATC
CTCGGCCCGG GCCTGTCGGC CGACAAGGTC GCGCCGGCCA TCGATTCCCT GGTCGAAGCC
TATCTGCGGA TCCGCACGGG CGAGGAGCGC TTCCTCGACA CCTATCGCCG CGTCGGTCTC
GAACCCTTCA AGGAGGCGGT CTATGCCCAG GCTGATTAA
 
Protein sequence
MYLYDAIDRE FLTDRSNEFR HQVARRLAGE LTEDQFKPLR LMNGLYLQLH AYMLRVAVPY 
GALNSRQVRR LAWVAKTYDK GYGHFTTRQN LQLHWIKLKD VPDILDALAE VDLHAIQTSG
NCIRNVTADP YAGATAEEID DPRIWSEVLR QWSTLHPEFS FLPRKFKFAI TGALKDRTAA
KVHDVGLILR KNRDGALGFE VIVGGGQGRT PYVGPTIRQF LPAEHLLSYV EAILRVYNRH
GRRDNIYKAR IKILVAALGA EAFARQVEEE WGKIDLARAD LPATELARIR AAFAETKFET
LPEISEVLEA ARAASPALRR FVRNNVKPHK QSGYAIVEVS LKAIGQTPGD ATAEQLEVVA
DLAERYSLDD LRVTHAQNLV LPHVKLDDLP VVHAILEKHG LATANIDLAS DIIACPGLDY
CALANARAIP IAQDIARKFA DPDRAEKVGE LKIKISGCIN ACGHHHVAHI GILGVDKKGE
EFYQLSLGGS GAEDASIGKI LGPGLSADKV APAIDSLVEA YLRIRTGEER FLDTYRRVGL
EPFKEAVYAQ AD
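
The gene and protein sequences above can be compared by translating the nucleotide sequence codon by codon. The following is a minimal sketch, not the database's own method. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code (the two differ only in permitted start codons). The function name `translate` and the helper constants are hypothetical, and only the first and last lines of the gene sequence are used to keep the example short.

```python
from itertools import product

# Codon-to-amino-acid mapping shared by the standard code and table 11,
# listed in TCAG order with the third base varying fastest.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string in frame 0, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the spaces between 10-base groups
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First and last lines of the gene sequence, copied from the record.
first_line = "ATGTACCTGT ATGACGCCAT CGACCGCGAG TTCCTGACCG ACCGCTCCAA CGAGTTCCGC"
last_line = "GAACCCTTCA AGGAGGCGGT CTATGCCCAG GCTGATTAA"

print(translate(first_line))  # MYLYDAIDREFLTDRSNEFR
print(translate(last_line))   # EPFKEAVYAQAD
```

Both results match the start and end of the listed protein sequence. The last line can be translated on its own because the preceding 27 lines hold 1620 bases, a multiple of three, so it begins on a codon boundary.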