Gene Caul_1801 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaul_1801 
Symbol 
ID5899256 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaulobacter sp. K31 
KingdomBacteria 
Replicon accessionNC_010338 
Strand
Start bp1903246 
End bp1904451 
Gene Length1206 bp 
Protein Length401 aa 
Translation table11 
GC content66% 
IMG OID641562291 
ProductRieske (2Fe-2S) domain-containing protein 
Protein accessionYP_001683428 
Protein GI167645765 
COG category[P] Inorganic ion transport and metabolism
[R] General function prediction only 
COG ID[COG4638] Phenylpropionate dioxygenase and related ring-hydroxylating dioxygenases, large terminal subunit 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value0.195025 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0204354 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGGCCCCA GTCCGCCCGC CGCCGCCTAT CTCGACGACG CGTTCTTCGA GGGCCTTTCG 
CGCTCGATGC TCGACGTCGG CGCGGCCGAG ACCCTGCCGC CGGCCTGCTA CACCGACGCG
GCCTTCTACG CCTTCGAGAA GGAGGCGCTG TTCAATCACG AATGGCTGTG CGTGGGCCGC
GTGGATTGGG TGAAGGCGCA GGGCGACTAT TTCACCACCA CGATCATTGG CGAGCCGATC
ATCGTCACCC GCAACCGCTC CGACGAGATC AAGGCGATGT CGGCCGTCTG CCAGCATCGC
GCCATGCTGG TGGCCGAAGG CCGGGGCAAC ACGCGCGGCT TCGTCTGCCC CTATCACCAC
TGGGTCTATT CGCTGAACGG CGACCTGGTG AACGCCCCGG CCATGGAGCG CACCTGCGAC
TTCGACAAGA AGTCGATCAA GCTTCCAACC TTCAAGGTCG AGGTCTGGCT GGGCTTCATC
TTCATCAATT TCGACGATGC GGCTCCCCCC TTGGCGCCCC GCCTGGAAGC CGTCGAGAGC
GCCATCGCCA ATTTCGATCT GTCGAACGCC GAGGGCCTGA CCCCGCCGAT GACCGGCCAG
TTCGCCTGGA ACTGGAAGGT GATGTTCGAG AACAACAACG ACGGCTACCA CGCCAACAAG
CTGCACCGCG GTCCGTTGCA CGATTTCATT CCCAGCGAGC TGTGCAGCTT CCCGGACGCC
GCCGACGGCG ACGCGGGCTT CCTGCGCTTC AACGGCACGC TGCATCCCGA CGCCAGCTTC
AATCCGACCC AGAAGGCGGT GCTGCCGATC TTCCCGAAGC TGACGGACGA GGACCGCAAC
CGCGCCACCT TCGCCAACAT CCCGCCGACG CTGTCGCTGG TGATGACCAG CGACATGGTC
ATCTATCTGA TCCTGCGCCC CACCGGTCCG GAGACCATGG AGCAGGACAC CGGCGTCCTG
GTCGCGCCCG GCGCCACCGA GATTCCCGGC TTCGACGAGC GGCTGGAGAT GATCATGACC
TCCGCCGGCA AGATCATCGC CCAGGACATG CATGTCGACG AACTGGTCCA GGTGGGCCTC
CGCTCGCGGT TCGCGGTGCG GGGTCGCTAC TCCTGGCAGG AGGGGGCGCA GGTGCAGTTC
AACCGCTGGC TCACCCCGCG CTACCAGAAA GCCTGGGCGG CGATGAGCAA GGGAGCCGCC
GCATGA
 
Protein sequence
MGPSPPAAAY LDDAFFEGLS RSMLDVGAAE TLPPACYTDA AFYAFEKEAL FNHEWLCVGR 
VDWVKAQGDY FTTTIIGEPI IVTRNRSDEI KAMSAVCQHR AMLVAEGRGN TRGFVCPYHH
WVYSLNGDLV NAPAMERTCD FDKKSIKLPT FKVEVWLGFI FINFDDAAPP LAPRLEAVES
AIANFDLSNA EGLTPPMTGQ FAWNWKVMFE NNNDGYHANK LHRGPLHDFI PSELCSFPDA
ADGDAGFLRF NGTLHPDASF NPTQKAVLPI FPKLTDEDRN RATFANIPPT LSLVMTSDMV
IYLILRPTGP ETMEQDTGVL VAPGATEIPG FDERLEMIMT SAGKIIAQDM HVDELVQVGL
RSRFAVRGRY SWQEGAQVQF NRWLTPRYQK AWAAMSKGAA A