Gene Caul_0702 details


Gene Information

Locus tag: Caul_0702
Symbol:
ID: 5898157
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Caulobacter sp. K31
Kingdom: Bacteria
Replicon accession: NC_010338
Strand:
Start bp: 762540
End bp: 763859
Gene Length: 1320 bp
Protein Length: 439 aa
Translation table: 11
GC content: 64%
IMG OID: 641561184
Product: cytochrome b/b6 domain-containing protein
Protein accession: YP_001682333
Protein GI: 167644670
COG category: [C] Energy production and conversion
COG ID: [COG1290] Cytochrome b subunit of the bc complex
TIGRFAM ID:
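
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only and is not part of the record: it assumes the start and end positions are 1-based and inclusive, and that the coding sequence ends with a stop codon that is not counted in the protein length. The function names are hypothetical.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int, has_stop_codon: bool = True) -> int:
    """Number of amino acids encoded by an unspliced CDS of the given length."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_len_bp // 3
    # The terminal stop codon does not contribute an amino acid.
    return codons - 1 if has_stop_codon else codons


length = gene_length_bp(762540, 763859)
print(length)                     # 1320
print(protein_length_aa(length))  # 439
```

Under those assumptions, both results agree with the Gene Length and Protein Length fields.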


Plasmid Coverage information

Num covering plasmid clones: 20
Plasmid unclonability p-value: 0.0895628
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 15
Fosmid unclonability p-value: 0.934127
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGCGGAC ATTCGACCTA CGAACCCAAG ACCGGTATCG AGCGCTGGCT CGACGCTCGT 
CTGCCGATCG TGCGCCTGGG CTATGACAGC TTCGTCGACT ACCCCACGCC GCGGAACCTG
AACTACTGGT GGACCTTCGG CGGCATCCTG TCGCTGTGCC TGGCCTCGCA GTTGATCACC
GGCATCATCC TGGTGATGCA CTACACCCCC AGCGCCGACC ACGCCTTCGC CTCCGTCGAG
CACATCATGC GCGACGTGAA TTACGGCTGG CTGATCCGCT ACATGCACGC CAACGGCGCG
TCGATGTTCT TCATCGCCGT CTATATCCAC ATGCTGCGCG GCCTCTACTA CGGCTCCTAC
AAGGCGCCCC GCGAAGTGCT GTGGCTGCTG GGCTGCGTGA TCTACCTGCT GATGATGGCC
ACCGCCTTCA TGGGCTACGT GCTGCCCTGG GGCCAGATGA GCTTCCACGG CGCGGTCGTG
ATTACCAACC TGTTCGGCGC CCTTCCGGTC GTCGGTCCGG CGATCACCAC CTGGCTGTGG
GGCGGCTTCG CGGTCGATAA CCCCACCCTC AACCGCTTCT TCTCGCTGCA TTACCTGCTG
CCCTTCATGA TCGCCGGCGT GGTGATCCTG CACATCTGGG CCCTGCACGT GGTCGGCCAG
AACAACCCGA CCGGCGTCGA GCCGAAGTCG AAGGCCGATG TCCTGCCCTT CACCCCGTAT
GCGACGGTGA AGGACGGCTT CGCGATGAGC ATCTTCCTGA TCCTTTTCGC CTTCTTCGTC
TTCTTCATGC CCAACGCCCT GGGTCACGCC GACAACTACA TCCCGGCCAA CCCGCTGGTG
ACGCCGTCGC ACATCGTTCC GGAATGGTAC TTCCTGCCGT TCTACGCGAT CCTGCGCGCC
GTTCCGGACA AGCTGATGGG CGTGCTGGCC ATGTTCGGCG CCATCGCCTG CCTGTTCGCC
CTGCCGTGGC TGGACCGGTC GAAGGTGCGC TCGATGCGCT ATCGCCCGAC CGCGAAGATC
CACTTCTTCA TCTTCGTGGT GGCCTGCTGC GTCCTGGGCG TCTGCGGCGC CAAGCTGCCC
GACGATCCGG TCATCCCGGG CCTGACCACC TTCCAGCTGT TCGATTCGGA CCTGAACAGC
TTCGTCTGGC TCAGCCGGGT GGCCGCGCTG TACTACTTCG CTTTCTTTGT CGTCGTGATG
CCGTTCCTGC CGCTGTCGGA AAAGACCCTG CCGGTGCCGG ACTCCATCGC GTCTCCCGCC
CTGTCGCACC CGGCCGCCGC GCCGGCCCAA GCGACCGCCG CGCCTGAAAT GAAGGGCTGA
 
Protein sequence
MSGHSTYEPK TGIERWLDAR LPIVRLGYDS FVDYPTPRNL NYWWTFGGIL SLCLASQLIT 
GIILVMHYTP SADHAFASVE HIMRDVNYGW LIRYMHANGA SMFFIAVYIH MLRGLYYGSY
KAPREVLWLL GCVIYLLMMA TAFMGYVLPW GQMSFHGAVV ITNLFGALPV VGPAITTWLW
GGFAVDNPTL NRFFSLHYLL PFMIAGVVIL HIWALHVVGQ NNPTGVEPKS KADVLPFTPY
ATVKDGFAMS IFLILFAFFV FFMPNALGHA DNYIPANPLV TPSHIVPEWY FLPFYAILRA
VPDKLMGVLA MFGAIACLFA LPWLDRSKVR SMRYRPTAKI HFFIFVVACC VLGVCGAKLP
DDPVIPGLTT FQLFDSDLNS FVWLSRVAAL YYFAFFVVVM PFLPLSEKTL PVPDSIASPA
LSHPAAAPAQ ATAAPEMKG
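
The sequences above are printed in blocks of ten characters, so they need to be joined before any analysis. The following is a minimal sketch, not the database's own tooling: it strips the block spacing, computes GC percentage, and translates a coding sequence. The helper names are hypothetical. The codon assignments are those of the standard genetic code, which translation table 11 shares for elongation codons; alternative start codons are not handled, an assumption that is harmless here because this gene begins with ATG.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order ("*" marks a stop codon).
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean_sequence(raw: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(raw.split()).upper()


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in a cleaned DNA sequence."""
    if not dna:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


def translate(dna: str) -> str:
    """Translate a cleaned CDS codon by codon, stopping at the first stop codon."""
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three blocks of the gene sequence listed above.
first_blocks = clean_sequence("ATGAGCGGAC ATTCGACCTA CGAACCCAAG")
print(translate(first_blocks))  # MSGHSTYEPK
```

The translated prefix matches the first block of the protein sequence. Applying `gc_percent` and `translate` to the full cleaned gene sequence gives values that can be compared against the GC content field and the listed protein sequence.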