Gene Caul_3891 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaul_3891 
Symbol 
ID5901353 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaulobacter sp. K31 
KingdomBacteria 
Replicon accessionNC_010338 
Strand
Start bp4210804 
End bp4212780 
Gene Length1977 bp 
Protein Length658 aa 
Translation table11 
GC content71% 
IMG OID641564412 
Productcytochrome c-type biogenesis protein CcmF 
Protein accessionYP_001685514 
Protein GI167647851 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG1138] Cytochrome c biogenesis factor 
TIGRFAM ID[TIGR00353] c-type cytochrome biogenesis protein CcmF 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value0.109986 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value0.46196 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATCAACG AATTTGGCGC CTACGCGCTG GTCCTGGCCC TGGTGCTGTC GGCGCTGCAG 
ATGACCCTGT CGGCGATTGG CGGCGCGCGG CGGTCGTCGA TGCTGGCCGG GGCAGGACGG
GGGGCGGCGG TCGGCGCCTT CCTGTGTACG GCCCTGACCT TCCTGGCCCT GATCCACGCC
TTCGTGGTCT CGGACTTCTC GGTGGCCAAC GTCGCGGCCA ACTCCCACAC CGCCAAGCCG
ATGCTCTACA AGGTGGCCGG CGCCTGGGGC AGCCACGAGG GCTCGATGCT GCTGTGGTGC
CTGGTGCTGA CCGGCTACGG CGCGGCGATC GCGACTTTCG GCGAGGCCCT GCCGCCGCGC
CTGCGGGCCT TTGCCGTCGC CGTCCAGGGC GCGCTGGGCG TGCTGTTCCT GGCCTACACC
GTCTTCGCCT CCAACCCGAT GGCCCGACTG GTCGACGTAC CGGTCGAGGG CGCGTCGCTG
AACCCCCTGC TGCAGGACCC GGCGCTCGCC TTCCACCCGC CGTTCCTCTA CTGCGGCTAT
GTCGGCTTCT CGGTGGTGTT CTCGTTCGCC ATGGCCGCCC TGATCGAGGG CCGGGTCGAC
GCGGCCTGGG CCCGCTGGGT GCGGCCCTGG ACCCTGGCGG CCTGGAGCTT CCTGACCGTC
GGCATCACGC TCGGCGCCTT CTGGGCCTAT TACGAACTGG GCTGGGGCGG CTGGTGGTTC
TGGGACCCGG TCGAGAACGC CAGCTTCATG CCCTGGCTGA TCGGCACGGC CTTGCTGCAC
TCGGCCATCG TGCTGGAAAA GCGCGGCGCC CTGCCGGGCT GGACGGTGTT CCTGGGCCTG
GCGGCCTTCA CCTTCTCGAT GCTGGGCGCC TTCCTGGTGC GCTCCGGCGT GCTGACCAGC
GTCCACGCCT TCGCCGTCGA TCCGACGCGC GGGGTTCTGT TGCTGAGCAT CATGGGCCTG
GCGGCGGGGG CCGGCTTCGT GCTGTTCGCC CTGCGCGCGC CCAGCCTGAA CGCCGGCGGT
CTGTTCGCGC CGGTCAGCCG CGAGAGCGCC ATCGTCCTCA ACAACATCAT CCTGTCAACG
GCCACGGCGG TGGTGCTGGT CGGCACCCTG TTCCCGCTGA TCCGCGAGGC CGTGGACGGC
GAGGCGGTGT CGGTGGGGCC GCCGTTCTTC AACATGACCT TCGTGCCGCT GATGGTGCTG
GCCATGGCCG TCCTGCCGGC GGGGCCGCTG CTGGCCTGGA AGCGCGGCGA CGCCCGGCTG
GTGGTCCGCC GCCTGTGGCT GGCCCTGGCC TGCGCCGCGA TCCTGGGCCT GGTCGCCTTC
GCCGTCGTCG CGCCGCGCAG CGCCCTGGCC AGCGCCGGTC TGGCGCTGGG CTTCTGGCTG
ATCGGCGGGG CTCTGCTGGA ACTGGCCGAG CGCGTGAAGG CCTTCCGCGC GCCCTGGGCC
GAGGTGCGTC GCCGGACGAC CGGCCTGCCG CGCGGCGCCT GGGGAACCAC CCTGGCCCAT
GCGGGCCTGG GCCTCTTCGT GCTGGGCGCA TCGTTCGAGA CCACCTGGCG GGTCGAAGCC
GCCCAGGCCC TGGGCGTCGG CGGTCAGCTG AAACTGGGCG CCTACGAGTT GACCCTGACC
GACGTCGGGA CGGTGGAGGG CTCCAACTAC ATCGCCGAGC GAGGGATCGT GAAGATTACC
AAGGCCGGCG CTCCTGTCTG CGAAGCCAAG CCCGAGCGCC GCTTCTATCC GACCGGCAGG
CAGACCACCT CGGAAGTGGC GATCTGCCCG AAGATCCTGG ACGACCTCTA CATCGTGCTC
GGCGAGCGCC GGGCGGGCGA GGGCGGCCAG CCGGCCTGGC TGGTGCGGGC CTTCGTCAAT
CCGTGGGTGC GGCTGATCTT CCTGGGGCCG CTGGTCATGG CGATCGGCGG ACTGGTCTCT
CTGTCGGACC GGCGTCTGCG GTTCGGGGTG GGCAAGCGCG CGGAGACGGC GGCGTGA
 
Protein sequence
MINEFGAYAL VLALVLSALQ MTLSAIGGAR RSSMLAGAGR GAAVGAFLCT ALTFLALIHA 
FVVSDFSVAN VAANSHTAKP MLYKVAGAWG SHEGSMLLWC LVLTGYGAAI ATFGEALPPR
LRAFAVAVQG ALGVLFLAYT VFASNPMARL VDVPVEGASL NPLLQDPALA FHPPFLYCGY
VGFSVVFSFA MAALIEGRVD AAWARWVRPW TLAAWSFLTV GITLGAFWAY YELGWGGWWF
WDPVENASFM PWLIGTALLH SAIVLEKRGA LPGWTVFLGL AAFTFSMLGA FLVRSGVLTS
VHAFAVDPTR GVLLLSIMGL AAGAGFVLFA LRAPSLNAGG LFAPVSRESA IVLNNIILST
ATAVVLVGTL FPLIREAVDG EAVSVGPPFF NMTFVPLMVL AMAVLPAGPL LAWKRGDARL
VVRRLWLALA CAAILGLVAF AVVAPRSALA SAGLALGFWL IGGALLELAE RVKAFRAPWA
EVRRRTTGLP RGAWGTTLAH AGLGLFVLGA SFETTWRVEA AQALGVGGQL KLGAYELTLT
DVGTVEGSNY IAERGIVKIT KAGAPVCEAK PERRFYPTGR QTTSEVAICP KILDDLYIVL
GERRAGEGGQ PAWLVRAFVN PWVRLIFLGP LVMAIGGLVS LSDRRLRFGV GKRAETAA