Gene Caul_4678 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaul_4678 
Symbol 
ID5902140 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaulobacter sp. K31 
KingdomBacteria 
Replicon accessionNC_010338 
Strand
Start bp5056542 
End bp5058710 
Gene Length2169 bp 
Protein Length722 aa 
Translation table11 
GC content68% 
IMG OID641565197 
Productendothelin-converting protein 1 
Protein accessionYP_001686296 
Protein GI167648633 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG3590] Predicted metalloendopeptidase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones33 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value0.44527 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCCAAGT CGCGGTTCAG TTTTTCAACG GATCGCCACT TCATGAAACG AGTCTGGTTC 
GCCGCCGCCG CCATGGCCGC GCTGTCCTTG TCGGCCTTCG GCGCCCAGGC GCGCGAAGAC
CACGACCACG CCTGCCTCAA CGACGCCTGC ACGATGCAGT CGCTGTTCGT CGCCGCCGAC
ACCCCGGCGG CCGGCGACGC GGCCATGTCG CTGGACTCGC CCCGCTACGG AACCTGGGGT
TTCGACGCGG CGGGCATGGA CCGTTCGGTC AAGCCGGGCG ACGACTTCTA CAAGTTCGCC
AACGGGACCT GGGACGCCAA CACCGTCATC CCCAGCGACC GCACCCGCTA CGGCAATTTC
GACAAGCTGG CCGAACTGTC CGAGGCCCGC ACCAAGGCGA TCATCCTCGA GGCCGCCGCC
AGCGCCGGGG CCGACCCCGA CACCGTCAAG ATCGGCGCGG CCTACAAGGC CTTCATGGAC
GAGGCCCTGG CTGAAAAGCT GGACGCCAAG CCGATCGCGC CGGAGCTGGC TGGCATCCGC
AAGGTCAAGA CCAGGGACGA TTTCACGGCC CTGATGGGCA AGAACCCCAC CACGGGCTAT
GCGGCGATCC TGGGCCTGAA CATCACCCCC GACGCCAAGA ACCCGACCCG CTACGCCGTC
TACGCCTCGA CCGGCGGCCT CAGCCTGCCC GACCGCGACT ACTATCTCGA CGCCAAGTTC
GCCGAGAAGA AGACCGCCTA CGAGGCCTAT GTCGCCCAGA TGCTGACGAT GATCGGCTGG
GACAAGCCGG CCGAAAGCGC CAAGGCCGTC GTCGCCTTCG AGACCCGGAT GGCCGAGGCC
ACCTGGACCC GCGCCGCGCG CCGGGATCGC GACAAGACCT ACAACCCGAT GAGCCTGACC
GAACTTCAGG CCCTGACCCC GGGCTTCGCC TGGAACCGCT ATCTGGTCGG CACGGAACTG
CCCAAGATCG ACCGCGTGGT GGTGACCACC AACACCGCCT TCCCGGCCTT CGCCAAGATC
TATGCCGACA CCCCGCTGGA CACCCTGAAG GCCTGGCAGG CGTTCAAGGT GGCCGATGGC
GCCGCGCCGA TGCTGTCCAA GCGCTTCGTC GATGCTGCTT ACCAATTCCG CAACAAGACC
CTGGCCGGCC AGCCCGAGCA GAAGCCCCGC TGGAAGCGCG GCGTCGCGGC GGTCAACGGC
GAGCTGGGCG AGGCGGTCGG CCGCGTCTAT GTGGCGCGCT ACTTCCCGCC GGACTCCAAG
GCCAAGATGG TCGACCTGGT CGGCAACATC CGCGCGGTCC TCAAGACCCG CCTGGACAGC
CTCGACTGGA TGTCGCCGGA GACCAAGACC CAGGCCCAGG CCAAGCTGGC CCAGTTCACC
GTCAAGATCG GCTATCCCGA CACGTGGCGC GACTATTCCA AGCTGGAGAT CAAGGCCGAC
GACGTCTACG GCAACGCCAT CCGCTCGGGC GCCTTCGAGT GGCGCCATGA TGTCGAGCGC
CTGAACGGTC CGGTCGACAA GAGCGAGTGG GGCATGACCC CGCAGACGGT CAACGCCTAC
TACAACTCGG TCAATAACGA GATCGTCTTC CCCGCCGCCA TCCTGCAGGC CCCGTTCTTC
CATCCGGACG CCGATCCGGC CGTGAACTAC GGCGGCATCG GCGGGGTGAT CGGCCACGAG
ATCAGCCACG GCTTCGACGA CCAGGGCCGC AAGTCGGACG GCCTGGGGGT GCTGCGCGAC
TGGTGGACCG CGCAGGACGC GGCCAAGTTC AAGGCCCAGG CCGACAAGCT GGGCGCCCAG
TACGGCGCGT TCGAGCCGCT GCCCGGCGCC AAGGTCAACG GCCAGCTGAC CATGGGCGAG
AACATCGGCG ACATGGGCGG CCTGGCCTTC GCCCTGCAGG CCTATCGCGT CTCGCTGGGC
GGCAAGCCGG CCCCGGTGAT CGACGGCTTC ACCGGCGACC AGCGGGTCTA TCTCGGCTGG
GCCCAGGTGT GGCGCTCGAA GATCCGCGAC GACGCCCTGC GCCAGCAGGT GGTCAGCGAC
CCCCACTCGC CGGCCTATTA CCGCGTCAAC GGCACGATCC GGAACCAGGA CGGCTGGTAC
GGCGCCTTCG ACGTGGCGCC GGGCGACAAG CTGTACGTCG CGCCGGAGGA CCGGGTTCGG
ATCTGGTAG
 
Protein sequence
MAKSRFSFST DRHFMKRVWF AAAAMAALSL SAFGAQARED HDHACLNDAC TMQSLFVAAD 
TPAAGDAAMS LDSPRYGTWG FDAAGMDRSV KPGDDFYKFA NGTWDANTVI PSDRTRYGNF
DKLAELSEAR TKAIILEAAA SAGADPDTVK IGAAYKAFMD EALAEKLDAK PIAPELAGIR
KVKTRDDFTA LMGKNPTTGY AAILGLNITP DAKNPTRYAV YASTGGLSLP DRDYYLDAKF
AEKKTAYEAY VAQMLTMIGW DKPAESAKAV VAFETRMAEA TWTRAARRDR DKTYNPMSLT
ELQALTPGFA WNRYLVGTEL PKIDRVVVTT NTAFPAFAKI YADTPLDTLK AWQAFKVADG
AAPMLSKRFV DAAYQFRNKT LAGQPEQKPR WKRGVAAVNG ELGEAVGRVY VARYFPPDSK
AKMVDLVGNI RAVLKTRLDS LDWMSPETKT QAQAKLAQFT VKIGYPDTWR DYSKLEIKAD
DVYGNAIRSG AFEWRHDVER LNGPVDKSEW GMTPQTVNAY YNSVNNEIVF PAAILQAPFF
HPDADPAVNY GGIGGVIGHE ISHGFDDQGR KSDGLGVLRD WWTAQDAAKF KAQADKLGAQ
YGAFEPLPGA KVNGQLTMGE NIGDMGGLAF ALQAYRVSLG GKPAPVIDGF TGDQRVYLGW
AQVWRSKIRD DALRQQVVSD PHSPAYYRVN GTIRNQDGWY GAFDVAPGDK LYVAPEDRVR
IW