Gene Caul_3441 details

Gene Information

Locus tag: Caul_3441
Symbol:
ID: 5900896
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Caulobacter sp. K31
Kingdom: Bacteria
Replicon accession: NC_010338
Strand:
Start bp: 3722754
End bp: 3724757
Gene Length: 2004 bp
Protein Length: 667 aa
Translation table: 11
GC content: 68%
IMG OID: 641563947
Product: endothelin-converting protein 1
Protein accession: YP_001685066
Protein GI: 167647403
COG category: [O] Posttranslational modification, protein turnover, chaperones
COG ID: [COG3590] Predicted metalloendopeptidase
TIGRFAM ID: [TIGR01435] glutamate--cysteine ligase/gamma-glutamylcysteine synthetase, Streptococcus agalactiae type
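
The coordinates and lengths listed above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the start and end positions are inclusive and that the CDS ends in a single stop codon; the record does not state either point explicitly, although the listed values fit both. The function names are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming both coordinates are inclusive."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(3722754, 3724757)
print(length_bp, protein_length(length_bp))  # prints "2004 667"
```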


Plasmid Coverage information

Num covering plasmid clones: 25
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 16
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGATGCGCA TTTTCCTGCT GGCCGCCGTC TCGGCGGTCG CCCTTTCTTC CACGGCCCTC 
GCGGCCGATC CGGCCGCCAA GCCGACCTAC GGCGCCTGGG GCGTCGACCT GACGGCTCGC
GACACCAGCG TCTCGCCGGG GACCAATTTC GACAAGTACG CCAACGGCGC CTGGATGGCC
CGCACCGAGA TCCCCGGCGA CCAGGGCAGC GCCGGCGTCG GCAACGACGT CTACAACCGC
GCCCAGGACC AACTTCGCAC CCTGATCGAG ACGGCCGACG GCGCGACCCA GATCGGCGCC
CTCTACAAGA GCTTCAGCAA CGAGGCCCAG GTCGAGCAGA TCGACGACAA GCCGCTGAAG
GTCGACCTGG CCCAGATCGC CGCGATCAAG ACCAAGGCCC AGTTCGCCGA CAGCCTGGCC
CGCGCCCACG GCGGCTTCGG CGCCGACCTG TTCTCGCTGG ACATCTATCC CGACGCCAAG
AACCCCGAGC TGAACGCCCT CTATATCGGC CAGTCGGGCC TGGGCCTGCC GGACCGCGAC
TACTACCTGA CCGAAACCTT CAAGCCGCAG CTGGCCGCCT ATACCGCCTT CATCGAGCGG
TCGCTGAAGG CCGCCGGCTA TGCCGACCCG GCCAAGGCCG CCGCCGACGT CGTGGCTTTC
GAGACCGCGA TCGCCAAGAT CAGCTGGGAG GTCGCCGAGC GCCGCGACAT CGACAAGACC
TACAACCCCG TCACCCTCGC CGAACTGACC GCCTACGCGC CCTTCCCCTG GGCCGACTAT
CTGGCCAAGG CCGGCATGCC GGGCCTGCCC AAGATGATCC TGGGCGAGAA GACCGCCGTG
CGCGACATCG CAGGGGTCTA TGCCGACACG CCGCTGGAGA CCCTGAAGGC CTGGCAGACC
TTCCACGTTA TCGAGGAGGC CTCGCCCTAC CTGTCCACGC GGTTCGTCAA CAGCCGCTTC
GAGTACGTCA AGGCGCTGAC CGGCCAGACC GTGCTGCGTC CCCGCTGGAA GCGCGGCGTC
CAGCTGGTCG ACGGCAGCCT GGGCGAGGTG GTCGGCCAGA CCTATGTCGC CAAGTACTTC
CCGCCCAGCT CGAAGGCCCA GATGGTCGAG CTGATCGCCA ACCTGAAGAC GGCCATGGCC
GCGCGCATCC AGGCCGCCCC CTGGATGAGC CCGGCGACCA AGGCCGAGGC CCAGGCCAAG
CTCGGCAAGA TGCAGGTGAT GGTCGGCTAT CCCGACAAGT GGCGCGACTA TTCGGGCCTG
AAGCTCGACG TCGGTGACCT TTACGGCAAC GTCAAGCGCA GCCGGGCCTT CGAATGGGCC
TATCAGCTGA CCGACCTGGG CAAGGCCGTC GACCACGGCA AGTGGGGCAT GACCCCGCAG
ACGGTGAACG CCTACAACGG CGGGGTGGAG AACAAGATCG TCTTCCCGGC CGGCATCCTG
CAGGCCCCGT TCTTCGACCC CGCCGCCGAC CCGGCGGTGA ACTACGGCGC GATCGGCGCG
ATCATCGGCC ACGAGATCAG CCACGGCTTC GACGACCAGG GCCGCAAGAT CGACGCCACC
GGCAAGCTGC GCGACTGGTG GACGGCCGAG GACGCCAAGC GCTTCGACGC CCAGGCCGCC
AACCTCGGCA AGCAGTATGA CGGCTACGAA GCCGTGCCCG GCATGTTCAT CAACGGCAAG
CTGACCATGG GCGAGAACAT CGCCGACCTG GCCGGGCTGC AGGTGGCGCT GGACGCCTAC
CACGCCTCGC TGGGCGGCAA GCCCGCGCCG GTGATCGACG GCTTCACCGG CGAGCAGCGC
CTGTTCCTGG CCTTCGCCCA AGCCTGGCGG GACAAGACCC GCGAGGACGC CCTGAAATCG
CAGATGGCCT CGGACCCGCA CTCGCCAGCC GCCTTCCGGG TCATCGGCCC CACCCGCAAT
GTCGACGCCT GGTACGACGC CTTCGGCGCC AAGCCGGGCG ACGCCTACTA CCTCAAGCCG
GAAGACCGCT CGCGGGTGTG GTGA
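
The GC content reported for this gene is the share of G and C bases in the gene sequence. A minimal sketch of that calculation follows; `gc_content` is a hypothetical helper, and it strips whitespace so the 10-base blocks can be pasted in as shown. How the source database rounds the figure is not stated here.

```python
def gc_content(seq: str) -> float:
    """Return the percentage of G and C bases, ignoring whitespace and case."""
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for b in bases if b in "GC")
    return 100.0 * gc / len(bases)


# First line of the gene sequence, pasted with its spacing intact.
first_line = "ATGATGCGCA TTTTCCTGCT GGCCGCCGTC TCGGCGGTCG CCCTTTCTTC CACGGCCCTC"
print(f"{gc_content(first_line):.1f}% GC in the first 60 bases")
```

Passing the full gene sequence instead of the first line gives the gene-wide value.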
 
Protein sequence
MMRIFLLAAV SAVALSSTAL AADPAAKPTY GAWGVDLTAR DTSVSPGTNF DKYANGAWMA 
RTEIPGDQGS AGVGNDVYNR AQDQLRTLIE TADGATQIGA LYKSFSNEAQ VEQIDDKPLK
VDLAQIAAIK TKAQFADSLA RAHGGFGADL FSLDIYPDAK NPELNALYIG QSGLGLPDRD
YYLTETFKPQ LAAYTAFIER SLKAAGYADP AKAAADVVAF ETAIAKISWE VAERRDIDKT
YNPVTLAELT AYAPFPWADY LAKAGMPGLP KMILGEKTAV RDIAGVYADT PLETLKAWQT
FHVIEEASPY LSTRFVNSRF EYVKALTGQT VLRPRWKRGV QLVDGSLGEV VGQTYVAKYF
PPSSKAQMVE LIANLKTAMA ARIQAAPWMS PATKAEAQAK LGKMQVMVGY PDKWRDYSGL
KLDVGDLYGN VKRSRAFEWA YQLTDLGKAV DHGKWGMTPQ TVNAYNGGVE NKIVFPAGIL
QAPFFDPAAD PAVNYGAIGA IIGHEISHGF DDQGRKIDAT GKLRDWWTAE DAKRFDAQAA
NLGKQYDGYE AVPGMFINGK LTMGENIADL AGLQVALDAY HASLGGKPAP VIDGFTGEQR
LFLAFAQAWR DKTREDALKS QMASDPHSPA AFRVIGPTRN VDAWYDAFGA KPGDAYYLKP
EDRSRVW
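
The protein sequence can be reproduced from the gene sequence by translating it codon by codon. The sketch below is an illustration, not the database's own pipeline: it builds the codon table by hand, relying on the fact that translation table 11 assigns the same amino acids as the standard code and differs only in which codons may serve as starts. `translate` is a hypothetical helper name.

```python
from itertools import product

BASES = "TCAG"
# Amino acids listed in TCAG codon order (TTT, TTC, TTA, TTG, TCT, ...).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(seq: str) -> str:
    """Translate a CDS, stopping at the first stop codon.

    Whitespace is ignored. Alternative start codons are not special-cased,
    which is fine here because this gene begins with ATG.
    """
    bases = "".join(seq.split()).upper()
    protein = []
    for i in range(0, len(bases) - 2, 3):
        aa = CODON_TABLE[bases[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


first_line = "ATGATGCGCA TTTTCCTGCT GGCCGCCGTC TCGGCGGTCG CCCTTTCTTC CACGGCCCTC"
print(translate(first_line))  # prints "MMRIFLLAAVSAVALSSTAL"
```

The result matches the first 20 residues of the protein sequence above.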