Gene Information |
Locus tag | Caul_4678 |
Symbol | |
ID | 5902140 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Caulobacter sp. K31 |
Kingdom | Bacteria |
Replicon accession | NC_010338 |
Strand | + |
Start bp | 5056542 |
End bp | 5058710 |
Gene Length | 2169 bp |
Protein Length | 722 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 641565197 |
Product | endothelin-converting protein 1 |
Protein accession | YP_001686296 |
Protein GI | 167648633 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG3590] Predicted metalloendopeptidase |
TIGRFAM ID | |
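The coordinate and length fields above are mutually consistent, which can be checked with a few lines of arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the last codon of the CDS is a stop codon that is not counted in the protein length. The function names are hypothetical and not part of any database API.

```python
# Values copied from the Gene Information table above.
START_BP = 5056542
END_BP = 5058710
GENE_LENGTH_BP = 2169
PROTEIN_LENGTH_AA = 722


def gene_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(cds_length_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the final codon is a stop."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


print(gene_length(START_BP, END_BP))    # 2169
print(protein_length(GENE_LENGTH_BP))   # 722
```

Both results agree with the Gene Length and Protein Length fields in the record.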
Plasmid Coverage information |
Num covering plasmid clones | 33 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 0.44527 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | ATGGCCAAGT CGCGGTTCAG TTTTTCAACG GATCGCCACT TCATGAAACG AGTCTGGTTC GCCGCCGCCG CCATGGCCGC GCTGTCCTTG TCGGCCTTCG GCGCCCAGGC GCGCGAAGAC CACGACCACG CCTGCCTCAA CGACGCCTGC ACGATGCAGT CGCTGTTCGT CGCCGCCGAC ACCCCGGCGG CCGGCGACGC GGCCATGTCG CTGGACTCGC CCCGCTACGG AACCTGGGGT TTCGACGCGG CGGGCATGGA CCGTTCGGTC AAGCCGGGCG ACGACTTCTA CAAGTTCGCC AACGGGACCT GGGACGCCAA CACCGTCATC CCCAGCGACC GCACCCGCTA CGGCAATTTC GACAAGCTGG CCGAACTGTC CGAGGCCCGC ACCAAGGCGA TCATCCTCGA GGCCGCCGCC AGCGCCGGGG CCGACCCCGA CACCGTCAAG ATCGGCGCGG CCTACAAGGC CTTCATGGAC GAGGCCCTGG CTGAAAAGCT GGACGCCAAG CCGATCGCGC CGGAGCTGGC TGGCATCCGC AAGGTCAAGA CCAGGGACGA TTTCACGGCC CTGATGGGCA AGAACCCCAC CACGGGCTAT GCGGCGATCC TGGGCCTGAA CATCACCCCC GACGCCAAGA ACCCGACCCG CTACGCCGTC TACGCCTCGA CCGGCGGCCT CAGCCTGCCC GACCGCGACT ACTATCTCGA CGCCAAGTTC GCCGAGAAGA AGACCGCCTA CGAGGCCTAT GTCGCCCAGA TGCTGACGAT GATCGGCTGG GACAAGCCGG CCGAAAGCGC CAAGGCCGTC GTCGCCTTCG AGACCCGGAT GGCCGAGGCC ACCTGGACCC GCGCCGCGCG CCGGGATCGC GACAAGACCT ACAACCCGAT GAGCCTGACC GAACTTCAGG CCCTGACCCC GGGCTTCGCC TGGAACCGCT ATCTGGTCGG CACGGAACTG CCCAAGATCG ACCGCGTGGT GGTGACCACC AACACCGCCT TCCCGGCCTT CGCCAAGATC TATGCCGACA CCCCGCTGGA CACCCTGAAG GCCTGGCAGG CGTTCAAGGT GGCCGATGGC GCCGCGCCGA TGCTGTCCAA GCGCTTCGTC GATGCTGCTT ACCAATTCCG CAACAAGACC CTGGCCGGCC AGCCCGAGCA GAAGCCCCGC TGGAAGCGCG GCGTCGCGGC GGTCAACGGC GAGCTGGGCG AGGCGGTCGG CCGCGTCTAT GTGGCGCGCT ACTTCCCGCC GGACTCCAAG GCCAAGATGG TCGACCTGGT CGGCAACATC CGCGCGGTCC TCAAGACCCG CCTGGACAGC CTCGACTGGA TGTCGCCGGA GACCAAGACC CAGGCCCAGG CCAAGCTGGC CCAGTTCACC GTCAAGATCG GCTATCCCGA CACGTGGCGC GACTATTCCA AGCTGGAGAT CAAGGCCGAC GACGTCTACG GCAACGCCAT CCGCTCGGGC GCCTTCGAGT GGCGCCATGA TGTCGAGCGC CTGAACGGTC CGGTCGACAA GAGCGAGTGG GGCATGACCC CGCAGACGGT CAACGCCTAC TACAACTCGG TCAATAACGA GATCGTCTTC CCCGCCGCCA TCCTGCAGGC CCCGTTCTTC CATCCGGACG CCGATCCGGC CGTGAACTAC GGCGGCATCG GCGGGGTGAT CGGCCACGAG ATCAGCCACG GCTTCGACGA CCAGGGCCGC AAGTCGGACG GCCTGGGGGT GCTGCGCGAC TGGTGGACCG CGCAGGACGC GGCCAAGTTC AAGGCCCAGG CCGACAAGCT GGGCGCCCAG 
TACGGCGCGT TCGAGCCGCT GCCCGGCGCC AAGGTCAACG GCCAGCTGAC CATGGGCGAG AACATCGGCG ACATGGGCGG CCTGGCCTTC GCCCTGCAGG CCTATCGCGT CTCGCTGGGC GGCAAGCCGG CCCCGGTGAT CGACGGCTTC ACCGGCGACC AGCGGGTCTA TCTCGGCTGG GCCCAGGTGT GGCGCTCGAA GATCCGCGAC GACGCCCTGC GCCAGCAGGT GGTCAGCGAC CCCCACTCGC CGGCCTATTA CCGCGTCAAC GGCACGATCC GGAACCAGGA CGGCTGGTAC GGCGCCTTCG ACGTGGCGCC GGGCGACAAG CTGTACGTCG CGCCGGAGGA CCGGGTTCGG ATCTGGTAG
Protein sequence | MAKSRFSFST DRHFMKRVWF AAAAMAALSL SAFGAQARED HDHACLNDAC TMQSLFVAAD TPAAGDAAMS LDSPRYGTWG FDAAGMDRSV KPGDDFYKFA NGTWDANTVI PSDRTRYGNF DKLAELSEAR TKAIILEAAA SAGADPDTVK IGAAYKAFMD EALAEKLDAK PIAPELAGIR KVKTRDDFTA LMGKNPTTGY AAILGLNITP DAKNPTRYAV YASTGGLSLP DRDYYLDAKF AEKKTAYEAY VAQMLTMIGW DKPAESAKAV VAFETRMAEA TWTRAARRDR DKTYNPMSLT ELQALTPGFA WNRYLVGTEL PKIDRVVVTT NTAFPAFAKI YADTPLDTLK AWQAFKVADG AAPMLSKRFV DAAYQFRNKT LAGQPEQKPR WKRGVAAVNG ELGEAVGRVY VARYFPPDSK AKMVDLVGNI RAVLKTRLDS LDWMSPETKT QAQAKLAQFT VKIGYPDTWR DYSKLEIKAD DVYGNAIRSG AFEWRHDVER LNGPVDKSEW GMTPQTVNAY YNSVNNEIVF PAAILQAPFF HPDADPAVNY GGIGGVIGHE ISHGFDDQGR KSDGLGVLRD WWTAQDAAKF KAQADKLGAQ YGAFEPLPGA KVNGQLTMGE NIGDMGGLAF ALQAYRVSLG GKPAPVIDGF TGDQRVYLGW AQVWRSKIRD DALRQQVVSD PHSPAYYRVN GTIRNQDGWY GAFDVAPGDK LYVAPEDRVR IW
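As a rough illustration of how the two sequence fields relate, the sketch below strips the 10-base grouping used in the record, translates the first 60 bases of the gene sequence, and computes a GC fraction. It assumes the residue assignments of translation table 11 match the standard genetic code (the two tables differ in permitted start codons, not in codon-to-residue mapping), and it does not handle alternative start codons. The helper names `clean`, `gc_fraction`, and `translate` are hypothetical.

```python
from itertools import product

# Codon-to-residue assignments in NCBI order (bases T, C, A, G; third base varies fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the whitespace used for 10-base grouping and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    s = clean(seq)
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate codon by codon from position 1, stopping at the first stop codon."""
    s = clean(seq)
    residues = []
    for i in range(0, len(s) - 2, 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First 60 bases of the gene sequence, copied from the record above.
GENE_PREFIX = "ATGGCCAAGT CGCGGTTCAG TTTTTCAACG GATCGCCACT TCATGAAACG AGTCTGGTTC"

print(translate(GENE_PREFIX))           # MAKSRFSFSTDRHFMKRVWF
print(f"{gc_fraction(GENE_PREFIX):.0%}")
```

The translated prefix matches the first 20 residues of the protein sequence field. Passing the full gene sequence to `gc_fraction` would give a value to compare against the GC content field.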