Gene Caul_1725 details

Gene Information

Locus tag: Caul_1725
Symbol:
ID: 5899180
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Caulobacter sp. K31
Kingdom: Bacteria
Replicon accession: NC_010338
Strand:
Start bp: 1814402
End bp: 1816450
Gene Length: 2049 bp
Protein Length: 682 aa
Translation table: 11
GC content: 69%
IMG OID: 641562215
Product: endothelin-converting protein 1
Protein accession: YP_001683352
Protein GI: 167645689
COG category: [O] Posttranslational modification, protein turnover, chaperones
COG ID: [COG3590] Predicted metalloendopeptidase
TIGRFAM ID:
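
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends in a single stop codon; the function names are illustrative and are not part of the source database.

```python
def feature_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Residues encoded by a CDS, assuming one terminal stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


# Values taken from the record above.
gene_length = feature_length(1814402, 1816450)
print(gene_length)                           # 2049
print(expected_protein_length(gene_length))  # 682
```

Both results agree with the Gene Length and Protein Length fields listed in the record.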


Plasmid Coverage information

Num covering plasmid clones: 31
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 11
Fosmid unclonability p-value: 0.172464
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACCTTGA CCCGTTCGCT GCTCGCCTGC GCCGCCGCCT GCGCGCTGCT CTCCGCCGCT 
CCAGCCTTCG CCGCCGACCT CAAGAGCGGC GTCGATCGCT CGGCTTTCGA CACCGCCGTC
AAGCCTGGCG ACGACTTCTG GACCTACGCC AACGGCGCGG CGGTCAAGGC CAATCCGATC
CCCGCCGACC GCAGCAGCTA CGGCGTCGCC GTCCAGCTGA TCGAGGAGGC GTCCAAGCGC
ACGGTCGACC TGATCCAGAC CGCGGCCAAG GACGGCGGTT CCCCCGACGC CAAGAAGGTC
GGCGACTACT ACGCCAGCTA CATGGACGAG GCGGCCATCG AGAAGGCCGG CATCGCGCCG
CTGAAGCCGG GCCTGGCGCG GATCGCCGGG ATCAAGACCC GCACCGACCT GGCCGCCGTG
CTGGGCGGCA ATATCCGGGC CGACGTCGAT GCGCTGAACG CCACTGACTT CTACACCGAC
AACGTCCTGG GCCTGTGGGC CTCGCCGTCG TTCGACGACC CCACCAAGTA CGCCCCGTTC
CTGCTGCAGG GCGGTCTGGG CATGCCCGAC CGCGAATACT ACCTGTCAGA CAAGGACGCG
ATGAAGGCGA TCCGGGCCAA GTACGTGGCC CATATCGCCA AGATCCTGGC CTTGGGCGGC
GTGCCCGACG CCGAGGCCAA GGCCGCGCGG ATCATGGCCC TGGAGACCAA GATCGCCCAG
GCCTCGGCCA GCCGCGCCGA CAGCGCCGAT GTCCAGAAGG CCAACAACAG CTGGACCCCG
GCCGACTTCG CGGCCAAGGC CCCGGGGCTG GACTGGAAGA CCTTCTTCGC CTCGGCCGGC
CTGGCCGACC AGCCGCGCTT CGTCGTCTGG CACCCGACCA TGGTCGTGGG CCTGGGCAAG
GTGGCCGCCG ACGAGAGCAT CGACACCTGG AAGGACTACC TGACCTTCCA CTACCTCGAC
CACTATTCGA ACCTGCTGCC CAAGGCTTTC GTCGACGAGC GCTTCGCCTT CTATGGCCAG
GCCCTGCAGG GGACGCCGCA GCTGGCGGCC CGCTGGAAGC GCGGCGTCAA CTCGACCAAC
GCCGTGCTCG GCGAGGTGGT CGGCAAGCTC TATGTCGACA AGTACTTTCC GGCTCGCTCC
AAGGCCGAGG TCGCGGCCAT GGTCGACAAT ATGAAGGCCG CCTTCGTGCG CCGCATCGAC
GCCCTGGACT GGATGGCCCC GGAGACCAAG GCCGAGGCTC GCCGCAAGGT GGAAGTGCTG
AAGGTCGGCG TCGGCTATCC CGATAAATGG CGCGACTACA CCGCCCTGCA GATCGTCCGC
GGCGACCCGG TCGGCAACGT CCAGCGCTCG GAGAAATTCG AGACCGCCTA CTGGATCGGC
CAGCTGGGCA AGCCGGTTGA TCGCGGCCAG TGGGTGATGA CGCCCCAGAC CGTCAACGCC
GTGAACCTGC CGATGCTGAA CGGCCTGAAC TTCCCGGCCG CCATCCTGCA GGCTCCGTAT
TTCGACGTCG AGGCCGACGC GGCGGCCAAT TACGGCGGCA CCGGCGGCAC GATCGGCCAC
GAGATCAGCC ACAGCTTCGA CGACCAGGGC GCGCAGTTCG ACAGCCAAGG CCACCTGCGC
AACTGGTGGA CCCCGGCCGA CTACGACCAC TTCCAGAAGG CGGGCGCGGC CCTGGCCGCG
CAGTTCGACG GCTACAAGCC GTTCCCTGAC CTGGCCGTGA ACGGCAAGCA GACGCTCAGC
GAGAACATCG CCGACGTCGC CGGCCTGGCC GCGGCCCTGG ACGCCTACCA CGCCTCGCTG
GGCGGCAAGC CGGCCCCGGT GATCGACGGC TTCACGGGCG ACCAGCGGTT CTTCCTGGCC
TTCGCCCAGA GCTGGCGCGG CTATGATCGG CCCGAGGCCC TGCGCCAGCA ACTGGTCACC
GACGGCCACG CGCCCGATCA GTACCGCGCC GACACGGTGC GCAATCTCGA CGCCTGGTAC
GCCGCCTTCG ACATCAAGCC GGGCGACGCG CTGTACCTGA CGCCCGAGCA GCGCGTGAAG
GTCTGGTAG
 
Protein sequence
MTLTRSLLAC AAACALLSAA PAFAADLKSG VDRSAFDTAV KPGDDFWTYA NGAAVKANPI 
PADRSSYGVA VQLIEEASKR TVDLIQTAAK DGGSPDAKKV GDYYASYMDE AAIEKAGIAP
LKPGLARIAG IKTRTDLAAV LGGNIRADVD ALNATDFYTD NVLGLWASPS FDDPTKYAPF
LLQGGLGMPD REYYLSDKDA MKAIRAKYVA HIAKILALGG VPDAEAKAAR IMALETKIAQ
ASASRADSAD VQKANNSWTP ADFAAKAPGL DWKTFFASAG LADQPRFVVW HPTMVVGLGK
VAADESIDTW KDYLTFHYLD HYSNLLPKAF VDERFAFYGQ ALQGTPQLAA RWKRGVNSTN
AVLGEVVGKL YVDKYFPARS KAEVAAMVDN MKAAFVRRID ALDWMAPETK AEARRKVEVL
KVGVGYPDKW RDYTALQIVR GDPVGNVQRS EKFETAYWIG QLGKPVDRGQ WVMTPQTVNA
VNLPMLNGLN FPAAILQAPY FDVEADAAAN YGGTGGTIGH EISHSFDDQG AQFDSQGHLR
NWWTPADYDH FQKAGAALAA QFDGYKPFPD LAVNGKQTLS ENIADVAGLA AALDAYHASL
GGKPAPVIDG FTGDQRFFLA FAQSWRGYDR PEALRQQLVT DGHAPDQYRA DTVRNLDAWY
AAFDIKPGDA LYLTPEQRVK VW
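
Both sequences above are printed in blocks of ten characters, so the whitespace has to be stripped before they can be processed. The sketch below shows one way to do that in plain Python, translate the cleaned nucleotide sequence, and compute its GC percentage. It assumes the amino-acid assignments of translation table 11, which are the same as those of the standard genetic code; the helper names (`clean`, `translate`, `gc_percent`) are illustrative only.

```python
from itertools import product

# Codon table built in the conventional T, C, A, G order. The amino-acid
# assignments of translation table 11 match the standard code (assumption
# stated above); '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate in frame from the first base, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in the sequence."""
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First line of the gene sequence shown above.
first_line = "ATGACCTTGA CCCGTTCGCT GCTCGCCTGC GCCGCCGCCT GCGCGCTGCT CTCCGCCGCT"
print(translate(clean(first_line)))  # MTLTRSLLACAAACALLSAA
```

The printed residues match the first two blocks of the protein sequence listed above. Passing the full gene sequence through `clean` and then `gc_percent` gives a value that can be compared with the GC content field.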