Gene Caul_4017 details


Gene Information

Locus tag: Caul_4017
Symbol:
ID: 5901479
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Caulobacter sp. K31
Kingdom: Bacteria
Replicon accession: NC_010338
Strand:
Start bp: 4352676
End bp: 4353737
Gene Length: 1062 bp
Protein Length: 353 aa
Translation table: 11
GC content: 68%
IMG OID: 641564538
Product: proline imino-peptidase
Protein accession: YP_001685640
Protein GI: 167647977
COG category: [R] General function prediction only
COG ID: [COG0596] Predicted hydrolases or acyltransferases (alpha/beta hydrolase superfamily)
TIGRFAM ID:
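
The coordinate and length fields in this record agree with one another, and a short check makes the arithmetic explicit. The sketch below assumes the Start bp and End bp values are inclusive 1-based coordinates and that the CDS ends in a stop codon that is not counted in the protein length. The function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given inclusive start and end coordinates."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Number of residues encoded by a CDS whose final codon is a stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


length = gene_length(4352676, 4353737)
print(length)                  # 1062
print(protein_length(length))  # 353
```

Both values match the Gene Length and Protein Length fields listed in the record.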


Plasmid Coverage information

Num covering plasmid clones: 37
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 17
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTCGAAAT CTCTTGTTCT GGCCGTTGCG CTGGCTTTCG CCGGCGCGCC CCATCTAGTC 
CTGGCCGCCC AGCCGGCCGC GGTCGCCGTG CCGGCGAACG CGGCCATCAA TGAAGAGGGC
TTCGTCCGCA TCGGGGGGAT CGAGCAGTGG GTCACCATCC ACGGCGAGGA CCGCTCCAAG
CCGGTGATCC TGATGGCGCA CGGCGGTCCG GGCAATCCGA TGACCCCCTA CGCCCACCCC
TATTTCCAGG CGTGGGAGAA GGACTTCGTC ATCGTCCAGT GGGACCAGCG CGGCGCGGGC
ATGACCTATG GCCGCAACCC GCCGGCCCAG GGCGAGCACC TGAGCGTCGA GCGGCTGCGC
GACGACGGCA TCGAGGTGGC GACCTATGCG GCCAAGCATC TGGGCAAGCC CAAGGTGATC
CTGATGGGCG GCTCGTGGAG TTCGATCCTG GGGACCCACA TGGCCAAGGC GCGGCCCGAC
CTGTTCTACG CCTATGTCGG CTCCTCGCAC CTGGCCCGTT CGGCCGACAA TCTGAAGGCC
TCGTACGACC GGACGCTGGG CCTGGCGCGG GCCGGCGGCG ACCAGGACGC GATCGGCAAG
CTGGAGGCCA TGGGGCCGCC GCCCTGGACC AACCCGCGAA ACTTCGGGAT CATCCGCCGC
ATCACCCGCA AGTACGAGGC CGCACGCACT GATCCCGCGC CCGCCACGTG GATGGAGCCC
AATCCCGTCT ACGCCACGGA GAAGGCCCTG GCCGACTACG AGGGCGGCGA GGACTATTCC
TACATCGAGT TCGTGGGGAT GAACGGCGAG GGTATGTATT CGAAGACCGA TCTCTACGCC
CTGGGGCCGC AATTCAAGCT GCCGGTGTTC GTGATCCTGG GCGAGCAGGA CCTGGTCTCG
ACGCCCGAGG TCGCCCGCGC CTGGTTCGAT ACCTTGCAGG CGCCTGACAA GGCGTTCGTG
CTGCTGCCGC GCACGGGGCA CGATCCGAAC CCGGCCATGG CGGCGGCGCA GCTGGAGATC
TTGAAGACGC GGGTCTTGCC GCTGATCGGA AAGGGCGGCT GA
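
The gene sequence is printed in blocks of ten bases, so the spaces and line breaks have to be removed before any computation. As a minimal sketch, the helpers below strip the block formatting and compute the G+C fraction, the quantity summarized by the GC content field. The function names are hypothetical, and the snippet is only demonstrated on the first line of the sequence, so it does not claim to reproduce the reported percentage.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# First line of the gene sequence, copied as displayed (six blocks of ten).
first_line = "ATGTCGAAAT CTCTTGTTCT GGCCGTTGCG CTGGCTTTCG CCGGCGCGCC CCATCTAGTC"
seq = clean_sequence(first_line)
print(len(seq))  # 60
print(f"GC fraction of first line: {gc_fraction(seq):.2f}")
```

Passing the full sequence text to `clean_sequence` and then to `gc_fraction` would give the gene-wide value for comparison with the GC content field.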
 
Protein sequence
MSKSLVLAVA LAFAGAPHLV LAAQPAAVAV PANAAINEEG FVRIGGIEQW VTIHGEDRSK 
PVILMAHGGP GNPMTPYAHP YFQAWEKDFV IVQWDQRGAG MTYGRNPPAQ GEHLSVERLR
DDGIEVATYA AKHLGKPKVI LMGGSWSSIL GTHMAKARPD LFYAYVGSSH LARSADNLKA
SYDRTLGLAR AGGDQDAIGK LEAMGPPPWT NPRNFGIIRR ITRKYEAART DPAPATWMEP
NPVYATEKAL ADYEGGEDYS YIEFVGMNGE GMYSKTDLYA LGPQFKLPVF VILGEQDLVS
TPEVARAWFD TLQAPDKAFV LLPRTGHDPN PAMAAAQLEI LKTRVLPLIG KGG
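
The protein sequence can be related back to the gene sequence by translating codon by codon. The sketch below builds the codon table from the amino acid string used for NCBI translation table 11, whose codon assignments are the same as the standard code. It assumes the sequence starts at an ATG and ignores the alternative start codons that table 11 also permits. `translate` is an illustrative helper, not a library function.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for codons in TCAG order (third base varies fastest); '*' is stop.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(text: str) -> str:
    """Translate a block-formatted DNA sequence up to the first stop codon."""
    seq = "".join(text.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First two blocks and the final two blocks of the gene sequence.
print(translate("ATGTCGAAAT CTCTTGTTCT"))  # MSKSLV
print(translate("AAGGGCGGCT GA"))          # KGG
```

The two outputs correspond to the first six and last three residues of the protein sequence listed here.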