Gene Caul_4585 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaul_4585 
Symbol 
ID5902047 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaulobacter sp. K31 
KingdomBacteria 
Replicon accessionNC_010338 
Strand
Start bp4959045 
End bp4960466 
Gene Length1422 bp 
Protein Length473 aa 
Translation table11 
GC content72% 
IMG OID641565104 
Productleucyl aminopeptidase 
Protein accessionYP_001686203 
Protein GI167648540 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0260] Leucyl aminopeptidase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones39 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCGGAAC CGATCATCGC CTCCTCGTCC CGCTCCGCGC AGGATGGCGC GGCGACACCC 
ATCCACTGCC TGTACGAGGC TGAACTGGCC GCCTTCCTCG AGGCGCGGCC CAGCTTCGTG
AAGGGCTTCG TGGCGCTGGA GGACTTCAAG GCCAAGGCCG GCCAGGTGCT GGTATTGCCG
ACGCCGCAAG GGGCCGTGGA CCGGGTGTTG CTGGGATTGG GGGCCAAGGG CAAGGCCGAC
GCCATGCTGT TTCGCGCCCT GCCCGGTCGC CTGCCGGCCG GCGACTACCG CCTGGCGGCG
ATCCCCGACG GGCTGGACGC CGGCCAGATC GCCTTGGCCT TCGCGCTGGG CGGCTACAGG
TTCGACCGCT ATCGCCCGAA GGCCGGTGAA GGGCCGCGAC GACTGGTCGC GGACGAAGGC
GTCGATCTGG ACGAGGTCCG TAGCGTCGCC CACGCCTGCG CCCTGGCCCG CGACATGATC
AACACCCCGG CCAACGACAT GGGGCCGCTG CAGATCGAGA CCATCGCCCG CGAGATCGCC
GAGCGCCATG GCGCGACCCT GAGCGTGGTC ACGGGCGACG ATCTGCTGGA GCAGAACTAC
CCCGCCGTCC ACGCGGTCGG CCGCGCCGCT GTCCCGGCCC GCGGCCCCCG CATGCTGGAG
ATCACCTGGG GCGACGCCTC GCGTCCGCGC GTGGCGCTGA TCGGCAAGGG CGTGGTGTTC
GACACCGGCG GTCTCGACAT CAAGCCGTCG TCGGGCATGC GGTTGATGAA GAAGGACATG
GGCGGCGCCG CCCACGCCCT GGCCCTGGGC CGGATGGTCA TGGCCGCCGG CCTGCCCGTA
GCGCTGAGCG TGCTGGTGCC CGTGGCCGAG AACGCCATCG CCGGCGACGC CATGCGGCCC
GGCGACGTGC TGGCCACCCG CGCCGGCCTG ACGGTGGAGG TCGGCAACAC CGACGCCGAG
GGCCGGCTGA TCCTGGCCGA CGCCCTGGCG CGCGCCGCCG AATTGGAACC GGTCCTGACC
ATCGACCTAG CCACCCTGAC CGGCGCGGCG CGCGTGGCGC TGGGGCCGCA GGTGATCCCG
TTCTACACCC CCGACGACGA CCTGGCCCTG GAGATCGAGG AGGGCGCGCG CGAGGCCGTC
GACCCGGTCT GGCGCATGCC GCTGTGGGAC GGCTATCGCG AGGCGATCGA GGGCGACATC
GCCGACCTGA AGAACGATCC CGACGCCTGG GCCCAGGCCG GCTCGATGAC GGCGGCCCTG
TTTCTGCAAC GCTTCGCCCC GACCACGGGG GCCTGGGTGC ATTTCGACAT CTTCGCCTGG
AATCCCAAGC AGCGCCCGGG CTTCGCCTCG GGCGGCGAGG CCCAGGTGAT CCGCGGGCTC
TACGGCATGC TCAAGTCGCG GTTCCCGAAG GTTCAAGCAT GA
 
Protein sequence
MSEPIIASSS RSAQDGAATP IHCLYEAELA AFLEARPSFV KGFVALEDFK AKAGQVLVLP 
TPQGAVDRVL LGLGAKGKAD AMLFRALPGR LPAGDYRLAA IPDGLDAGQI ALAFALGGYR
FDRYRPKAGE GPRRLVADEG VDLDEVRSVA HACALARDMI NTPANDMGPL QIETIAREIA
ERHGATLSVV TGDDLLEQNY PAVHAVGRAA VPARGPRMLE ITWGDASRPR VALIGKGVVF
DTGGLDIKPS SGMRLMKKDM GGAAHALALG RMVMAAGLPV ALSVLVPVAE NAIAGDAMRP
GDVLATRAGL TVEVGNTDAE GRLILADALA RAAELEPVLT IDLATLTGAA RVALGPQVIP
FYTPDDDLAL EIEEGAREAV DPVWRMPLWD GYREAIEGDI ADLKNDPDAW AQAGSMTAAL
FLQRFAPTTG AWVHFDIFAW NPKQRPGFAS GGEAQVIRGL YGMLKSRFPK VQA