Gene Caul_0003 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaul_0003 
Symbol 
ID5897715 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaulobacter sp. K31 
KingdomBacteria 
Replicon accessionNC_010338 
Strand
Start bp2749 
End bp5409 
Gene Length2661 bp 
Protein Length886 aa 
Translation table11 
GC content69% 
IMG OID641560486 
Productpeptidase M1 membrane alanine aminopeptidase 
Protein accessionYP_001681639 
Protein GI167643976 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0308] Aminopeptidase N 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value0.350265 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGTCGAC TCCTTTCCTC CGCGGCTGCC GCCGTGCTCG TTTTTGCCGC AGGTTCGGGC 
CTGGCCGCGA CAGTCGCCAA GTCGGCCGCT CCCGTCGCCA GGCCGGCGAT CACCGCCGCC
AACGTCACGA CCCAGTTGCC GCGCGGCGTG GTCCCGACCC ACTATGATCT GGCCTTCACC
CCCGACGCCG ACAAGCTGAC CTTCACCGCC TCGGTGAAGA TCGCCATCGA CGTGGTCAAG
CCGACCAACA CCATCACCCT GCAGGCCGCC GACCTGGCGT TCGCCAAGGC CGAGATCGCC
GGGATCGGCG CCGCCAAGGT CAGCCTCGAC GCCGAGGCCC AGACCGCCAC CTTCACCTTC
GACAAGGTGG TGACCAAGGG CGCCCACGTC CTGGCCCTCG ACTATAGCGG CAAGATCTAC
AAGCAGGCCG CGGGCCTGTT CGCCCTGGAC TACGACACCG ACCAGGGCAA GAAGCGCGCC
CTCTACACCC AGTTCGAGAA CAGCGACGCC CGCCGCTTCA TCCCCTCGTG GGACGAGCCG
TTCTTCAAGG CCACCTACGA CGTCCAGGTC ACCGTCCCGA CCGGCCAGAT GGCCATCGGC
AACATGCCGA TCGCCAAGAC GCAGGACCTG GGCGGCGGCA AGTCCAAGAT CACCTTCGCC
ACCTCGCCGA AGATGTCGAC CTATCTGCTG TTCTTCGGCC TGGGCGAGTT CGACCGCGCC
ACCGCCAAGG TCGGCGACGT CGAGATGGGG GTGATCACCA AGAAGGGCGA CCTGGCCAAG
GCCGATTTCG CCCTGAAGTC GTCGGGTCCG ATCCTGCAAT GGTACAACGA CTATTTCGGC
GCGCCCTACC CGCTGCCGAA GCTGGACCAC ATCGCCGCCC CCGGCCAGAG CCAGTTCTTC
AGCGCCATGG AGAACTGGGG CGCGATCTTC TACTTCGAGT ACGCCCTGCT GGAAGATCCC
GCGATCTCGA CCCAGAACGA TCGCGAGAAC ATCTACACCA CCGTGGCGCA CGAAATGGCC
CACCAGTGGT TCGGCGACCT GGTCACCATG TCGTGGTGGG ACGACCTGTG GCTGAACGAA
GGCTTCGCCT CGTGGATGGA AAGCCGGGCC ACCGAGCACT TCCACCCCGA ATGGAACACC
AGCCTGGCGG CGGTCGGCGG CCGTGAATAC GCCATGGGCC TGGATTCGCT GGCCACCACC
CACCCGGTCG TGCAGCACGT CGAAACCGTC GACCAGGCCA GCCAGGCCTT CGACGGGATC
ACCTACCAGA AGGGCCAGGC GGTCATCAGC ATGCTCGAGG CCTATGTCGG CCCCGAGGCC
TGGCGCGGCG GCGTGCGCGC CTACATCAAG GCCCACGCCC ACGGCAACAC CACGACCGAC
GACCTGTGGG CCGAGGTCGA GAAGGCCGCC GGCAAGCCGA TCACCGCCAT CGCCCACGAC
TTCACGCTGC AGCCGGGGAT TCCGTTGATC ACGGTCGAGG CCGGAGCCTG CGCGGCCGGC
AAGACCCCCG TCTCCCTGAC CCAGGGCGAG TTCAGCCGCG ACAAGCCGAC CAAGACCCCC
CTGGCCTGGC GCGTGCCGGT CTCGGCCCAG GTCGCCGGCT CCAGCACGGT GGCCAAGACC
CTGGTCGTGG GCGGCAAGGG CTCGCTCAGC GTCGATGGCT GCGGTCCGGT GATCGTCAAC
GCCGGCCAGG CTGGCTACTT CCGCACCCTC TATGCGCCCA AGGCCTTCGC CGGCGTGTCG
GCCAGCTTCG CCAAGCTGCC GGCCATCGAC CAGCTGGGCG TCATTTCCGA CGCCTGGGCG
CTGGGCCTGA ACGGCCAGCA GGCCGTCACC GACGCCCTGG ATCTGATCAT GGCCACGCCG
GCCGACGCCG ATCCTCAGGT GTGGGGCAAG GTCGCGGGCG TCCTGACCAA CATCAACGGC
ATGTATGACA GCGCGCCCGC CGATCGCGCG GCGTTCCGCA AGCTGGCCAT CGCCCGCCTG
TCGCCGGCCT TCGCGCAGGT GGGTTGGACG GCCAAGCCGG GCGAAGCCGG CACGATCGCT
ACGCTGCGGT CGACCCTGAT CACCTCGCTG GGCGCCCTGG GTGATCCGGC GGTGGTCGCC
GAGGCCAAGC GTCGCTACGC CGCCGACAAG ACCGATCCGG CCGCCGTGCC CGGCCCGCTG
CGCAAGGCCA TCCTGGCCAC CGTGGCCCGC AACGCCGACG CCGCGACCTG GGACGCCCTG
CACGAACAGG CCAAGGCCGA GAAGACCCCG CTGATCCGCG ACCAGCTCTA CACCCAGCTG
GCCTCGGCCG AGGACGACGC CCTGGCCGCC AAGGCCCTGG AATTGGCCCT GACCGACGAG
CCGGGCGAGA CCCTGTCGGC CAACATGATC TCGCGGGTGT CCGGCCTGCA CGCCGACATG
GCCTTCGACT TCGCCGTCGC CCACAAGGAC GCGGTCAACA GCAAGGTCGA CGCGGCCTCG
TCGACCAAGT TCATTCCGGG CCTGGCGCGC GGTTCGGCCG ATCCGGCGAT GATCGGCAAG
GTCACGGCCT ACGCGGCCGC CAACCTGCCG GCCGGCTCGC GCGGCGAGGC CGACAAGTCG
GTGGCCTCGA TCACCGACCG CATCAAGGCC CGCAAGGCCG CCCTGCCGCA GATCACCGCC
TGGGTGGCCA AGAAGGGCTA A
 
Protein sequence
MRRLLSSAAA AVLVFAAGSG LAATVAKSAA PVARPAITAA NVTTQLPRGV VPTHYDLAFT 
PDADKLTFTA SVKIAIDVVK PTNTITLQAA DLAFAKAEIA GIGAAKVSLD AEAQTATFTF
DKVVTKGAHV LALDYSGKIY KQAAGLFALD YDTDQGKKRA LYTQFENSDA RRFIPSWDEP
FFKATYDVQV TVPTGQMAIG NMPIAKTQDL GGGKSKITFA TSPKMSTYLL FFGLGEFDRA
TAKVGDVEMG VITKKGDLAK ADFALKSSGP ILQWYNDYFG APYPLPKLDH IAAPGQSQFF
SAMENWGAIF YFEYALLEDP AISTQNDREN IYTTVAHEMA HQWFGDLVTM SWWDDLWLNE
GFASWMESRA TEHFHPEWNT SLAAVGGREY AMGLDSLATT HPVVQHVETV DQASQAFDGI
TYQKGQAVIS MLEAYVGPEA WRGGVRAYIK AHAHGNTTTD DLWAEVEKAA GKPITAIAHD
FTLQPGIPLI TVEAGACAAG KTPVSLTQGE FSRDKPTKTP LAWRVPVSAQ VAGSSTVAKT
LVVGGKGSLS VDGCGPVIVN AGQAGYFRTL YAPKAFAGVS ASFAKLPAID QLGVISDAWA
LGLNGQQAVT DALDLIMATP ADADPQVWGK VAGVLTNING MYDSAPADRA AFRKLAIARL
SPAFAQVGWT AKPGEAGTIA TLRSTLITSL GALGDPAVVA EAKRRYAADK TDPAAVPGPL
RKAILATVAR NADAATWDAL HEQAKAEKTP LIRDQLYTQL ASAEDDALAA KALELALTDE
PGETLSANMI SRVSGLHADM AFDFAVAHKD AVNSKVDAAS STKFIPGLAR GSADPAMIGK
VTAYAAANLP AGSRGEADKS VASITDRIKA RKAALPQITA WVAKKG