Gene Caul_3199 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaul_3199 
Symbol 
ID5900654 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaulobacter sp. K31 
KingdomBacteria 
Replicon accessionNC_010338 
Strand
Start bp3459669 
End bp3461498 
Gene Length1830 bp 
Protein Length609 aa 
Translation table11 
GC content65% 
IMG OID641563704 
Productpeptidase M1 membrane alanine aminopeptidase 
Protein accessionYP_001684824 
Protein GI167647161 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0308] Aminopeptidase N 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value0.179758 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value0.906786 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCGCCGA AAATCGCGTC GAGGGGGTTC GCTTTGCCTA TTCAGTTTCA CCGGCCGTTC 
CATCGGACCG CGGCCCTCTG CGCCGTCTCA TTGCTGGCCT TGACGGCCGT CGCTCACGCC
GCCCAAGCCG AAAGTCCGGC CAAGCCGTCC CCGACCACCG CCTTGACCCT GGGCACCGGC
GGCCAGATGC CCGCCGAGGA AGCGGCGGTG ACGCTTGAGC ACGTCGACCT GAAGCTCAAG
ATCATCCCCG AGCGCAAGGC GATTGATGGC GACGCCACCC TGACCCTGGC GGCCAAGAGC
CCCCTGCCCC GCATCGTCCT GGACTTCGAC AAGAACTTCA CGGTCAGCGC CCTGATCGTC
GATGGCAAGA CCCTGCCCGC CTCGGCCTGG ACCAATCCGG AAGGCCGGCT GACCATCAAC
CTCCCAGCCC CGATCGCCGC TGGCGCCAGG ACCGTGGTGC GGATCCGCTA TGCCGGCGCG
CCGCACGAGG CCGAGAAAGC GCCGTGGGAC GGCGGCTTCG TCTGGAAGAC CACGCCGACC
GGCGAGCCCT GGATCGCCAG CGCCGTGCAG GGCGACGGCT GCGACCTGTT CTGGCCGTGC
ATCGACTTTC CGACCGGCGA GCCGCTTCTG GTCGATTTCT ACATCACCGT CCCCGCCCCG
CTGTCGGCCC CCGCCGGCGG CGTGTTGGTC GGCGTCGAGG AGAAGGACGG CTGGCGCACC
TTCCACTGGC GTCAGAAACA GCCCGACACC TACGCCATCG CGGTCAATGT CGGTCCCTAT
GACAAGCTGG AAGCTGCCTA CAAGAGCCGG TTCGGCGACA GCTTTCCGAT CGAGTACTGG
TATCTGAAGA GCGACGATCC GGCCAAGGCC AAGGCCCTGT TTGCCGAGTT TCCGACCACG
CTGGACTTCT TCGAGCAGAT GATCGGCCCC TACCCGTTCC GGTCCGAGAA GCTGGGCGTC
GTCGAGACCC CGCACCTGGG CATGGAACAC CAGACCATGA ACGCCTACGG CAACGAGTAC
CGCAAGGACG TGTTCGGCTA CGACTGGCTG TTCCAGCATG AGCTGTCGCA CGAGTGGTTC
GGCAACCAGG TGACCAATGT CGATTGGGAC GACATGTGGA TCCATGAAGG CCTGGGCAGC
TACATGCAGC CGCTGTTCTC GCAGTGGCTG CACGGCGACA TGGAGTACAT GACGCGGCTG
AACGCCCAGC GGGTCGGCAG CAAGAACCAG TTCCCGATCG TCTCCGACAA GGTGATGACC
GAGGATCAGG TCTACAAACC CGAAGGCGGC CCGGCGAACG ACATCTACGC CAAGGGCTCG
AACGTCATGC ACACCCTGCG GGCGACGATC GGCGACGAGG CGTTCTTCAA GTCGGTGCGC
ACCCTGGTCT ATGGCCGCCC GGACCCCAAG CCCGGCAATT TCGCCCCGCG CTACGCCACG
ACCAAGGACT TCATCGCGAT CGTCAACAGC GTAACCGGCA AGGACTATCA GTGGTTCTTC
GACGCCTACT TCTACCAGGC CAAGCTGCCG GAACTGCGCG AGACCCGCGA CGGCGACGAT
CTGGTGCTGA GCTGGAAGAC CCCCTCGGGC AAGGCCTTCC CGATGCCTGT CGAGGTCAAG
GTCGGCGACA AGGTCGTCAC CGCCCCGATG GCCGACGGCA CGGGCCGGAT CAAGGTCGGC
GACGCCGTGC CGGTGATCGT CGATCCCGCG TCCAAGATCC TGCGCCGCCA GCCCTATCTG
GAAGACTATC AGGCCTGGAA AAAGGCCGCG GACGAAGCCG CCAAGAAGGC CGAAGAGGCC
AAGAAGGCGG CGACCGCGAA GAAGTCGTAG
 
Protein sequence
MSPKIASRGF ALPIQFHRPF HRTAALCAVS LLALTAVAHA AQAESPAKPS PTTALTLGTG 
GQMPAEEAAV TLEHVDLKLK IIPERKAIDG DATLTLAAKS PLPRIVLDFD KNFTVSALIV
DGKTLPASAW TNPEGRLTIN LPAPIAAGAR TVVRIRYAGA PHEAEKAPWD GGFVWKTTPT
GEPWIASAVQ GDGCDLFWPC IDFPTGEPLL VDFYITVPAP LSAPAGGVLV GVEEKDGWRT
FHWRQKQPDT YAIAVNVGPY DKLEAAYKSR FGDSFPIEYW YLKSDDPAKA KALFAEFPTT
LDFFEQMIGP YPFRSEKLGV VETPHLGMEH QTMNAYGNEY RKDVFGYDWL FQHELSHEWF
GNQVTNVDWD DMWIHEGLGS YMQPLFSQWL HGDMEYMTRL NAQRVGSKNQ FPIVSDKVMT
EDQVYKPEGG PANDIYAKGS NVMHTLRATI GDEAFFKSVR TLVYGRPDPK PGNFAPRYAT
TKDFIAIVNS VTGKDYQWFF DAYFYQAKLP ELRETRDGDD LVLSWKTPSG KAFPMPVEVK
VGDKVVTAPM ADGTGRIKVG DAVPVIVDPA SKILRRQPYL EDYQAWKKAA DEAAKKAEEA
KKAATAKKS