Gene Cpin_4009 details


Gene Information

Locus tag: Cpin_4009
Symbol:
ID: 8360182
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Chitinophaga pinensis DSM 2588
Kingdom: Bacteria
Replicon accession: NC_013132
Strand:
Start bp: 4986865
End bp: 4989096
Gene Length: 2232 bp
Protein Length: 743 aa
Translation table: 11
GC content: 49%
IMG OID: 644966183
Product: Glutamate carboxypeptidase II
Protein accession: YP_003123672
Protein GI: 256423019
COG category: [R] General function prediction only
COG ID: [COG2234] Predicted aminopeptidases
TIGRFAM ID:
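
The coordinate and length fields in this record agree with each other, and a short sketch can confirm that. It assumes the start and end positions are 1-based and inclusive, and that the final codon is a stop codon that is not counted in the protein length. The variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields.
start_bp = 4986865
end_bp = 4989096

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and encodes no residue.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length, "bp")     # compare with the Gene Length field
print(protein_length, "aa")  # compare with the Protein Length field
```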


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value: 0.677524
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 16
Fosmid unclonability p-value: 0.0030346
Fosmid Hitchhiker: No
Fosmid clonability: decreased coverage
 

Sequence

Gene sequence
ATGAATCTGA AGACAACTGC CAGCAGTATT GCCTTATTTT TGTTACTACA AAGCTCCTAT 
GCACAACAAA AGCTGAGTGG TTTCAGCAGT GAACATGCGC AACAGCAACA GGACCTGGAA
GCCCGTTTCG ACAAAGGACT GAGCGCTGCC GCCATCGGCA ATAATATCAG GACGCTGTCT
GCCAAACAAC ATTATCTCGG TATGCCCCGG GATAAATGGG TCGCTGAGAA TATCCTGCAA
CAATTTAAAA GTTATGGCTG GGATGCCCAC CTGGAAACTT ACCAGGTGCT GTTTCCTACA
CCGAAGACCA GGGTACTGGA AGCCAGCTAT CCTGCAGGCT ATAAAGCGGT ATTGAAGGAA
CCGGCATTGA AGGAAGACGC GAGTACCGGT CAGCCGGATG AACTGCCTAC CTACAATGCC
TGGAGTGCAG ACGGTGATGT AACAGGAGAA CTGGTTTTTG TGAATTATGG TCTGCCGGAA
GACTATGAAT ACCTGGAAAG ACTGGGCATC GATGTAAAAG GGAAAATTGT CATTGCGAAA
TACGGCCGTT CCTGGAGAGG TATAAAACCG AAGGTTGCCC AGGAACATGG CGCAATCGGC
ACCCTGATCT ATTCTGATCC GAAAGATGAC GGGTATTACC AGGGGGATGT ATACCCTGTG
GGTCCTTATA AAAGCGAATA CGGGGTCCAA AGAGGCTCTA TTATGGATAT GGTGATATAT
CCGGGTGATC CTTTGACGCC TGGCGTAGGA GCGACGGAAA ATGCCCAGAG ACTGGAAAGG
TCTGCCGCTA CAAATCTGTT GAAGATCCCG GTATTGCCGA TCAGCTATCA TGACGCTGCT
CCTTTGCTGG CTGCTCTGGA AGGACCAGTC GCTCCTGATG CCTGGAGAGG CGCATTGCCC
TTTACTTACC ATATCGGTCC CGGAAAGGCA AAAGTGCACC TGAAGCTGGA ATTTGACTGG
AAAATGGTGC CTGCGTATAA TGTGATTGCT ACGATGAAAG GAAGTCAGTT TCCGGACCAG
TGGGTGATCA GGGGTAATCA CCATGATGCC TGGGTATATG GCGCAGCCGA TCCGATTAGC
GGACTGTCTT CCCTGCTGGA AGAAGCAAAG GCGATCGGTG AACTGGCAAA GAACGGGTAT
AAGCCGAAGA GAACGCTTGT GTATGCTGCC TGGGATGGCG AAGAGCCGGG TTTGCTGGGT
TCGACTGAAT GGGTGGAAGC ACATGCCGCT GAATTACAAC AGAAAGCAGT CGCTTATATC
AATTCTGATG GTAACAGCCG CGGATTCCTG GGTGTAGGTG GTTCACATGC ACTGGAACCT
TTTATGGGTG AGATTGCTAA AAGTATTACC GATCCGCAGA CGAAAGTCAG CATTTTCGAA
AGAAAACAGG CTTCCGACCT TGTATCTGCT GCTTCAACAA AAGCAAAGAA AGATATCCTG
GCTAAGAAAG ATATGACGAT CAGCGCATTG GGATCAGGGT CTGATTATTC TTCTTTCCTT
CAACACCTGG GTATTCCATC CCTGAATGTG GGTTTTGGCG GAGAAGGCGC CGGTGGTGAA
TACCATTCTA TTTATGACAC TTATGAGAAT TACTCCCGCT TTAAAGATCC GGGATTTGAA
TATGGTGTAG CCTTGTCCCG TCTGGCCGGA CATGCCGCCC TGAGACTGGC TGATGCAGAT
GTACTGCCAT TTGATTTCCG GAGCCTTTCT AAAACGATCA ACGGTTATAC CACTGACCTG
CTCTCCCTGG CAGAACAGAT GCGGGAAAAT ACAGCGGTAG AAAATCAAAT CATCAGTAAT
AATGCCTATC AACTGGCAGG AGATGTAACA AAGCCGTTAA AGGCGCCTGT CGCCAAACCG
GAAGTGCCAT ATATAGATTT TTCGAAGTTG CAGAATGCCC TGGTAGCCCT TGATAAAACG
GCACAGCATT TACAGGATGC CAGGAAACCA CAGTTACCGG CAGCGCAACT GGAGGTACTG
AATAAAGCCC TTTATCAGGC AGAGCAGCAG TTATTGCATG AACAAGGTTT GCCTAACAGG
GCATGGTATA AACACGTGAT TTATGCGCCG GGATTTTATA CGGGGTATGG TGTGAAGACA
ATGCCGGGTA TCCGCGAAGC GATTGAACAG CGTAGATGGA AAGAAGCGGA GGAACAGATT
GGTATTGCAG CGACAGCGAT CAACCGGCTG ACAGACTACC TGGAAAAGAC ATTTAATTCA
ATTAGGAATT AG
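
The GC content field can be recomputed from the gene sequence. The record does not say how the percentage was calculated or rounded, so the sketch below assumes the usual definition (G plus C bases divided by all bases) and ignores the spaces and line breaks used to group the sequence into blocks of ten. The function name `gc_content` is hypothetical.

```python
def gc_content(sequence: str) -> float:
    """Return the percentage of G and C bases in a nucleotide sequence.

    Whitespace is dropped first, so the sequence can be pasted
    exactly as displayed (blocks of ten bases, sixty per line).
    """
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for base in bases if base in "GC")
    return 100.0 * gc / len(bases)


# Usage: the first displayed block of the gene sequence.
first_block = "ATGAATCTGA"
print(f"{gc_content(first_block):.1f}%")
```

Passing the complete gene sequence to the same function gives the value to compare against the GC content field.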
 
Protein sequence
MNLKTTASSI ALFLLLQSSY AQQKLSGFSS EHAQQQQDLE ARFDKGLSAA AIGNNIRTLS 
AKQHYLGMPR DKWVAENILQ QFKSYGWDAH LETYQVLFPT PKTRVLEASY PAGYKAVLKE
PALKEDASTG QPDELPTYNA WSADGDVTGE LVFVNYGLPE DYEYLERLGI DVKGKIVIAK
YGRSWRGIKP KVAQEHGAIG TLIYSDPKDD GYYQGDVYPV GPYKSEYGVQ RGSIMDMVIY
PGDPLTPGVG ATENAQRLER SAATNLLKIP VLPISYHDAA PLLAALEGPV APDAWRGALP
FTYHIGPGKA KVHLKLEFDW KMVPAYNVIA TMKGSQFPDQ WVIRGNHHDA WVYGAADPIS
GLSSLLEEAK AIGELAKNGY KPKRTLVYAA WDGEEPGLLG STEWVEAHAA ELQQKAVAYI
NSDGNSRGFL GVGGSHALEP FMGEIAKSIT DPQTKVSIFE RKQASDLVSA ASTKAKKDIL
AKKDMTISAL GSGSDYSSFL QHLGIPSLNV GFGGEGAGGE YHSIYDTYEN YSRFKDPGFE
YGVALSRLAG HAALRLADAD VLPFDFRSLS KTINGYTTDL LSLAEQMREN TAVENQIISN
NAYQLAGDVT KPLKAPVAKP EVPYIDFSKL QNALVALDKT AQHLQDARKP QLPAAQLEVL
NKALYQAEQQ LLHEQGLPNR AWYKHVIYAP GFYTGYGVKT MPGIREAIEQ RRWKEAEEQI
GIAATAINRL TDYLEKTFNS IRN
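
The protein sequence follows from the gene sequence by translation with table 11 (the bacterial, archaeal and plant plastid code). Table 11 assigns the same amino acid to each codon as the standard code and differs only in which codons may act as start codons, so a standard codon table is enough for a minimal sketch. The helper name `translate` is hypothetical, and the sketch assumes the sequence is already in frame and read on the coding strand.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G); '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(sequence: str) -> str:
    """Translate an in-frame coding sequence, stopping at the first stop codon."""
    bases = "".join(sequence.split()).upper()
    protein = []
    for i in range(0, len(bases) - 2, 3):
        aa = CODON_TABLE[bases[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# Usage: the first 30 bases of the gene sequence.
print(translate("ATGAATCTGA AGACAACTGC CAGCAGTATT"))
```

Applied to the full gene sequence, the result can be compared with the protein sequence once the spaces between blocks of ten residues are removed.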