Gene Cpin_4197 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCpin_4197 
Symbol 
ID8360370 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChitinophaga pinensis DSM 2588 
KingdomBacteria 
Replicon accessionNC_013132 
Strand
Start bp5246614 
End bp5249805 
Gene Length3192 bp 
Protein Length1063 aa 
Translation table11 
GC content48% 
IMG OID644966366 
Productpeptidase domain protein 
Protein accessionYP_003123855 
Protein GI256423202 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00000751128 
Plasmid hitchhikingNo 
Plasmid clonabilityunclonable 
 

Fosmid Coverage information

Num covering fosmid clones27 
Fosmid unclonability p-value0.547659 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGAAAAC TTCTACTCCT CGCAACTGCA CTACTGTGCA GTACACTGTC GTTTGCACAG 
GACTACTGGA GGCCACACAC CGACGGCGCC AGAATTAGTA CCGATAAAGC TGTAGCCCGT
CTTGCTTTCC CTACTGAGTT TAAACTGTTT GATCTCAATT TCACCCCTTT CAGAGATCAG
GCTTTCAGAT CGGTCGGTAA TAAGTCCGCA CATGCTACCA TTATTTCTTT ACCCAATGCC
GACGGACAGA TTGAACAATT CGAAATTACA GAAGCATCCA ATTTCGAACC TGCTTTACAA
GCCCGGTTCC CGGATATCAG AGCCTTTTCG GGAAAGGGTA TTACCGACAA ATATGCCACG
TTAAAACTGA GCATATCTCC CGAAGGTATA CAGACGACTG TATTCCGTAC CGGGAAAGAA
AATGAATTTA TTGAGCCGTA TTCTGCTGAC CATACAGTAT ATACTGTATT CAAGAAAAGA
GCGAATACCT TACCTTGGAA ATGTTCCACT CCTGAGCAAC AACTGGCGAC TGACATAGGC
AGCCGCATAC CTGATGTTGC AGGCCGGTCC ACCGGTGATG CCAAGACATT ACGGCTGGCA
CAATCGGTAA CCGCTGAGTA CTCCAATTAC TTTGGCGCTA CCAGCGCTTC ACAGGTAGCC
TTAGTACTGG CCGCTGTAAA CGCTACGCTG ACCCGTTGTA ACGGGGTGTA TGAGAAGGAT
CTTGCGATCC ATCTTAACCT GGTGGCTTCC ACTACCAGTG TATTCTATTA TAATGCTTCC
ACAGATCCGT ATTCTTCTGC CAGTTCAGGA GCCGGTGGCG CCTGGAACGG TGAATTACAG
AGTACCCTGA ACTCCGTAAT CGGTGCTGCC AACTACGATA TCGGTCACCT GTTTGGTGCA
TCCGGTGGTG GCGGTAACGC TGGTTGTATC GGTTGTATCT GTGTTGATAA TTCAAAAGGA
AGTGGATTCA CCTCTCCTGC TGATGCGATT CCACAGGGCG ATAATTTCGA TATCGACTAT
GTGGTACACG AAGTAGGTCA CCAGTTAGGC GCTAACCACA CTTTTTCTAT GAGCAATGAA
GGTACAGGTG TGAACGTTGA ACCTGGTTCA GGTATCACGA TCATGGGGTA TGCCGGTATC
ACCAGTCAGG ATCTGGCGCC GCATTCTATT GATATTTTCC ATGCGGCTTC CATTGCACAG
ATCCAGGCAA ACCTGGCAAC TAAGAGTTGT CCTGTAACAA CCGTTATATC TGGCAATAAC
GCTACGCCAG TAGTAAGTGC CGGTGGTAAT TTTACCATTC CTATCAGCAC GCCGTTTGTC
CTGACTGGTT CCGCTACTGA CGCTAATCCA TCTGATGTAC TGACATATGC CTGGGAACAA
TTTGACAATG CCTCTTCTTC ACAAACAGGC TCCAGCAGTG TGGCCAGTCC GACCAAAGCT
ACTGGTCCGA ACTGGATCTC ATTGCCGCCA GTTACCTCAC CTACCAGGTA CTTCCCTAAA
CTGGCGACCG TCCTCGCCGG TAACTTAGTC TCTGGTCCTT TAACCGGTGG TGATGCAGGC
GCGAATACAG AGGCATTAAG TGCTGTGTCC AGGACCCTGC GTTTCCGTTT AACAGTAAGA
GATAATGCCC CTTATAGTTC TACCGCGCCG GTGACTATCG GACAGACCAA CTTCTCGGAT
ATGACGGTTA CTGTCAGCAA TACCTCCGGA CCATTCTCCG TAACGGCTCC TAATACGGCT
GTTTCCTGGG CGGGTAATTC TTCCCAGAAT ATCACCTGGA ATGTGGCCAA TACAACCGCT
TCTCCCGTAA GTTGTGCGAA TGTTAAGATC TCCATTTCTA CTGACGGAGG TAATACATTC
AGCACCTTAG TTAGCAGTAC ACCGAATGAT GGTAGTCAGT CGGTTATCAT TCCTAACACA
CCAACAACGA CAGCCAGAAT AAAAGTCGAA TCTGTAGGAA ATATCTTCTT TGACATTTCC
AATACCAACT TCACGATTGT ATCCGGATCA TCCTGTACTT CTCCGACAGG ACTGAGCGAT
TCAGCGATTA CCGCTACTTC TGCTACGATC ACCTGGGCAG ATGTAAGTGG AGCAGTATCG
TATGACGTTG ATTACAAAGC AGTCGCAGAT ACTACCTGGA TCAGTGCTGC CGCAGGCATT
ACTGTTACAG GCGTTAACCT GACAGGTCTG CTGGCAGGAC ATACATATGA TTACAGAGTA
AGGTCACATT GTGCGAGCGA TAGTAGTTCT TATGCTGTCG GACAGCTCAC AACTACGCCA
AGTGCATCCT GTAGCGCACC TGGTGGTTTA AGCAGTACAT CGGTAACTAC GACTGCTGCT
GTTATCAGCT GGACGGCTGT AAGCGGTGCA TTGAGTTATG ATGTTGATTA TAAACCAGCT
GCAAGTGCTA CCTGGATCAA TGTCCAGGCT GGAGCGACAG CAACATTTGC GAACCTGACT
GGATTAACCG CTACCACTAC ATACGACTGG AGAGTGAGGA CAAACTGCTC CGATGGTAAT
GGCGCTTATG CGAGTGCACA GTTCACTACC CAAACACCGC CAACCTGTGC TAATAACAAG
GATACTGCTA CCAATGGTAA TACTGCCGGT GCAGCTACTA TTCCATTCAA TACGGATATT
ACGGGTCTTA TCAGTCCGTC TGGTGACATC GATCATTATA AGTTTGTGAT CACCACTGCA
GGTACGATCA CTATCACCCT GGGTACCTTA CCAGGCGACT ATGACCTGAA ACTGCTGAAC
AGTGCCGGTA CACAGCTGGC CATTTCACAG GCGGGTGGTA CGAGCAGTGA AACGATCAGC
AGAAATATGA CGCCAGGTAC TTATTATGCA CAGGTATATG GTTACAATGG CGCTAACAGT
ACTACATCTT GTTATACATT GCGCGTACAG TTAGGTACTG CCAGCCGTAG CACGGATGTA
AGCAGCGGAG ATCTCCCTAA AGTAGCGGTA TTCCCGAACC CTGCTAATAA CGTAGTGAAC
GTTAATCTGA CAGGCTTCAA AGGAAAATCT GCCGTAACGA TGTATGATGT CAATGGTCGT
GTGGTATTAC GCCGTGAAAT GAATCCGGTG AATACGCAGC TGGACATTTC CACATTACCG
ACAGGTGTTT ATATTCTGAA GATCAGGAAC GGTGCGAAAG AGGTGAATAT GACGAAGATC
ATCAAGCAAT AA
 
Protein sequence
MRKLLLLATA LLCSTLSFAQ DYWRPHTDGA RISTDKAVAR LAFPTEFKLF DLNFTPFRDQ 
AFRSVGNKSA HATIISLPNA DGQIEQFEIT EASNFEPALQ ARFPDIRAFS GKGITDKYAT
LKLSISPEGI QTTVFRTGKE NEFIEPYSAD HTVYTVFKKR ANTLPWKCST PEQQLATDIG
SRIPDVAGRS TGDAKTLRLA QSVTAEYSNY FGATSASQVA LVLAAVNATL TRCNGVYEKD
LAIHLNLVAS TTSVFYYNAS TDPYSSASSG AGGAWNGELQ STLNSVIGAA NYDIGHLFGA
SGGGGNAGCI GCICVDNSKG SGFTSPADAI PQGDNFDIDY VVHEVGHQLG ANHTFSMSNE
GTGVNVEPGS GITIMGYAGI TSQDLAPHSI DIFHAASIAQ IQANLATKSC PVTTVISGNN
ATPVVSAGGN FTIPISTPFV LTGSATDANP SDVLTYAWEQ FDNASSSQTG SSSVASPTKA
TGPNWISLPP VTSPTRYFPK LATVLAGNLV SGPLTGGDAG ANTEALSAVS RTLRFRLTVR
DNAPYSSTAP VTIGQTNFSD MTVTVSNTSG PFSVTAPNTA VSWAGNSSQN ITWNVANTTA
SPVSCANVKI SISTDGGNTF STLVSSTPND GSQSVIIPNT PTTTARIKVE SVGNIFFDIS
NTNFTIVSGS SCTSPTGLSD SAITATSATI TWADVSGAVS YDVDYKAVAD TTWISAAAGI
TVTGVNLTGL LAGHTYDYRV RSHCASDSSS YAVGQLTTTP SASCSAPGGL SSTSVTTTAA
VISWTAVSGA LSYDVDYKPA ASATWINVQA GATATFANLT GLTATTTYDW RVRTNCSDGN
GAYASAQFTT QTPPTCANNK DTATNGNTAG AATIPFNTDI TGLISPSGDI DHYKFVITTA
GTITITLGTL PGDYDLKLLN SAGTQLAISQ AGGTSSETIS RNMTPGTYYA QVYGYNGANS
TTSCYTLRVQ LGTASRSTDV SSGDLPKVAV FPNPANNVVN VNLTGFKGKS AVTMYDVNGR
VVLRREMNPV NTQLDISTLP TGVYILKIRN GAKEVNMTKI IKQ