Gene Cpin_5242 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCpin_5242 
Symbol 
ID8361419 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChitinophaga pinensis DSM 2588 
KingdomBacteria 
Replicon accessionNC_013132 
Strand
Start bp6629912 
End bp6631702 
Gene Length1791 bp 
Protein Length596 aa 
Translation table11 
GC content49% 
IMG OID644967390 
Producthypothetical protein 
Protein accessionYP_003124874 
Protein GI256424221 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00000221753 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0000000135366 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGGAACAAC AACACGCCAT AGGAAAACTG GTGGTGGAGG TACAGTTACC TTCCAGGGAT 
GATGCTTTTT CAATACAGCA GGCACTGAGT ATGCGTTGTA ATGTGGAACT GACACCAATG
TTAAGTCAAT TACTGGATAG CTGGGCGGAT CCTGATACGT TGTTACAGAT TGACAAGTTG
GAGGTGGATC TGGGTAATTG CAATATGAAG GCTTTACAGG AAGAGTTACC TGGAATGGTG
ATCGCTTATC TGCAGAAGCA TTTTCAGGAT GCGACAGCAG TGCCATCACC GGAGTTAAAT
ATACAGCAGA TGCAGCGTCG TCCGCTGACA CAGGGATATT TTGAGAGCTG GCTGTATTTC
ATGCAACATG GCGTATTGCC ATCTACAGCT GTCAGATGGG AACAGGAGGA ATGGGAAGCG
GGTATTCTGA CGGCGCTGGC TACAGAAACG AGTGCTTTGA AACAATGCCA GGACCTGCTG
TCGGCGCATC CTTATACGGT ACGCCGCCTG GTGATGCAGT TCTCCCGGAG TTTTGTATAC
AACTGGGTGC TCGCTTTCAG TGCGGGCGCT TACAGACAAC AGCTGGTACT GATAGATGAA
TGGGACGCGT TTGTGTTTAG TCCGCGGTTT GTTACCGTGA TGAAAGCACA GATTGCGCAG
CAATCGCTTC CTTCCCTTCC ATTACCAGCG CATAGTCATT ATGCAACGCA GGTGATGGAG
TGGCTGATAG GAGATATCGT GATTGCGGGA AAAGCGATCA GTCACCCGGC ATTACTGGAG
CAATTAGTAC GATTGCTAGG TAGTGTATCA CAATTGCCGG CGCGTTTATT TACACTGGAA
AGAGTGACTG CTGCGGGTGT CAGTATGCCT GTAGCCGTAC ATGAAGCCAT AGGTGCTGTG
GCTGGCAGAT ATTCAGCGGA GATGACGGAA CTCAGAACGA CGTTCAGTAC AACGCCAGCT
ATTAAAAGTA TTGCGCCTGA TGGTCGTACA AAAGATCAAC AGGAGGCTAC AGAACGTCTT
CGGCGGGAGC AACAGCTTGC GGCTGCATTA CGCAGGAAGC AGGAGCAGGA TACGGCGGCA
AAACAATCTT CTTCATCAGG TGCGCCGTCC TCAGAGGCAG CTGATCATAA TAAGCGCAAA
GATCAAAAGG ATCAAAGCAC ATCCGGAGAA AGTCCGACGG CGGAAGAGCG CAGTATGGCT
GATAAAGGAA CTGTAAGTAA TGAACAAAAG CATTCATCTG ATACCGAAAC GGGAGAGGAG
CCTGCTCCGG CAGGTCTGCC TGAAGAAGGA ACGGTGTATT ACATCAATAA TGCCGGACTG
ATATTGCTGC ATCCCTATCT TTCTTATTGT TTCGATGCAT TGGAACTGAG AGAGGGACAG
CAGTTTAAAG ATGAAACGGC AAAACATAAA GCCGTACAGC TCATTGGCTA TATGGCCTAT
GGTGAAGAGG AGATTCCGGA GTGGGATCTG GTGTTGCCAA AGATCTTGTG TGGTATATCA
CCTACGGAAC CTGTGAGACG TTTCATGCCA TTATCCGAAG CAGAAAAAGA TGAAGCCAAT
CAGTTACTGG GCGCTGTGAT CACACACTGG AATGCTTTGG GTAATACCTC ACCTGACGGC
TTACGCGGCA ATTTCTTATT GCGGGAAGGT AAGCTGGAAT GGAAAGAAGA GGAGTGGCAG
TTGTTTGTCA CGCAGCAGGC ATATGATATG CTGTTGAACC GTTTGCCCTG GGGTTTCAGT
GTGGTAGGAT TGTCGTGGAT GCCTTGGCTG ATTAAGACCG TCTGGGTCTA G
 
Protein sequence
MEQQHAIGKL VVEVQLPSRD DAFSIQQALS MRCNVELTPM LSQLLDSWAD PDTLLQIDKL 
EVDLGNCNMK ALQEELPGMV IAYLQKHFQD ATAVPSPELN IQQMQRRPLT QGYFESWLYF
MQHGVLPSTA VRWEQEEWEA GILTALATET SALKQCQDLL SAHPYTVRRL VMQFSRSFVY
NWVLAFSAGA YRQQLVLIDE WDAFVFSPRF VTVMKAQIAQ QSLPSLPLPA HSHYATQVME
WLIGDIVIAG KAISHPALLE QLVRLLGSVS QLPARLFTLE RVTAAGVSMP VAVHEAIGAV
AGRYSAEMTE LRTTFSTTPA IKSIAPDGRT KDQQEATERL RREQQLAAAL RRKQEQDTAA
KQSSSSGAPS SEAADHNKRK DQKDQSTSGE SPTAEERSMA DKGTVSNEQK HSSDTETGEE
PAPAGLPEEG TVYYINNAGL ILLHPYLSYC FDALELREGQ QFKDETAKHK AVQLIGYMAY
GEEEIPEWDL VLPKILCGIS PTEPVRRFMP LSEAEKDEAN QLLGAVITHW NALGNTSPDG
LRGNFLLREG KLEWKEEEWQ LFVTQQAYDM LLNRLPWGFS VVGLSWMPWL IKTVWV