Gene Cpin_4000 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCpin_4000 
Symbol 
ID8360173 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChitinophaga pinensis DSM 2588 
KingdomBacteria 
Replicon accessionNC_013132 
Strand
Start bp4976737 
End bp4978665 
Gene Length1929 bp 
Protein Length642 aa 
Translation table11 
GC content47% 
IMG OID644966174 
Producttransglutaminase domain protein 
Protein accessionYP_003123663 
Protein GI256423010 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.000741085 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value0.0455178 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTTTTCGT TTTTAACCAA AAAATACGCA TCCGCAGTTT GTTACTGTGT ATGCGTATTA 
TTGTCCATAG GCCCGGCTTT TGCCGGAGAC CCGATTTATC CGGCCATGCT GATTCCAGAT
TCTCTGAAGA AGAATGCACA CGCGGTAACG CGTCTGGAAG AGGTGACCGT AAAGGTCAAT
GATCCCCGGG ATGTGCGTAT GACCATGCAC TATATTGTTA CGGTGCTGGA TGCGGAAGGG
GAGAAGTTTG CCTATTTCGC CGGTGGCTAT GATAAATTGA CGGAAATCCG TTCTATTAAA
GGCACCTTAT ACGACGGACT GGGATTACCC ATCAAAAAGC TTAAACAGAG CGATATACAG
GATCTGAGCG GCACTGGCGG TGATCTGATG ACGGATGACC GTATTAAACG CCACGTCTTT
TATCACAACC TGTATCCGCA CACCGTGGAG TATGAGGTGG AAATCAGGTA TAATCATAGC
TATTACCTTC CAAAATGGCG TCCGCAGGAC GACGAATCCA TTGCGGTAGA GCAGAGTAAA
CTGACCGTGA TTACGCCAAA GGACTATTTA CTCCGCTATA AGGCGCTCAA TTATAAAGGC
GAACCCTTAT TGGGAAATGA CGGATCCGAC CGTACTTATA CCTGGGAGGC GAAGAACCTT
TGTGCTGTTC CGGAAGAACC TTATGCGCCA CATTGGAGCA CCCGTTCTAT ATCGGTGCTG
CTGGCTCCGG CGTCATTTGA AATGGCACAG TATAAGGGGA CGATGAATAC CTGGGAAGAA
TTCGGGAAAT TCTCCTATAT ACTGAATCAG GGAAGAGACG TACTGCCGGA TAATATAAAG
CAGACCGTAC ATCAGCTGAC GGATGGTCTA GCCCGCGAGC AAAAGATCTC GAAGCTCTAT
GAATATCTGC AACAGCATAC CCGTTATATC AGTGTACAAT TGGGTATAGG CGGCTGGCAG
ACTTTCGATG CTGCTTATGT GGCCTCCAAA GGGTATGGCG ACTGTAAGGC ACTTTCCAAT
TATATGTGTG CAATGCTGAA GGAAGCGGGT GTTAAAGCCT CCTGTGTGCT GGTATATGCC
GGGGAAGACA GGAATGATGT AACACTGGCA GATTTTCCTT CGCCCAGTTT CAATCACGTA
ATCGTATGCG TACCCGATAC AAAAGATACG ACATGGCTGG AATGCACCAG CAGTACAGTG
CCTCTGGGCT ATATGGGAGA ATTTACCGGT AACAGGTCTG TGCTGATCGT AGACGAAAAC
GGGGGGAAAC TGGTACGTAC ACCTGTCTAT TCTATGGAGC AGAATGTACA GACACGTAAT
ATCGTCGCTA AAGTGGAGGA ATCCGGCGAA ATGAGTGTCA GGGCTAATAG CCGTTATAGC
GCATTACAGA CGGATGACCT GCATTCAGCA CTGAACAGCC TTACCAAAGA GAAACTGATG
GAGGCGCTGA AACAAGTGGG CTTTTTCCCC AGTTACGAGG TGAAAAGTTA TGACTGGAAG
GAAACTAAGT CTGTGTTGCC ATATATTGAC GAGCGGATCG AAATTACTGC CCGTAACTAT
GCTACGATCA CGGGTAAACG CATGTTCATC GAGCCTAACC TGATGAACAA GACTTCAAAG
CGATTATCTG TTGATTCCGT ACGCAGGGCA GACATCTACC TGAGTCATTC CTATCGTGAT
ATCGATACCG TAAAGATCAC TATTCCGGAG GGTTATACAC CTGAAGCGAT GCCTCAGCCA
ATGACCTTAG AGAGTCCTTT CGGGGTTTAT TCTTCAAAAG TGAGTATCGA AGGGAATGTG
ATCACTTATA TCCGTTCTAT TGATCATAAA GGAGGTACTT ACCCTGCCAG TTCCTATGGG
GAGCTGGCAA AGTTCTATAA TAGTATGTAT AAAGCAGACA GGAGCAGGAT CGTTCTCGTC
AAAAAGTGA
 
Protein sequence
MFSFLTKKYA SAVCYCVCVL LSIGPAFAGD PIYPAMLIPD SLKKNAHAVT RLEEVTVKVN 
DPRDVRMTMH YIVTVLDAEG EKFAYFAGGY DKLTEIRSIK GTLYDGLGLP IKKLKQSDIQ
DLSGTGGDLM TDDRIKRHVF YHNLYPHTVE YEVEIRYNHS YYLPKWRPQD DESIAVEQSK
LTVITPKDYL LRYKALNYKG EPLLGNDGSD RTYTWEAKNL CAVPEEPYAP HWSTRSISVL
LAPASFEMAQ YKGTMNTWEE FGKFSYILNQ GRDVLPDNIK QTVHQLTDGL AREQKISKLY
EYLQQHTRYI SVQLGIGGWQ TFDAAYVASK GYGDCKALSN YMCAMLKEAG VKASCVLVYA
GEDRNDVTLA DFPSPSFNHV IVCVPDTKDT TWLECTSSTV PLGYMGEFTG NRSVLIVDEN
GGKLVRTPVY SMEQNVQTRN IVAKVEESGE MSVRANSRYS ALQTDDLHSA LNSLTKEKLM
EALKQVGFFP SYEVKSYDWK ETKSVLPYID ERIEITARNY ATITGKRMFI EPNLMNKTSK
RLSVDSVRRA DIYLSHSYRD IDTVKITIPE GYTPEAMPQP MTLESPFGVY SSKVSIEGNV
ITYIRSIDHK GGTYPASSYG ELAKFYNSMY KADRSRIVLV KK