Gene Cpin_3594 details


Gene Information

Locus tag: Cpin_3594
Symbol:
ID: 8359761
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Chitinophaga pinensis DSM 2588
Kingdom: Bacteria
Replicon accession: NC_013132
Strand:
Start bp: 4480934
End bp: 4484236
Gene Length: 3303 bp
Protein Length: 1100 aa
Translation table: 11
GC content: 48%
IMG OID: 644965764
Product: Tetratricopeptide TPR_2 repeat protein
Protein accession: YP_003123258
Protein GI: 256422605
COG category:
COG ID:
TIGRFAM ID:
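The coordinates and lengths in this record agree with one another, which a few lines of arithmetic can confirm. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive and that the gene length counts the terminal stop codon; the record states neither, but both assumptions reproduce the listed values. The variable names are illustrative only.

```python
# Values copied from the Gene Information record.
start_bp = 4480934
end_bp = 4484236
gene_length_bp = 3303
protein_length_aa = 1100

# Assumption: coordinates are 1-based and inclusive,
# so the span is end - start + 1.
span_bp = end_bp - start_bp + 1

# Assumption: the gene length includes the stop codon,
# which does not contribute an amino acid to the protein.
codons, remainder = divmod(gene_length_bp, 3)
coding_codons = codons - 1

print(span_bp, codons, remainder, coding_codons)
```

Under these assumptions the span equals the gene length, the length is a whole number of codons, and the number of coding codons equals the protein length.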


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.29124
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 10
Fosmid unclonability p-value: 0.0000227347
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGGAAGTGA CAGTCCGGAT TGAAAAAGTA TCCATTCCTA CCTACAAGAC AGGCACACCG 
GAAAAGCATC CTATGTTCCT GGAGAAAAGA GTTTACCAGG GCAGCAGTGG CGTTGTGTAC
CCACATCCCG TTATTGAGCA GATTGCAGAT GAGAAAAGTT CGCAGGAATA CAATGCGGTA
TTCCTCGAAA ATGAATTCCT GGAGATCATG ATCCTCCCCG AACTGGGCGG ACGTATCCAA
AGGGCATATG ATAAAGTCAG ACAGCGTGAT TTTGTGTATT ATAACCAGGT GATTAAGCCG
GCCCTGGTAG GACTGGCGGG TCCCTGGATC TCTGGTGGTA TTGAATTCAA CTGGCCACAG
CACCATCGTC CGAGCACCTT CCAGGCAGTA GATTATGCCG TAGAAGAAAA TAATGATGGC
AGCAAAACCG TGTGGATGAA TGAAGTGGAA GTGATGTTCC GGACTAAAGG AATGGCAGGT
TTTACCCTGT ATCCTGACAA GGCATATCTT GAGATAAAAG GCCGTTTATT CAACCGGACG
ATGTTCCCCC AAACCTTCCT CTGGTGGGCT AATCCGGCAG TAAAAGTGAA TGACCACTAT
CAGTCCGTAT TTCCACCAGA TGTATACGCC GTATTTGATC ACGGAAAAAG AGATGTATCT
GATTTCCCGA TCGCAACCGG TACCTATTAC AAAGTAGACT ATGCACCGGG TACAGATATC
TCCCGCTACC ATACCATTCC TGTACCGACC TCCTATATGG CCATCCGCTC GGAATACGAT
TTCATGGGTT GTTATGAGCA CGATACACAG GCGGGAATGC TCCATGTAGC AGACCATCAT
GTATCACCGG GCAAAAAACA ATGGACATGG GGGAATGGTG ACTTCGGCTA TGCATGGGAC
AGAAACCTGA CTGACGAAGA CGGTCCGTAT ATCGAACTCA TGACCGGTAT GTTTACCGAC
AATCAGCCCG ACTTCTCCTG GTTACAGCCC AATGAAGGCA AGACCTTTGA GCAGTATTTT
ATGCCTTATG CGCAGACGGG CGTCGTGAAG AATGCAACCA AAGAAGCGAT GCTCAATATG
GAGTGGGAAG GACAACAACT GCACATCAAA ATATATGCAA CTGCCCTCTA TAAAGACGCC
AGCGTACGCG TATTACATGA TGGTAAGGAA GTAAAAAACT TCCCGGCCGC CCTCTCTCCA
TACGCACCGT TTTCAACTAC CTATGATTGC GGCGCTGCTG TAGTACCCGA ACAATGGAAG
GTGATCGTTG CAGATCAAAA CGGCCGTGAA CTTGTCTCCT GGCAACCGGC CGCACCTGTG
ACACACGAAA TACCACCACC TGCATCCGCT GCAAAATTGC CTAAAGACAT TGAGCAGGTA
GAAGACCTTT ATCTGAATGG TCATCACCTG GAACAATACC GTCATGCAAC ATTCAATCCG
GTTGATTACT ACGAAGAAGG ATTGAGAAGA AGTCCGGGAG ACATCCGCTG TAACAATGCC
ATGGGCTTAT TACTGCTGCG TCGCGGTCAA TTTGCCAAAG CGGAACCTTA TTTCAGAACG
GCGATCAAAA CCATGACCAG CCGTAATCCG AATCCTTATG ACGGCGAAGT ATATTATAAC
CTTGGTTGTT CGCTGCTCTT ACAGGACAAA TCAGCCGAAG CATATGATAT ATTCTTCAAA
GCCACCTGGA ATGATGCATG GCAACACAGC GGTTTCCTCA TGCTGGCACG TATCGCAACT
TCCAATGGTA AATGGGATGA TGCTTTGTCA CTGGTAAAAA AATCACTTGT GCGCAATTAT
CACAACCATA CTGCCCGTCA CCTGCAAACG ATCATCCTGC GTGAGAAAGG ATTACAGGAA
GATGCGATTG CCTTCGCGAA GGAATCCATA CAGATCGATC CGTTTAACTA CGGCTGTCTG
TTTGAATGGT ACCTGATCAG CAATGACAGC AGTGTATTAC AACGCATGCG TTCACTGATG
CACGGTTCCA TCAACAACTA CCTGGAACTG TCACTGGACT ATGCTTATGC AGGCGCTTAC
GATACCGCCA TCACCTTGTT ACAATCTTAT ATCGATACTT ACGACCGTGC TTCCCCGCTC
CTGTACTACT ATATGGGATG GTTTGCCAGC CGTAGTCATT TACCAGAGCA GGCACTGAGC
TTCTATCGCA AAGCCGCCAC GCAATCCCCC GATTACTGCT TCCCTAACAA AACAGAAGAA
GTACTGATCC TTCAGGATGC ATTGTTGCTC AACCCTACAG ACGCCAAAGC CGCCTATTAC
CTGGGTAACC TCTGGTATGA CAAACGTCAG TATACAGCAG CGATCGCCTG TTGGGATCAA
TCCGCCACAC TGGATGATAC CTATGCCACC GTGCACAGGA ACCTTTCACT GGCTTACTAC
AACAAATTGC ATCAGAAAGC GGCTGCTGTC AATGCCATGG AAAAAGCCTT TACACTGGCG
CCGGATGATG CCCGTATACT CATGGAACTG GACCAGCTGT ATAAGATCAC CGGCCGGTCT
TCACAGGAAA GATTATCACT GCTGGAAAAC AATATGAAGC TGGTAGCAAT GAGAGACGAT
CTGTACCTGG AAAGAATCAC GCTTTACAAT AACTTAGGAG AGTTTGAGCA GGCCCGCAGA
CTCCTGTCAC AACGTAAATT CCATCCATGG GAAGGTGGTG AAGGAAAGGT AGTAGGACAA
TATATCCTTT GTCATACTGC GCTCGCAAGA AAGGCGATTG ATCAGGGACA ATATGAAAAC
GCTCTCTCTC TCCTGCAGAC CCTGGAAAGC TATCCGGAGA ACCTTGGTGA AGGCAAACTC
TATGGTGCAC AGGAAAATGA CCTTCATTAT CTGAGGGGCT GTGCTTATGC AGGTCTGGGT
CAACAGGAAG CAGCAAACGA ACAATTCCTG GCAGCTACTG TTGGTATCAG CGAACCTGTG
CAGGCGATCT ATTACAATGA TCCGCAGCCA GACAAAATCG TTTATCAGGC ACTGGCCTGG
CAGCAACTGG GACAGCCGGA AAAAGCAGCC CGGATCTTTG ACCGCTTCAT ACAATTTGGC
AAGGCACATC TGCATGATGA AATCCGCATA GATTATTTCG CCGTTTCACT ACCGGATATG
CTGGTATTTG ATATCGACCT TAACCAGCGT AACCGTATTC ACTGTCTTTA CCTGATGGGG
TTAGGCTACC TGGGACTACA ACAGGAACAA CAGGGTCAGC AACATCTCGA CCAGGTATTG
ATGCTGGATG TAAATCATCA GGGCGCTACT TTTAATCCAT TTACCAGGAC TATTCTATGT
TAA
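
The GC content reported for this gene is the share of G and C bases in the nucleotide sequence. The following is a minimal sketch of that calculation, assuming the sequence is supplied in the space-separated blocks of ten bases used here; the function name `gc_fraction` is hypothetical. It is run on the first line of the gene sequence only, so its result is not expected to match the whole-gene figure of 48%.

```python
def gc_fraction(seq: str) -> float:
    """Return the fraction of G and C bases, ignoring whitespace."""
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return gc_count / len(bases)


# First 60 bases of the gene sequence, copied from the record.
first_line = "ATGGAAGTGA CAGTCCGGAT TGAAAAAGTA TCCATTCCTA CCTACAAGAC AGGCACACCG"
print(round(gc_fraction(first_line), 3))
```

Passing the full 3303 bp sequence instead of `first_line` would give the whole-gene value.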
 
Protein sequence
MEVTVRIEKV SIPTYKTGTP EKHPMFLEKR VYQGSSGVVY PHPVIEQIAD EKSSQEYNAV 
FLENEFLEIM ILPELGGRIQ RAYDKVRQRD FVYYNQVIKP ALVGLAGPWI SGGIEFNWPQ
HHRPSTFQAV DYAVEENNDG SKTVWMNEVE VMFRTKGMAG FTLYPDKAYL EIKGRLFNRT
MFPQTFLWWA NPAVKVNDHY QSVFPPDVYA VFDHGKRDVS DFPIATGTYY KVDYAPGTDI
SRYHTIPVPT SYMAIRSEYD FMGCYEHDTQ AGMLHVADHH VSPGKKQWTW GNGDFGYAWD
RNLTDEDGPY IELMTGMFTD NQPDFSWLQP NEGKTFEQYF MPYAQTGVVK NATKEAMLNM
EWEGQQLHIK IYATALYKDA SVRVLHDGKE VKNFPAALSP YAPFSTTYDC GAAVVPEQWK
VIVADQNGRE LVSWQPAAPV THEIPPPASA AKLPKDIEQV EDLYLNGHHL EQYRHATFNP
VDYYEEGLRR SPGDIRCNNA MGLLLLRRGQ FAKAEPYFRT AIKTMTSRNP NPYDGEVYYN
LGCSLLLQDK SAEAYDIFFK ATWNDAWQHS GFLMLARIAT SNGKWDDALS LVKKSLVRNY
HNHTARHLQT IILREKGLQE DAIAFAKESI QIDPFNYGCL FEWYLISNDS SVLQRMRSLM
HGSINNYLEL SLDYAYAGAY DTAITLLQSY IDTYDRASPL LYYYMGWFAS RSHLPEQALS
FYRKAATQSP DYCFPNKTEE VLILQDALLL NPTDAKAAYY LGNLWYDKRQ YTAAIACWDQ
SATLDDTYAT VHRNLSLAYY NKLHQKAAAV NAMEKAFTLA PDDARILMEL DQLYKITGRS
SQERLSLLEN NMKLVAMRDD LYLERITLYN NLGEFEQARR LLSQRKFHPW EGGEGKVVGQ
YILCHTALAR KAIDQGQYEN ALSLLQTLES YPENLGEGKL YGAQENDLHY LRGCAYAGLG
QQEAANEQFL AATVGISEPV QAIYYNDPQP DKIVYQALAW QQLGQPEKAA RIFDRFIQFG
KAHLHDEIRI DYFAVSLPDM LVFDIDLNQR NRIHCLYLMG LGYLGLQQEQ QGQQHLDQVL
MLDVNHQGAT FNPFTRTILC
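
The protein sequence corresponds to the gene sequence read codon by codon under translation table 11. The following is a minimal sketch of that translation, assuming the standard codon-to-amino-acid assignments, which table 11 shares with the standard code for internal codons. The two tables differ in which codons may act as start codons, and that does not matter here because this gene begins with ATG. The names `CODON_TABLE` and `translate` are hypothetical.

```python
# Codon table in the conventional T, C, A, G ordering of the NCBI
# genetic code listings; "*" marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(seq: str) -> str:
    """Translate a DNA sequence from its first base, stopping at the first stop codon."""
    dna = "".join(seq.split()).upper()
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 60 bases of the gene sequence, copied from the record.
first_line = "ATGGAAGTGA CAGTCCGGAT TGAAAAAGTA TCCATTCCTA CCTACAAGAC AGGCACACCG"
print(translate(first_line))
```

The output can be compared with the first two blocks of the protein sequence above. Only the first line of the gene is translated here; the full sequence can be passed to the same function.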