Gene Cpin_3040 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCpin_3040 
Symbol 
ID8359205 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChitinophaga pinensis DSM 2588 
KingdomBacteria 
Replicon accessionNC_013132 
Strand
Start bp3751708 
End bp3753972 
Gene Length2265 bp 
Protein Length754 aa 
Translation table11 
GC content45% 
IMG OID644965219 
ProductABC transporter related 
Protein accessionYP_003122715 
Protein GI256422062 
COG category[L] Replication, recombination and repair 
COG ID[COG0178] Excinuclease ATPase subunit 
TIGRFAM ID[TIGR00630] excinuclease ABC, A subunit 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones35 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAATCGA ATAACCGTAT GATAAAGGTA TCAGGAGCCC GCCAGAATAA CCTCAAAAAT 
ATCTCACTGG AGATCCCTAA GGGCCGGATT GTTGTTTTTA CCGGGGTATC AGGATCTGGA
AAATCATCCC TGGTTTTCGA AACGATCGGC GCAGAGGCAC AGCGACAGGT AAATGAAACC
CAGCATAGCT TCACCAGGAA CCGGTTGAGC CATCTCGGTA TTCCGGATGT CGATAAGATT
GAACACCTGA ATGTTCCTGT TATCATCAGT CAGAAACGGA TAGGAGGTAA TGCCAGGTCT
ACAGTCGGGA CCGCAACTGA CATCTACGCC TCTTTACGGC TTTTGTTTTC TAGGATGGGA
AACCCTTTTG TAGGTTATTC AAGTATCTTT TCTTTTAACC ACCCCCAGGG TATGTGTCCC
ACCTGCGAAG GACTGGGATT TATAAGTACG ATTAACACAG ATGCACTGCT TGACAAGGAT
AAGTCACTGA ATGAAGGCCC CATATTGTTC CCTACCTTCC AGCCTGGAGG CTATAGATGG
ATCCGTTATG CCCATTCAGG TTATTTTGAC CGGGATAAAA AACTAAGAGA CTATAGTAAG
AAAGAACTGG AATTACTACT CTATGCGGAG GAACATAAGC CACCGCATCC GGATAAGTCT
TGGGGCAAAA CAATGCAGTA TATCGGCTTA CTGCCAAGGA TTAAAAACGA CTTTCTTAAA
AAGGAATCTA AAGAACATCA TCTGCGGAAA AAGGAACTGC AGAAAATTAT AAACACGGTA
AGCTGTCCTG ATTGTAAGGG CAAAAGGCTT AACAAAACGG TACTTTCCTG TAAAATCAAG
GGAATGGATA TTGCAGATTG TTCTGCAATG CCTGTTGGCG AACTACTCGC ATTTATTCAT
ACGCTCTCTT CGAAGTCCTT TAGCGCAATA CGGTACGAGT TGCAGAAAAA GCTGGAGAAT
GTAGTCACCA TAGGGTTGCA GTATCTTACA CTGGACAGAA CCACTGATAC GCTTTCAGGC
GGTGAGTCAC AACGAATCAA GATGGTCAGA CATCTGGGAA ATAGTATCGA GGATCTGCTG
TACATTTTCG ACGAACCAAG TATTGGTCTA CATGCAAAGG ATCTGGATAA CATCTCAAAG
ATCATCCGGA AAATAAAGGA AAAAGGAAAC TCGGTACTGA TTGTAGAGCA TGACCCGGAT
ATCATCAAGA TCGCGGATCA CGTAATAGAT GTAGGTCCGC TGTCAGGTGT CCATGGGGGA
GAAATTATTT ATGAAGGATC GTTTAAAGGA TTGCTCGCTT CAAAGGGTAA GACCGGGAAG
TATTTTTCAC AGAAACGTAC ATACAGAGAC AACCCACGTA CGGCTCAGAC ATCCATCAGG
ATAGAGAATG CCAGCCTGTT CAATCTTAAG AATATTACAG TGGATATTCC CAGGAACGTA
TTAACGGTCG TTACCGGCGT CGCCGGTTCT GGCAAGAGTA CACTGATAGG GAAGGTGTTG
CCGCTACAAT TGCCGGAGGT GAAAATAATC AACCAGTCTC CGATTACAGG CAGTCAAAGA
AGTACATTAC TGACCTATCT GGACCTGTCT GACAAACTAA GAAGATTGTT TGCTTCAGCA
AACAAGGTCA GTGACCGGCT TTTCAGTACC AACAGTCTCG GCGCATGTCC TGATTGTAAA
GGCTTGGGGG TTGAAAAAAT AGACCTTGCC TTTATGGATG ATATTGAACA ACCATGCGAC
GTCTGCCATG GAACCGGATT TGATCCTAAA GTCTTAAAGT ATAAATTCCT GCATAAGAAT
ATAGCAGAGG TGCTGAATCT GACAGTAGAG GAAGCGATGA CCTTTTTTGA GCAATATGAT
TTTGTCACAC ATTTTGAACT GCTGATTGCA CTCGGTCTCG GGTACCTGAA ACTAGGACAA
CGCCTCAGCA CATTTTCAGG TGGTGAAAGG CAACGTTTAA AGTTAAGCGC CGAGTTGGCG
GAAGCCAATA AAATACTCGT ACTTGACGAA CCAAGTACCG GACTTCATCC GGCAGACACC
GGGCAACTAC TGCAAGTCCT GGACAGGCTG GTAGAGCAGG GCAATACCGT GATTGTCATT
GAGCATAACC TGGATGTCAT CGCACAGGCA GATTGGATCC TGGATATTGG CCCCGGAGCC
GGTAAATATG GCGGTGAACT GGTCTTCCAG GGAAATGTCA GGCAGCTGCT GGAAAATAAG
GTATCTGAGA CGGCGAAGTA TTTAAAACAA CACTTAATTG CTTAG
 
Protein sequence
MKSNNRMIKV SGARQNNLKN ISLEIPKGRI VVFTGVSGSG KSSLVFETIG AEAQRQVNET 
QHSFTRNRLS HLGIPDVDKI EHLNVPVIIS QKRIGGNARS TVGTATDIYA SLRLLFSRMG
NPFVGYSSIF SFNHPQGMCP TCEGLGFIST INTDALLDKD KSLNEGPILF PTFQPGGYRW
IRYAHSGYFD RDKKLRDYSK KELELLLYAE EHKPPHPDKS WGKTMQYIGL LPRIKNDFLK
KESKEHHLRK KELQKIINTV SCPDCKGKRL NKTVLSCKIK GMDIADCSAM PVGELLAFIH
TLSSKSFSAI RYELQKKLEN VVTIGLQYLT LDRTTDTLSG GESQRIKMVR HLGNSIEDLL
YIFDEPSIGL HAKDLDNISK IIRKIKEKGN SVLIVEHDPD IIKIADHVID VGPLSGVHGG
EIIYEGSFKG LLASKGKTGK YFSQKRTYRD NPRTAQTSIR IENASLFNLK NITVDIPRNV
LTVVTGVAGS GKSTLIGKVL PLQLPEVKII NQSPITGSQR STLLTYLDLS DKLRRLFASA
NKVSDRLFST NSLGACPDCK GLGVEKIDLA FMDDIEQPCD VCHGTGFDPK VLKYKFLHKN
IAEVLNLTVE EAMTFFEQYD FVTHFELLIA LGLGYLKLGQ RLSTFSGGER QRLKLSAELA
EANKILVLDE PSTGLHPADT GQLLQVLDRL VEQGNTVIVI EHNLDVIAQA DWILDIGPGA
GKYGGELVFQ GNVRQLLENK VSETAKYLKQ HLIA