Gene Cpin_3401 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCpin_3401 
Symbol 
ID8359567 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChitinophaga pinensis DSM 2588 
KingdomBacteria 
Replicon accessionNC_013132 
Strand
Start bp4192243 
End bp4195497 
Gene Length3255 bp 
Protein Length1084 aa 
Translation table11 
GC content47% 
IMG OID644965574 
ProductAMP-dependent synthetase and ligase 
Protein accessionYP_003123069 
Protein GI256422416 
COG category[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG1020] Non-ribosomal peptide synthetase modules and related proteins 
TIGRFAM ID[TIGR01733] amino acid adenylation domain 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones25 
Fosmid unclonability p-value0.369364 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAATAAGA GAATCATTCA TACCGTGTTT GATCAGCAGG CTGTTACAGC TCCGGAAAGG 
GTGGCTGTTG AAGAAGGCCG TCGATCATGG ACCTATAAAG AGCTGAAAGA AAGCTCCGAA
GCATTGACGC AGTACCTGCT GCATTATAAT CCGGCAACAG GTACACCCGT CGCTGTGATG
CTGCCTCCTG GTTTTTCCCT GGTAAGCGCA CTGCTGGCAG TTTTTCGTTC CGGTAATATT
TATATGCCCC TTGATGCATC ATTGCCCGCA AAAAAACTAC ATACCATCTT TGAACAGACA
CGTCCTGCGG TATGTGTCAC TACCCTTGCA CTGGCATCAG TAGCGGAGAA TATTATCCGT
GAACAGTCTG CATTCAACTG CACAATGATC GTCCTGGACG AGGAGCTGCC GGTTACGGTA
AAACATTTTG AAGACAGCAC TTACGCCGGT GCAGAAACAC CAGGCAGTAT TGCCTTTCAG
CCTTTCCCGG AGATCAGACC AGATGATGCC AATTATATTA TTTATACCAG TGGATCGACC
GGAGAAGCGA AAGCAATAGT AGGCTGCCAT GACAGTCTGA GTCATTTTAT TCATTGGGAA
ATGAATGAGT TTAAACTGGA CAACGGCGTT CGCGTCAGCC AGTTGAGTCA GTTCACTTTT
GACGCTTCCC TGCGTGATGT CTTCGTACCC CTGAGTATCG GAGGTACTTT ATGTTTTCCG
CCTGCCGGTA GCCGTACCAA CATCCCCCTG CTGATAGAGT GGCTGGAAGA ATCGCAGATC
AACCTGGTGC ATTGTGTGCC TTCTATCTTC AAACTGATTA CCAGATCACT CAATACTAAT
ACCGGCAAAC AGTTATTACC TGCCCTGAAA CATATCTTAA TGGCGGGAGA ACGCTTGTAC
AGCAAGGATG TCAGCCAATG GAGATGTATT GCAGGAGAAC ATGTGGAACT GGTGAACCTG
TACGGTACTT CGGAAACAAC GATGGCCAAA ACCTTCCACC GTATTAAAGC CGTACCGGAA
GATCCTTCTG CAGTGATACA TGTAGGTCGG CCGCTGAATA ATACAATGAT CGCTGTCGTT
AATGGCAGTC GCCTGTGCCG TGCAGGTGAG ATCGGAGAGA TCTACATTGT GACGCCTTTC
ATGACTAAAG GTTATTATAA AAACGAAGCA CTGACCCGCA CTGTCTTTGT ACAGAACCCG
CTTGTAACAG ACCGCGAAGA GATTGTACAT CGTACGGGAG ACTATGGTGT ATACCTGGAG
GACGGATCCG TAGAGGTCAT AGGCAGAAAG GATGAACAGG TAAAAGTCAA TGGTGTCAGA
GTAGAGCTGG GCGAAGTAAA ACAGGCAGTA TTACGTCTGG ATGGAATCTC CGGAGCTGAG
ATCATTGCCC TGAAAAATAC AGCAGATGAA AATGAGCTGA TTTGCTATTA TACTTCTACA
GAAGTGAAAG AAGATGTCCT GAGAACACAC CTGGAATCTG AACTGGCCCG CTACATGTTA
CCCGCTGCGT TGATCAGAAT GGAAGAGTTC CCGTTGACCA TCAATGGTAA GGTAGATAAG
AATGCATTGC CTAAGCCGGA GAAAGTATTG ATATCAGATG ATGCGTATGT TGCCCCGCAA
ACGCCAACAG AGAACAAACT GGAAGCAATG TGGCAGGAGT TGCTGGGACT GACCCGCGTG
GGTACGGCTA TCAACTTTTT CAGAGTGGGT GGTGCTTCGC TGAAGGTGAT TAAGATGGTA
TCGCAGATCT TTCACGTGTT TAATGTATCG CTGACGTTTG CAGACGTGTT TGTCAATAAT
ACCATCAGGC AACTGGCCAC ACTTATCGAT GAAACAGTAA AAACAGGCGG CGCCAACATT
GTACCACTAC CCGAGAAAGA GTACTATGAT GTTACCTATG CACAGCGCAG GTTATGGATC
TATGATAAGC TTGGCGGACA AAAGAACCTG TATAACATCG TACACGCAGT AGAGCTAAAA
GGGAATATAA AACCGGCGCT TATCCGTGAA GCACTGGTAG CGTTGATTAC CAGACATGAG
ATGTTACGGA CTACTTTTAT CGAAGTAGAC GGCGAACCCA AACAACGTGT TCACCCCGTG
AATAGTGAAC TGATCCCCTT CGCTTATTTT GACAGACAGC AGCAACTGGA ACTATATGGA
GGTATACCAT CCATCGTTTG GGCAGAACAG CAGCGGGAAT TTAACCTCGA AACAGGTCCG
CTCTTTTTCA ACACATTATT GCAGCTGGAC AACGACCATT ACGCAATGGT TTTCAACTGG
CACCACATTA TCGGTGACGG ATGGACACAA GACGTATTAC TCAACGACCT CCTGGTGCTG
TACAACACAC TCGAAGGAAG AGGAGAGCAG ACATTGAAAC CACTACGCTT CCAGTACCGT
GATTATGCAG ATTGGATGAA TCGTCAGCTC CAGGGTGAAC GACTGAAAGA AATGGAGGAA
TACTGGGCCA CCAGGTTTGC AGCTCCATTT ACGCCAGCTA CTTTTCCCGC ACAAACAGAC
AGAAACAAGG CGCGTGCCGG CATCGGTCAT AGCATAGATT TTTTGCTGGA CCAGGAGACA
ACCCGCCAGC TCAGGGCTTT AACACAGGTT TCTGCCGTGA CGGATTATAT CAGCCTGCAC
GCGATTGTCA ATATACTACT ATATCATTAT TCCGGTACTG CCGATATCGT AACCGGAGCG
CCTTTTTCCG GCAGACCCAG ACTGGAACTA CAGGATCAGG CAGGGTTTTA TGTTAATCTG
ATGCCCTTGC GTGTTACACT GGATCCCGAA GATAACTTCC TGACAGTACT CGATAAAACA
AGGGAATGTA TAACGGGCGC ACATAAGTAT CAGGAATACC CCATAGATAT GCTGGTACAA
CGCCTGGGAC TGGATGCCCG CTTCGGTCGT ATGCCAATGT TTAACATACT CATACAGTCA
CAGAATAACC TCGAATACCC CATCAGTGAT ATAGAAGGAA TGTCAGTTAA ACAGATAGAG
CTGACCGTCG CTACCAGTAA GGTAGATGTG ACATTCAACT TCCAGGAGAA CGGAGATGAA
ATACTGGCTT CTATCGAATA TGATACAGAG CTCTATACAG AGAAAGGTAT CCGCCTGGTC
ATAGAGAACT TCCTGACCGT CATTCGCACT TTAAGCAGAC TCCCTGCGAC AAAGATCAAA
GACATCACAA TCGAACGTGC CGTAGACGAG GAAGAAGAAG AACAATCTTT TATGAATGCG
CTTTCACAAC TATAA
 
Protein sequence
MNKRIIHTVF DQQAVTAPER VAVEEGRRSW TYKELKESSE ALTQYLLHYN PATGTPVAVM 
LPPGFSLVSA LLAVFRSGNI YMPLDASLPA KKLHTIFEQT RPAVCVTTLA LASVAENIIR
EQSAFNCTMI VLDEELPVTV KHFEDSTYAG AETPGSIAFQ PFPEIRPDDA NYIIYTSGST
GEAKAIVGCH DSLSHFIHWE MNEFKLDNGV RVSQLSQFTF DASLRDVFVP LSIGGTLCFP
PAGSRTNIPL LIEWLEESQI NLVHCVPSIF KLITRSLNTN TGKQLLPALK HILMAGERLY
SKDVSQWRCI AGEHVELVNL YGTSETTMAK TFHRIKAVPE DPSAVIHVGR PLNNTMIAVV
NGSRLCRAGE IGEIYIVTPF MTKGYYKNEA LTRTVFVQNP LVTDREEIVH RTGDYGVYLE
DGSVEVIGRK DEQVKVNGVR VELGEVKQAV LRLDGISGAE IIALKNTADE NELICYYTST
EVKEDVLRTH LESELARYML PAALIRMEEF PLTINGKVDK NALPKPEKVL ISDDAYVAPQ
TPTENKLEAM WQELLGLTRV GTAINFFRVG GASLKVIKMV SQIFHVFNVS LTFADVFVNN
TIRQLATLID ETVKTGGANI VPLPEKEYYD VTYAQRRLWI YDKLGGQKNL YNIVHAVELK
GNIKPALIRE ALVALITRHE MLRTTFIEVD GEPKQRVHPV NSELIPFAYF DRQQQLELYG
GIPSIVWAEQ QREFNLETGP LFFNTLLQLD NDHYAMVFNW HHIIGDGWTQ DVLLNDLLVL
YNTLEGRGEQ TLKPLRFQYR DYADWMNRQL QGERLKEMEE YWATRFAAPF TPATFPAQTD
RNKARAGIGH SIDFLLDQET TRQLRALTQV SAVTDYISLH AIVNILLYHY SGTADIVTGA
PFSGRPRLEL QDQAGFYVNL MPLRVTLDPE DNFLTVLDKT RECITGAHKY QEYPIDMLVQ
RLGLDARFGR MPMFNILIQS QNNLEYPISD IEGMSVKQIE LTVATSKVDV TFNFQENGDE
ILASIEYDTE LYTEKGIRLV IENFLTVIRT LSRLPATKIK DITIERAVDE EEEEQSFMNA
LSQL