Gene Cpin_3409 details


Gene Information

Locus tag: Cpin_3409
Symbol:
ID: 8359575
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Chitinophaga pinensis DSM 2588
Kingdom: Bacteria
Replicon accession: NC_013132
Strand:
Start bp: 4215242
End bp: 4218412
Gene Length: 3171 bp
Protein Length: 1056 aa
Translation table: 11
GC content: 47%
IMG OID: 644965582
Product: amino acid adenylation domain protein
Protein accession: YP_003123077
Protein GI: 256422424
COG category: [Q] Secondary metabolites biosynthesis, transport and catabolism
COG ID: [COG1020] Non-ribosomal peptide synthetase modules and related proteins
TIGRFAM ID: [TIGR01733] amino acid adenylation domain
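
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the gene length includes one terminal stop codon; the function names are hypothetical and not part of any database API.

```python
# Values copied from the Gene Information fields above.
START_BP = 4215242
END_BP = 4218412
GENE_LENGTH_BP = 3171
PROTEIN_LENGTH_AA = 1056


def span_length(start: int, end: int) -> int:
    """Length of a coordinate interval, assuming 1-based inclusive positions."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Number of codons in the CDS minus one terminal stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


if __name__ == "__main__":
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under those assumptions both checks agree with the record: the interval spans 3171 bp, and 3171 / 3 = 1057 codons, one of which is the stop codon.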


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value: 0.84904
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 23
Fosmid unclonability p-value: 0.154555
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGAAAATA ATATAAAAGG ATTCGAATTG TCACCCCTGC AGAAACGTAC CTGGCGTCTG 
CAGGCTGACA ATGGTCCGCT GTACTCACTG GCTTCCTTCC GTATTACTGG CGTCACTGAT
CCCGGCAGCA TTAAAACGCT ACTTGAAAAA CAGCTGGAGG CGCATGACAT CTTCCAAACG
CGCTTCGAAC AGGTAGTACA CATGCAGTAC CCTTTTCAGG TGCCCGGTTG GAAGAAATGT
TATGAAATTC AGGTGACCGA TATCTCCCTG GAAGAACGCG ATATGCAAAC CGTCATAACC
GATAACCTCT TTGAATCACT GGCTGCCGGT ACAGAACCGG CTGAAGAGGA TATACTGACG
GAAATAGCCA TCGTAAAGAC CCGGCCCGCT GAGTGTCTTT TATTAATAAA GATCAACGCG
CTGTCAGGTG ACACCGGTGT GATTATGAAT ATCGTGAATG ACCTGCTGGC CGCCATTGAT
GGTCTTTGCG AATTACTGAA CAACGAAAAC TACCCTTATG TACAGTTCGC ACAATGGCAG
CAGGAACTGA TGGATGAAAA CAATGAAGAG GCGGAGCAAT TCTGGCTTGA GAGAAGAAAT
AACCAGGAAC ATCACCACCA GTTGCCATTC AGCATCACTC CAAAAACGGA TGATAAACGA
TCATCCAGAC CGCTGACATT ATCCCTGGAT ATCAATGCCG CCTTACAGAC TGATATTAAA
CATTGCTGTA CAGAACAATC AGTAACGCCC GCTGTGCTGC TGAAAACAGC ATGGACAATA
CTCTTGGCGG ACTATATGGA CTACACCGGC AATTTCGTAA TAGGCAGTGT CGAAAATGGC
CGTCACTATG ATGCCTTCAC CTCCATTAAT GGTCCGCTGT TTAAAACAGT ACCTTTCAGA
ACGACGCTTG CAAAGACAGA TACGATACAG GCTGTTTTAC AGCGTGTGGC TGCAGAAACA
GAAATGATTG CGGAGTCGCA GGACTATTAC TTTAATATTC CCGGACAACA ACGCTGGCAG
GATGCCGTGA AGTTCGATCT GTTGTTCGAA TATCATGAGT TGCAACATCC GCTGACGATT
CATGTAAAGG CCCTACTGGA TGGTATTTAT GTACATACAG AACCATGTAG CCTGAAGTTG
TTTTGTTATG AGTATAGTCC GGTAGGTCTG GTGGCCGAAC TCTATTATGA TCCGACTGTG
TTAAACAACG CACAGGCACT GTTGATCAGG GAACGTTTCA CTTATATCCT GCAACAACTC
ACCGGAGACG GAACAGGAAC ATTGGCAGAT ATTCGTAATT GTACCCAACA GGAATATGCG
CACATCGTCG ATAACATGTA CCTGCCGGGT ATAACTGTAA ATAATGACCC TGCCGCCCTG
TCTATACCGG CTTACTTTGA ATCCTTGTTG CCTGCTGTGG CAGACCGCCC CGCGGTAGGC
TTCAAACATA AGCTGCTGAC ATATGCCGAA TTAAATGAAA AGGCAAATGC ATACGCCCAT
CACCTGATAC GCAGGTACGG TATAAAACCC GGCGATGTCG TAGCATTCCA GATACCAAGA
TCTGTCGACA TGGTCGTTGT TATCATGGGT ATTCTGGCGG CTGGCGCCGC ATTTTTACCA
TTGGATATTG CCGCGCCGGA AGAAAGAGTA AAATTCATTC TGCAGGACAG TGCTGCAAAG
GTGCTGATCA CGCAGTCTCA GTTAATACCA GCCCTGAAGG GCATCACACA ACACTGGGCA
TTGGAAGACG GTATCGGAGG TATTGAAACC CTGATCACCG CTCCGCAGAT CAATATCAGT
CCCGCTTCAG CGGCCTACCT GATCTATACA TCGGGTTCAA CCGGGAAGCC TAAAGGGGTG
TTGATCTCCC ACGCGGCCTT GTTAAATTAC AGCCGCTGGT TCAGTACTGT TTATGATATT
ACATCGGAAG ATACTTCGGT GTTGTTTTCA TCCATTGCCT TTGACCTTGG TTTTACAAAT
CTCTGGCCCC TGTTATTAAG TGGTGGCCAG GTTCAGTTGC TGGAAGAAAC ACAGCTGCTG
GATACCGCCG GTCTTTGCAA TCTGTTGTCG GAGAAAGGAG TTACGGTTAT AAAGCTGACT
CCATCGCATT TCAACCTGCT GCTGAATGAA CCCGGATTTG ATGACATGGC GCCTGACCTG
AAACTGAAGC TGATCGTATT GGGGGGTGAG GCAATAAGGC CGCAGGACCT GGAAACGTAC
TTCGCGTTAA ACCCGTCTAT TACTTTCGTT AACCACTATG GCCCTACAGA AACAACTATT
GGTACAGCTT CCAGACGTAT AAAGGCAGCC GATTTTCAGA CATTCAGACA AAGCCCTGTT
ATAGGTAAAC CGGTAACAGG CAACAGCATT TTTATACTGG ATGAGCAACA CTGCATATTA
CCTTATGGCA GTACCGGCGA AATCTGCGTA GCTGGCGCCG GATTAGCCAT TGGTTATCTG
AATGGTGCCA CACAGAATCA GGAAAAGTTC ATCGCTCACC CGCTGGACCC TGCCGCGAAA
TTGTACAAAA CCGGTGACGT GGGTCGCTAT ACCCTGAACG GTGAAATACA GTTCCTGGGC
AGGAAAGATT TCCAGGTGAA AATAAACGGC TATCGCATCG AACCGGAAGA GATCAGAAAC
GTACTGATAC TGTTTCCTGA AATACAGGAT GCCGCAGTTC TCTACGTACC GCAGGCAACG
GGAGAGGGTA GTCTGGCCGC CTACTTCAGT GCAGACGAGC CACTTGAAAA GAACAAAATA
CAGGAGTTTT TAACCCGGCA TCTGCCGCAG TACATGATAC CTGCATACTT TGTACAGGTA
AGTGCAATCC CACTGACGCC AAACGGTAAA ATAGACCGTA AAGCATTGCT GGAATTGCCA
CTGGAAAGAG CAGCCAGTGC TGAATATGTT GAACCCGAGA AGGAACTGGA GAAACAGATT
GCCCGGTTAT GGAAAGAAAT ACTTTGTGTA AACAAAATTG GTATCCATGA CAATTTCTTC
GACCTTGGTG GCAATTCGCT CAAGTTGATC CTTATGCTGA GAGAATTGTC AAAAATATTC
CCCGGAAAGG TAACACTCAC TGACCTGTTC CGCTACAATA CAATTTCTTC TATCATCCGC
TTCCTTGGAC AGGAAGAACC GGAAGCCGCT GTTGCTGGCT TCGAAATCTA A
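
The gene sequence is displayed in space-separated groups of 10 bases, so it has to be joined before any computation. The sketch below shows one way to do that and to compute a GC percentage for comparison with the "GC content" field; the helper names are hypothetical, and the record does not state how its own GC value was derived or rounded.

```python
def clean_sequence(text: str) -> str:
    """Remove the spaces and line breaks used for display grouping."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


if __name__ == "__main__":
    # First three groups of the gene sequence, as displayed above.
    first_groups = "ATGGAAAATA ATATAAAAGG ATTCGAATTG"
    seq = clean_sequence(first_groups)
    print(len(seq), round(gc_percent(seq), 1))
```

Pasting the full sequence block into `clean_sequence` should yield a string whose length matches the reported gene length.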
 
Protein sequence
MENNIKGFEL SPLQKRTWRL QADNGPLYSL ASFRITGVTD PGSIKTLLEK QLEAHDIFQT 
RFEQVVHMQY PFQVPGWKKC YEIQVTDISL EERDMQTVIT DNLFESLAAG TEPAEEDILT
EIAIVKTRPA ECLLLIKINA LSGDTGVIMN IVNDLLAAID GLCELLNNEN YPYVQFAQWQ
QELMDENNEE AEQFWLERRN NQEHHHQLPF SITPKTDDKR SSRPLTLSLD INAALQTDIK
HCCTEQSVTP AVLLKTAWTI LLADYMDYTG NFVIGSVENG RHYDAFTSIN GPLFKTVPFR
TTLAKTDTIQ AVLQRVAAET EMIAESQDYY FNIPGQQRWQ DAVKFDLLFE YHELQHPLTI
HVKALLDGIY VHTEPCSLKL FCYEYSPVGL VAELYYDPTV LNNAQALLIR ERFTYILQQL
TGDGTGTLAD IRNCTQQEYA HIVDNMYLPG ITVNNDPAAL SIPAYFESLL PAVADRPAVG
FKHKLLTYAE LNEKANAYAH HLIRRYGIKP GDVVAFQIPR SVDMVVVIMG ILAAGAAFLP
LDIAAPEERV KFILQDSAAK VLITQSQLIP ALKGITQHWA LEDGIGGIET LITAPQINIS
PASAAYLIYT SGSTGKPKGV LISHAALLNY SRWFSTVYDI TSEDTSVLFS SIAFDLGFTN
LWPLLLSGGQ VQLLEETQLL DTAGLCNLLS EKGVTVIKLT PSHFNLLLNE PGFDDMAPDL
KLKLIVLGGE AIRPQDLETY FALNPSITFV NHYGPTETTI GTASRRIKAA DFQTFRQSPV
IGKPVTGNSI FILDEQHCIL PYGSTGEICV AGAGLAIGYL NGATQNQEKF IAHPLDPAAK
LYKTGDVGRY TLNGEIQFLG RKDFQVKING YRIEPEEIRN VLILFPEIQD AAVLYVPQAT
GEGSLAAYFS ADEPLEKNKI QEFLTRHLPQ YMIPAYFVQV SAIPLTPNGK IDRKALLELP
LERAASAEYV EPEKELEKQI ARLWKEILCV NKIGIHDNFF DLGGNSLKLI LMLRELSKIF
PGKVTLTDLF RYNTISSIIR FLGQEEPEAA VAGFEI
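
The protein sequence corresponds to a codon-by-codon translation of the gene sequence. The sketch below is a minimal pure-Python illustration, not the tool that produced this record. It relies on the fact that NCBI translation table 11 assigns the same amino acids to codons as the standard code (the two differ in permitted start codons), and the function names are hypothetical.

```python
from itertools import product

# Codon table in the conventional TCAG ordering (first, second, third base).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean_sequence(text: str) -> str:
    """Remove the spaces and line breaks used for display grouping."""
    return "".join(text.split()).upper()


def translate(dna: str) -> str:
    """Translate from the first base, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


if __name__ == "__main__":
    # First three groups of the gene sequence shown in this record.
    print(translate(clean_sequence("ATGGAAAATA ATATAAAAGG ATTCGAATTG")))
```

For the first 30 bases of the gene this gives MENNIKGFEL, which matches the first group of the protein sequence above.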