Gene Cpin_3416 details


Gene Information

Locus tag: Cpin_3416
Symbol:
ID: 8359582
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Chitinophaga pinensis DSM 2588
Kingdom: Bacteria
Replicon accession: NC_013132
Strand:
Start bp: 4228305
End bp: 4231367
Gene Length: 3063 bp
Protein Length: 1020 aa
Translation table: 11
GC content: 47%
IMG OID: 644965589
Product: amino acid adenylation domain protein
Protein accession: YP_003123084
Protein GI: 256422431
COG category: [Q] Secondary metabolites biosynthesis, transport and catabolism
COG ID: [COG1020] Non-ribosomal peptide synthetase modules and related proteins
TIGRFAM ID: [TIGR01733] amino acid adenylation domain
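
The coordinate and length fields in this table can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the gene length includes the terminal stop codon. Neither convention is stated in the record itself, and the function names are hypothetical.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 4228305
end_bp = 4231367
gene_length_bp = 3063
protein_length_aa = 1020


def span_length(start, end):
    """Length of a coordinate range, ASSUMING 1-based inclusive positions."""
    return end - start + 1


def coding_length_aa(gene_len_bp):
    """Residues encoded by an unspliced CDS, ASSUMING the length
    includes one terminal stop codon that is not translated."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Both comparisons hold for the values listed in this record.
print(span_length(start_bp, end_bp) == gene_length_bp)
print(coding_length_aa(gene_length_bp) == protein_length_aa)
```

Under these assumptions, 4231367 - 4228305 + 1 gives 3063 bp, and 3063 / 3 - 1 gives 1020 aa, which agrees with the listed lengths.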


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0257508
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 24
Fosmid unclonability p-value: 0.239752
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGAAATAT TAAACGCCGC CATACAAAAA CAGTCCTTTG AGTACTGGTT GAAAAAAATT 
ACCGTACAGA ATGACTGGTT AGCGGATTAT ATGGAAGAAC AGAAAGAATT GCCCTTACTG
CTGTCTGAAG AAATACCCGG CAGCGTAGTG GAAGACCTTC GTAAGGTAGC CCGGAATGAT
ATTTCAGCGT ACGTATTGTT CTTCTCTGTT TACTCTTTTC TCTTATATCG TTACTTCGGT
AAAAACTGCC TGATCACCTC AGCTGATCTG AAACTGTTTT CTCCATCATC AGCAGAGCGT
CTGCTGTTTT ATGCCAGCGC AATCAGTGAA GACACAACAG TCAAAGAGAT GATTACCGGT
TTTCAACAGG AACTGATGGC AGTACTGGAG AAACGTGATA TAGATATTTA TGCATTGCTG
GGAGCGGCAG AAAATAAAGG CGTCGATACT GACCGCCTTC AATGCTTCGC ATTTCAGCAC
GAAACAGCCA GTGAACAATC ACAGCTGGCT GATAAAAGCA GACTGAAACT GCTGATCCGT
GAAAATGAAG GCGCCCTCAC AGTGCAATTG CAGGTGAACG GTTTCAGAGA AGGAAAACTC
ATCGGACAAC AGTTTATTGA TCACTATGCA TATCTGCTGT CCGAAATACA ACACCTGCTG
GATCGTCGTA TCGCAGACAT AGAACTGCCT TTGCCCGCAG TTGCCAAAGT AGCAATACCG
GAAGATACCA CAACCGTATT GGATCTCTTC AAGACACAGG CACAACAGCA GCCGGAACAA
ATTGCACTGG TAAGCCGTGG TATACAATAT ACCTACGCAA AACTGGATGA CGAAAGCAAT
AAACTGGCAA ACTATCTGCT GACAGAAGAA CAGGTCACAA AAGGACAGCC AGTCGCATTA
TTGTTGCCAC GCAGCGAATG GATCCTGACA GGTATACTGG GCATACTGAA AGCAGGCGCC
GCTTTTGTAC CGCTTGATCC TGCATGGCCG GTTAACAGAC TACAGTACAT CCTCGATGAT
GCAGGTGTAG AGGTATTGCT GACAACAACA GAATACCTGG CCCAGCTGCC TTCATTCAAA
GGGAAATTGT TCGCTTTTGA TATACAACAG GATATGCTGC CCGTTGCGCC CGCACCGGAA
ATCTCCATCA ACGGATCAGA TGCAGCCTAT ATTATGTATA CCTCCGGTAC CACCGGACAG
CCTAAGGGTG TCGTTATCGA ACACAAAGGA ATCGCTAACT ACGCCACCTG GCTGCATAAT
GACTTTCAAT TTGGCGCAGG TGACGCTACG ATCCTGGTAA CCTCCTATGC GTTTGACCTG
GGATACACCG CCATATGGGG TGCTATCCCA TGGGGTGCTA CACTGCATAT CCCCGGCGAA
GACTATAGTA AAATGCCGGA AAAATTATAT GATTACCTCG CCGCACAGCA GATCAGTTTT
ATCAAATTAA CGCCTTCACT ATTTCATCTG CTGCTGAATG TTACCAACAA TACGAACCGC
TTATCGCTGA AGAAAATATT TCTCGGTGGA GAAATGATCC GGCCGACAGA TATCGCTGCT
TTTACTGCGG ACTATCCCGA CACGCTGTTC GTGAATCACT ACGGACCGAC TGAAAGTACC
GTAGGTTGCA TCTTCCACCG TGTACGTCGA GAAACATTTT CTTCTTTTGC TGCCCGCCCC
GTGATCGGTC GCCCCATCAG AAATACAGAA GTGCTGATCC TGGATGAATA CCTGCACGTG
TTACCGCATG GTGTATGGGG AGAGATCTGT GTCAGTGGAC CCGGTGTAGC AAAAGGCTAT
CTGAATAAAC CAGAGGTGAC TGAAAAGCAG TTCGTCAGCA AAGACCTGGC CGTGAATGGT
CGCGTTTACA GAACCGGAGA CTATGGCCGT TACTTAGGTA ATGGTACGAT TGAACTGAAA
GGACGTAAAG ATAACCAGGC AAAGATCAGA GGATACAGAG TAGAACTGGA AGAGATCGAA
AAATGCCTGG CGAATTATCC TGATATCCAT GAAGCAGCTG TCTTGCTGCA ACGTGCAGCA
GGTGATCAAC ATGCAAAACT GGTGGCTTTC TATACATTGG CAGAGACCAG AAAAGAGGTA
AGCTCAGAAG CCATCATTAA TTTTCTGAAA GCCTATCTGC CTGAATATAT GATCCCTTCG
GATATCATAC CCCTGGATGT GATGCCGGTA AATGAAAATG GAAAGCTGAA CAGGCAGGAA
TTATCAGCGT CTCTTGCAAC AAAGAATACG CGTCGTCGTA GTAAACAGGT GTCACCGGTA
AATGAGACCG AACGGATTAT CCTGAAAGTA TGGCAGGACG TGTTAAAGAG AAATGATATC
AGCACAATAG ATAGCTTCTT CGAACTGGGT GGTAACTCCT TGCTGCTCGT ACAGGTGAAT
ATCACGCTGA ACGAGTTTTT CCCTTCTCTG ACCATAACCG ATCTTTTCGC ACACATCAAT
ATAGCTGCAC TGGCGGCACA TATCGACCGT ACAGATGATA CAACAGGTAT GAGTCTGGCA
GGTACAGGGA TTCACTTCCC GGCTGACTAT TTCGGTGCAG AAACGCCGTC CGCTGAGTTA
TTTGTAGCCT TAAGCGAAGA AGCAAGTGCC AGTGTAAGTG CGTCCGCGAT CGCCTGTGGT
ATCTCTGAAA TAGATATCTG CATTGGTACT TTCGCTTACG CACTAGGCAA ATCAGCTGAA
ACCGGAGATG TACTGTTTCA CCTATATGAA GGAGAAGGCA AGCTGAAACG CCTGTCTGTT
CCCGTACATC GTGCAGAAAC AAAAGAACAG TTATATAGTA CTGCTGCCTT ACAGAGAGCA
CATCCGGAAC TGGTTTTCCA GGCGGACAGT GTAATCCGCC CGACGACAAA AGACGAGATG
ACCTGGCCAT TGATCAGCCT GAACGGCAAA CCCGCCGTCG CCAATGGTAT ATTCGATATC
ATACTTACTA TTCAGCAGAA AGAGGGCGTA CTGGCCTTCT CGCTGGAACA TACGCCCCGC
TTAAATCCTG AAAAGATAGA AGCTTTACTG GACCTCTTTG TCAGTATGCT GGAACGCGCC
TGA
 
Protein sequence
MEILNAAIQK QSFEYWLKKI TVQNDWLADY MEEQKELPLL LSEEIPGSVV EDLRKVARND 
ISAYVLFFSV YSFLLYRYFG KNCLITSADL KLFSPSSAER LLFYASAISE DTTVKEMITG
FQQELMAVLE KRDIDIYALL GAAENKGVDT DRLQCFAFQH ETASEQSQLA DKSRLKLLIR
ENEGALTVQL QVNGFREGKL IGQQFIDHYA YLLSEIQHLL DRRIADIELP LPAVAKVAIP
EDTTTVLDLF KTQAQQQPEQ IALVSRGIQY TYAKLDDESN KLANYLLTEE QVTKGQPVAL
LLPRSEWILT GILGILKAGA AFVPLDPAWP VNRLQYILDD AGVEVLLTTT EYLAQLPSFK
GKLFAFDIQQ DMLPVAPAPE ISINGSDAAY IMYTSGTTGQ PKGVVIEHKG IANYATWLHN
DFQFGAGDAT ILVTSYAFDL GYTAIWGAIP WGATLHIPGE DYSKMPEKLY DYLAAQQISF
IKLTPSLFHL LLNVTNNTNR LSLKKIFLGG EMIRPTDIAA FTADYPDTLF VNHYGPTEST
VGCIFHRVRR ETFSSFAARP VIGRPIRNTE VLILDEYLHV LPHGVWGEIC VSGPGVAKGY
LNKPEVTEKQ FVSKDLAVNG RVYRTGDYGR YLGNGTIELK GRKDNQAKIR GYRVELEEIE
KCLANYPDIH EAAVLLQRAA GDQHAKLVAF YTLAETRKEV SSEAIINFLK AYLPEYMIPS
DIIPLDVMPV NENGKLNRQE LSASLATKNT RRRSKQVSPV NETERIILKV WQDVLKRNDI
STIDSFFELG GNSLLLVQVN ITLNEFFPSL TITDLFAHIN IAALAAHIDR TDDTTGMSLA
GTGIHFPADY FGAETPSAEL FVALSEEASA SVSASAIACG ISEIDICIGT FAYALGKSAE
TGDVLFHLYE GEGKLKRLSV PVHRAETKEQ LYSTAALQRA HPELVFQADS VIRPTTKDEM
TWPLISLNGK PAVANGIFDI ILTIQQKEGV LAFSLEHTPR LNPEKIEALL DLFVSMLERA
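
The gene and protein sequences above are printed in space-separated blocks of ten characters. The following is a minimal sketch of how the nucleotide blocks could be cleaned, translated, and summarised by GC fraction. It uses only the Python standard library. The helper names (`clean`, `translate`, `gc_fraction`) are hypothetical. The codon table is the standard one; translation table 11 assigns amino acids to codons in the same way and differs in which codons may act as start codons, a distinction this sketch ignores.

```python
from itertools import product

# Standard codon-to-amino-acid assignments in TCAG order ("*" marks a stop).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def clean(seq_text):
    """Remove the spaces and line breaks used for display; upper-case the result."""
    return "".join(seq_text.split()).upper()


def translate(dna):
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(dna):
    """Fraction of bases that are G or C."""
    return sum(base in "GC" for base in dna) / len(dna)


# First line of the gene sequence shown above.
first_line = clean(
    "ATGGAAATAT TAAACGCCGC CATACAAAAA CAGTCCTTTG AGTACTGGTT GAAAAAAATT"
)
print(translate(first_line))  # MEILNAAIQKQSFEYWLKKI
```

The printed residues match the first two blocks of the protein sequence (MEILNAAIQK QSFEYWLKKI). Passing the full cleaned gene sequence to `translate` and `gc_fraction` would allow a comparison against the listed protein sequence and GC content.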