Gene Cpin_3588 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCpin_3588 
Symbol 
ID8359755 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChitinophaga pinensis DSM 2588 
KingdomBacteria 
Replicon accessionNC_013132 
Strand
Start bp4470233 
End bp4473193 
Gene Length2961 bp 
Protein Length986 aa 
Translation table11 
GC content48% 
IMG OID644965758 
ProductTonB-dependent receptor plug 
Protein accessionYP_003123252 
Protein GI256422599 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG4206] Outer membrane cobalamin receptor protein 
TIGRFAM ID[TIGR01435] glutamate--cysteine ligase/gamma-glutamylcysteine synthetase, Streptococcus agalactiae type 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00353353 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value0.00053116 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGAAAAAAA GGGGAATATT CCTGACAGCA TTGCTGTTGG CACTGTGGAG TTTTGCTATG 
GCACAGCAGG ACGTCACAGT AAAGGGACAC ATACAGGACA AGCAGGGCAA CTCTCTTCCC
GGCGTTTCCA TCAAGATCAA AGGAACAGCT AAAGGCACGG CCAGCCAGCC AGATGGTAAC
TTTACCATTC AGGCGCCTTC CAACGCCATA CTGGAACTGT CTTACGTGGG CTACGAAACA
CAGGACTTTT CATTAGAGGG CCGTACCCAG GTCACCATTA CCCTGTCTGA CACCAAAACA
GATCTGAGCG AAGTAGTGGT TATCGGTTAC GGTACCCAGA AAAAGAAAGA TGTTACTACA
GCCGTAGTCG CTGTTAATAC CAAAGATATC GCTGAGCGTC CGATTACACA GACGGCTGCC
GCCCTCCAGG GTAAAGCTGC CGGTGTGCAG GTAACGCAGC CTTCCGGTAA ACCGGGCTCC
GCATTCTCAG TAAGAGTGCG TGGTGCGACT TCCGTACAGG CGGGTAACGA ACCTTTATAT
GTGATCGATG GTGTGCCAAC CACCGATACA AGGGATCTGA ATACCAATGA TATCGCTACT
ATTACCGTAC TGAAAGATGC TTCCAGCGCC GCTATCTATG GTGCAAGAGC ATCCAATGGC
GTGGTACTGA TCACCACCAA GAGAGGTCGT TCCGGTGATG CGGTGATCAA CTTCAATACT
TATTATGGTG TATCTAAGAT CGGTAAGAAG ATCGACGTAC TGGATCCTAC ACAATATAAT
GACCTGATGA AGGAAATGGG TTTCAACGTA GTAACGCCTA CGGTGACTAC CAACTGGCTG
GATGAAGTAT TCCAGACGGG TCGTAACCAG AACTATCAGT TATCAGCTTC CGGTGGTAGT
GAGAAATCTC AGTATTTCAT CTCCGGCGCT TATACCAAAG ATGACGGTAT GGTTAAACCT
GCGGAATATA ACCGCCGCGT TTTCCGTGCT AACCTGGACA ATCAGCTGAA ACCATGGATG
AAACTGACCA CTAACATCAG CTACTCCAAT GTGGATCTGA AAGATGTAAA GGATAATGCA
AACGCAGGTC GTAACGCAGC GATCCTGGGA GCACTGAATG CGCCTCCGAT CATCGGTATC
TATGAGACAG ATGCAGACGG TCACCAGCGC TATACCATGA ACCCATATAA ATCCGGTTGG
GATAACCCGC TGGCGGCTAT TGAAGGTCCG ACGCAAGGTA CAAAAGACAA CCGTATACTG
GGTAATGCCG CGCTGGACAT CAATTTCACC AAAGATCTGA AGTTCCGGAC CAACTATGGT
ATCGACTATA CCAATCACCA ATACGACTAT TACCTGGATT ATATCATGAC CACTCCGGGC
CGTCTGGACC ATGGTTATGC TACCTCCCAG CGTTATAATA CAACGACCTG GCTTTGGGAA
AATACCCTGA ACTATGACAA AAGCTGGAAG AAACACAATC TCTCCGCTTT GGGAGGTGTG
ACTTCTCAGA AGAACAATTA CGCTGAAACT AATCAGACAG GTCGTGATTT CCCTGCTGAT
CCGACTGTAA AAACGCTGAA CGCCGCTAAC CAGATCACAG GCGATACCAG AGAATCCCAG
TGGTTCCTGA TGTCTTACCT GGCGCGTGTG ATGTATAACT ATGACAGCAG GTATCTGTTG
ACGGTTAACT TCCGTGCGGA TGGTTCTTCC AAACTGGATA AAAAACATAA ATGGGGTTAC
TTCCCGTCTG TATCTGCCGG CTGGCGTATT TCTTCCGAGC CGTTTATGCA GGACGTGAAA
GCGATTAATG ACCTGAAATT AAGAGCGGGC TGGGGACAGA ACGGTAACGT AGAAGGGCTG
AGTCCATATG CGTCCCTGGG TCTGAATAAT TTCGTTCGTC AGACGCCGAC AAGTCCGCTG
TCAGGTCCTG GTATCACCCT GCCTGCAGGT GCGCCGAACC CGGATCTGAA ATGGGAAACC
ACTACACAGA CCAACGTTGG TATCGACCTG AGTCTCTTTG AATCCCGTCT GACTTTCTCT
GCTGATGCGT ACATCAAAAA GACAAAAGAT CTGCTGTTAA ATGTACCATT CCCACGTTCA
GCGGAATACC AGTATATGCC GCGTAACTCC GGTGCACTGG AGAACAAAGG TCTGGAGTTC
CTGGTTTCTT CTGTGAACGT TGATAAGGTT GACGGACTGC GTTGGAGCAC TGATTTCAAT
ATCGCTTTCA ACCGTAACCG TCTGACTGAT CTGCAACTGA CACAGGTATA TAACTATGCT
GCGCCAGAGA ACAGAGATTT TATCATCACA CTGAGAAAAG GACTGCCACT GGGTACTTTC
TACGGATATG TAGCAGAAGG GGTTGATCCT AAGACAGGAG ACATGGTGTA TAAAGATGTA
AATGGTGATG GTAATGTAAC GCCTACCGAC CGTACTGTGA TCGGTAATGC ACAGCCTACT
TTCACTTATG GTCTGAACAA TAACCTGTCT TATAAAAGCT GGTCATTCTC ATTCCTGTTC
CAGGGTTCTC AGGGTAATGA AGTGTTCAAC GCCTCCCGTA TGGAAACAGA AGGTATGTAC
GACAGCAAGA ACCAGTCTAC CGAAGTACTG CGCCGCTGGA CAGCAGCTGG TCAGGTAACT
GACATCCCAC GTGCTACCAA CGGTGATGTC ACCAACTCCC GTACTTCTTC CCGTTTCGTA
GAAAATGGTT CCTTCCTGCG TATGAAGAGT GCAACACTCT CTTACAATCT GCCGAGGAGT
GCATTGTCTG CTATGCACAT CAGCCGACTG ATGGTATATG CTACTGCACA GAACCTGTTT
ACTATTACCA AATACCAAGG TTTTGATCCG GAAGTGAATG CTTACTCAGG TGATCCTGCG
AATGGTGTAA CACTGGGTAT TGACTACGGT ACTTATCCGG TAGCAAGAAC TTATGTCCTT
GGTTTAAACC TGTCCTTCTA A
 
Protein sequence
MKKRGIFLTA LLLALWSFAM AQQDVTVKGH IQDKQGNSLP GVSIKIKGTA KGTASQPDGN 
FTIQAPSNAI LELSYVGYET QDFSLEGRTQ VTITLSDTKT DLSEVVVIGY GTQKKKDVTT
AVVAVNTKDI AERPITQTAA ALQGKAAGVQ VTQPSGKPGS AFSVRVRGAT SVQAGNEPLY
VIDGVPTTDT RDLNTNDIAT ITVLKDASSA AIYGARASNG VVLITTKRGR SGDAVINFNT
YYGVSKIGKK IDVLDPTQYN DLMKEMGFNV VTPTVTTNWL DEVFQTGRNQ NYQLSASGGS
EKSQYFISGA YTKDDGMVKP AEYNRRVFRA NLDNQLKPWM KLTTNISYSN VDLKDVKDNA
NAGRNAAILG ALNAPPIIGI YETDADGHQR YTMNPYKSGW DNPLAAIEGP TQGTKDNRIL
GNAALDINFT KDLKFRTNYG IDYTNHQYDY YLDYIMTTPG RLDHGYATSQ RYNTTTWLWE
NTLNYDKSWK KHNLSALGGV TSQKNNYAET NQTGRDFPAD PTVKTLNAAN QITGDTRESQ
WFLMSYLARV MYNYDSRYLL TVNFRADGSS KLDKKHKWGY FPSVSAGWRI SSEPFMQDVK
AINDLKLRAG WGQNGNVEGL SPYASLGLNN FVRQTPTSPL SGPGITLPAG APNPDLKWET
TTQTNVGIDL SLFESRLTFS ADAYIKKTKD LLLNVPFPRS AEYQYMPRNS GALENKGLEF
LVSSVNVDKV DGLRWSTDFN IAFNRNRLTD LQLTQVYNYA APENRDFIIT LRKGLPLGTF
YGYVAEGVDP KTGDMVYKDV NGDGNVTPTD RTVIGNAQPT FTYGLNNNLS YKSWSFSFLF
QGSQGNEVFN ASRMETEGMY DSKNQSTEVL RRWTAAGQVT DIPRATNGDV TNSRTSSRFV
ENGSFLRMKS ATLSYNLPRS ALSAMHISRL MVYATAQNLF TITKYQGFDP EVNAYSGDPA
NGVTLGIDYG TYPVARTYVL GLNLSF