Gene Cpin_1052 details


Gene Information

Locus tag: Cpin_1052
Symbol:
ID: 8357166
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Chitinophaga pinensis DSM 2588
Kingdom: Bacteria
Replicon accession: NC_013132
Strand:
Start bp: 1259652
End bp: 1262396
Gene Length: 2745 bp
Protein Length: 914 aa
Translation table: 11
GC content: 46%
IMG OID: 644963206
Product: surface antigen variable number repeat protein
Protein accession: YP_003120751
Protein GI: 256420098
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG4775] Outer membrane protein/protective antigen OMA87
TIGRFAM ID: [TIGR03303] outer membrane protein assembly complex, YaeT protein
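
The coordinate and length fields above are mutually consistent, which a short check can confirm. The sketch below assumes the start and end positions are 1-based and inclusive, and that the gene length includes the stop codon; both assumptions are inferred from the listed values rather than stated in the record. The function names are illustrative only.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, excluding the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the Start bp and End bp fields of this record.
length = gene_length_bp(1259652, 1262396)
print(length, protein_length_aa(length))  # prints "2745 914"
```

The result matches the listed Gene Length (2745 bp) and Protein Length (914 aa).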


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.00000329593
Plasmid hitchhiking: No
Plasmid clonability: unclonable
 

Fosmid Coverage information

Num covering fosmid clones: 41
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAGAAAT TATTTCCTAA GAGCCTACTG GCCATAGTAT TATGTTGCAG TGCCGGCATT 
CGTGTGTCCG CCCAGATCAG GGACACTAGT GTTACGCCGG TTCCAACACC TGACCAGCAA
GCGGCGGGAT TGAATTTATC AGGTCCGTTA CAGGCACAGC CTTACGAAAT TGCTGACATT
ACCATTGTAG GGACGCAATA CCTGGATAAA TCTCTGCTGA TTTCCCTGTC AGGCCTCAAT
GTCGGTGATA AAGTGGTTTA TCCCGGCGGT GACCAGTTCG CAAAGGCCAT CCAGAGTCTC
TGGGGCCAGC GTTTGTTTGC CAACGTTGCT ATTTATGTTA CCAAAATAGA AGACGGGAAG
ATTTGGCTGG AAATAGAACT CCAGGAACGC CCTCGTCTGA ATAACTTCGT TTTCAGAGGT
GTTAAAAAGT CTGAAACAGA AGAACTGATA AAAAAAGCCG GCCTGCGTAA AGGATCCGTT
GTAACCGAGA GCATGAAACA GAATGCAATC GGCGTGATCT CTAAACACTA TGGCGATAAA
GGCTTCCGCA ATGCTACTGT TAACATTACC GAAAGAACAG ATACCTCCCA GGTAAATGCT
TCCGACCTGG TAATTACCGT AGTAAAAGGT GGTAAAGCAA AAGTGAACGA TATCCAGATC
GTGGGTAACG ACAATATTGA CGATACAAGA GTGAAGAAGA AAATGAAAGG TACCAAAGAG
CGTACCCGCT TCACTTTATA TCCGGATATC GAGTCGGTAT ATGAAGATTC AACCGAACTG
CAGGAAAACT ACTGGAAAAC CTTCGGTTTC CTGTATCCTT CCCGCACTAT GGAACAACTG
GATCCGTATT TCCGTTTCAA ATTATTCTCT TCTGCCAAGT TCAATGAGAA CAAATACGCA
GAAGACAAAG AGAAAGTAAT CGCTTATTAT AACACACAGG GTTACCGTGA CGCCGTTCTG
GTGAGAGATA CTACTTATCG TGCGCACAAT GGTGGTGTAA ACGTAGCCAT GCAGATCAGC
GAGGGTAAGA AATATTACTT CGGTAACATT ACCTGGAAAG GTAACAGCCG TTATAACGAC
TCCCTGCTGA CCCGTGTACT CGGTATCAGA AAAGGAGATA CCTACAACCA GGAACTGCTG
CAGAAACGCT TATTGTCTTC TGAAGGTGGC GACATCGGTG GTATGTACAT GGACTTCGGT
TACCTGTTCT TCCGTGCTGA CCCTGTAGAG GTGGCGATCC ATGGCGATAC CATCGACTAT
GAGATCCGTA TCTCTGAAGG TCCGCAGGCA ACCATCAAAG AAGTACGTAT TGCCGGTAAC
GAAAAAACCA ACGAACACGT AATCCGTCGT GAGTTGCGTA CGCTGCCTGG TGAGAAATTC
AGCCGTACTG ACCTGATCCG TTCCAACCGC GAGATCGCGA ACCTCGGTTT CTTCAACCCT
GAGAAGATCG GTATGGACCC TATCCCGAAC ATCCAGGATG GTACCGTGGA TATCAACTAC
ACTGTAGAAG AGAAAGCGAA TGACCAGCTG GAATTATCTG CAGGATGGGG TGGTTACATC
GGTCTGACCG GTACTTTGGG TGTGACCTTC AACAACTTCT CCCTGCGTAA CATTTTAAGA
AAGGAAACAT GGGATCCATT ACCAAGTGGT GACGGCCAGA AATTATCTGT GCGTGTATCA
TCCAACGGTA AAGCCTATCG CTCCTATAAC TTCTCATTTA CCGAGCCATG GTTAGGTGGT
AAAAAGCGTA ACCAGTTCTC TGTCAGCTTC TACAGCAGCT ACCAGAACCC GAACGCTTAT
TCTGCTTATA TATATGGTAG CACCCTGTCT AACAATGCTT ACTTCAAAGT ATTGGGTGGT
TCCGTATCCC TCGGTAAACA GCTGAAATGG CCTGATGACT TCTTCACGCT GATCTACTCC
CTGAACTATC AGCAGTATAA ACTGAAGAAC TATAACTATT TCAATATTCC TGGTTTCTCC
AGCGGTACTT CCAATAACGT CAACATTAAA CTGACCCTCG CCCGTTCATC TGTGAACCAG
CAGATCTATC CGAGCAGTGG TTCTAACTTC CTGTTGTCCG GACAGTTCAC GCCTCCTTAC
TCCATATTTA ATCCTAACAG GGATTACAAA CTGGAGTCTA TTCAGGATCA GTTTAACCTG
ATTGAATACC AGAAATATCG TTTCAACGCG GAGTGGTACG TTCCATTGAG CAGACCTAAA
GGTTCTGATA ACAAAGTGTT CGTACTGAAA GTAGCCGCTA AATTCGGTTA CATCGGCCGC
TACAACAACC GCACAACATT GTCTCCATTT GGTCGTTTTG AGCTGGGTGG CGATGGTCTG
AGTAACTTCG CGATCTATGA CCGTGATATC ATCTCCCAGA GAGGTTATCC GGTATACTAT
ACCTCGGATC CTAGGATGAA CTCAGAAACC GGACAACCAA CAGGTTATGA AGGTTTCACC
GTGTTCAACA AATATGTGAT GGAGTTACGT TATCCGTTCA GTCTGAACCC AAGCTCTACC
ATCTTTGGTC TGGCGTTCGT GGAGGCAGCC AATGGTTACC GCGATGTAAG AGACTTCAAT
CCGTTCAGAT TACGTCGTTC AGCAGGTCTC GGTATGCGTT TCTACCTGCC GATGTTTGGT
CTGCTTGGAT TTGACTATGG TATCGGTTTC GACCGTCTGC AGTCCGGCAA TGGTCTGAAA
GATGCTGCTA AGTTCACCTT CATGCTGGGC TTCGAACCGG AATAA
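
The gene sequence is printed in space-separated blocks of 10 bases, so it has to be cleaned before any calculation such as GC content. The following is a minimal sketch using only the Python standard library; the helper names are hypothetical, and only the first line of the sequence is used as sample input.

```python
def clean_sequence(raw: str) -> str:
    """Strip spaces and line breaks from a block-formatted sequence."""
    return "".join(raw.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First line of the gene sequence above: six blocks of 10 bases.
first_line = "ATGAAGAAAT TATTTCCTAA GAGCCTACTG GCCATAGTAT TATGTTGCAG TGCCGGCATT"
seq = clean_sequence(first_line)
print(len(seq))  # prints 60
print(round(gc_percent(seq), 1))
```

Passing the full gene sequence to `clean_sequence` and then `gc_percent` gives a value that can be compared with the GC content field in the Gene Information section.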
 
Protein sequence
MKKLFPKSLL AIVLCCSAGI RVSAQIRDTS VTPVPTPDQQ AAGLNLSGPL QAQPYEIADI 
TIVGTQYLDK SLLISLSGLN VGDKVVYPGG DQFAKAIQSL WGQRLFANVA IYVTKIEDGK
IWLEIELQER PRLNNFVFRG VKKSETEELI KKAGLRKGSV VTESMKQNAI GVISKHYGDK
GFRNATVNIT ERTDTSQVNA SDLVITVVKG GKAKVNDIQI VGNDNIDDTR VKKKMKGTKE
RTRFTLYPDI ESVYEDSTEL QENYWKTFGF LYPSRTMEQL DPYFRFKLFS SAKFNENKYA
EDKEKVIAYY NTQGYRDAVL VRDTTYRAHN GGVNVAMQIS EGKKYYFGNI TWKGNSRYND
SLLTRVLGIR KGDTYNQELL QKRLLSSEGG DIGGMYMDFG YLFFRADPVE VAIHGDTIDY
EIRISEGPQA TIKEVRIAGN EKTNEHVIRR ELRTLPGEKF SRTDLIRSNR EIANLGFFNP
EKIGMDPIPN IQDGTVDINY TVEEKANDQL ELSAGWGGYI GLTGTLGVTF NNFSLRNILR
KETWDPLPSG DGQKLSVRVS SNGKAYRSYN FSFTEPWLGG KKRNQFSVSF YSSYQNPNAY
SAYIYGSTLS NNAYFKVLGG SVSLGKQLKW PDDFFTLIYS LNYQQYKLKN YNYFNIPGFS
SGTSNNVNIK LTLARSSVNQ QIYPSSGSNF LLSGQFTPPY SIFNPNRDYK LESIQDQFNL
IEYQKYRFNA EWYVPLSRPK GSDNKVFVLK VAAKFGYIGR YNNRTTLSPF GRFELGGDGL
SNFAIYDRDI ISQRGYPVYY TSDPRMNSET GQPTGYEGFT VFNKYVMELR YPFSLNPSST
IFGLAFVEAA NGYRDVRDFN PFRLRRSAGL GMRFYLPMFG LLGFDYGIGF DRLQSGNGLK
DAAKFTFMLG FEPE
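
The protein sequence can be reproduced from the gene sequence by codon translation. The sketch below builds the codon table from the usual NCBI ordering of bases (T, C, A, G); translation table 11 assigns the same amino acids to codons as the standard table and differs only in which codons may act as starts, which this sketch does not handle. It assumes the sequence is given on the coding strand, and the function name is illustrative.

```python
from itertools import product

# Amino acids in NCBI order for codons TTT, TTC, TTA, TTG, TCT, ...
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product("TCAG", repeat=3), AMINO_ACIDS)
}


def translate(seq: str) -> str:
    """Translate a cleaned coding sequence, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


# First 60 bases and last 15 bases of the gene sequence, spaces removed.
head = "ATGAAGAAATTATTTCCTAAGAGCCTACTGGCCATAGTATTATGTTGCAGTGCCGGCATT"
tail = "TTCGAACCGGAATAA"
print(translate(head))  # prints "MKKLFPKSLLAIVLCCSAGI"
print(translate(tail))  # prints "FEPE"
```

Both results agree with the start (MKKLFPKSLL AIVLCCSAGI) and end (FEPE) of the protein sequence listed above.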