Gene Cpin_3703 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCpin_3703 
Symbol 
ID8359871 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChitinophaga pinensis DSM 2588 
KingdomBacteria 
Replicon accessionNC_013132 
Strand
Start bp4642036 
End bp4644225 
Gene Length2190 bp 
Protein Length729 aa 
Translation table11 
GC content46% 
IMG OID644965872 
Productlipopolysaccharide biosynthesis protein 
Protein accessionYP_003123366 
Protein GI256422713 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG3206] Uncharacterized protein involved in exopolysaccharide biosynthesis 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0000380193 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clones25 
Fosmid unclonability p-value0.301221 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGATATAA TATATTTCGT TAAAGCCCTG CTGAAAAAGA AATGGTGGAT CATCGTAAGT 
ACCATCATCG CCATTGTTGC CGCGTTTGTT TTTACCCTCG GCAAACCCCG TATGTATGCC
TCAGTAACCC AGATAGCAAC CGGATTTACC GTTAATGACC AGGTGAAACT CCGTGACGAG
AATGTGAACA TATTTGAAGC AGACGTAAAA TTTGACAACG CGATCGAAAC CATCAACTCA
CCGGTGGTGA TCGGAATGCT GTCTTATAAC CTGTTACTCC ATGATCTGAC TACGACCAAA
CCATACGTAC AGCTTGAGGC GAACGACCTG AAATCAGAAG CCTATCGTAA GGTAGACAAA
CAGGCGGCAG TGACCATCCT GCGTAACAAG CTGGATTCAT TGCAGGTGCT CTCCTCTTAT
AACCCGGTAG AACGCGATAT CCTGGGATAT CTGAAACTAT ACGGATATGA CTACGAAGCC
ATCCGCAAAC ACCTGAATGC AGGTCGTGTA CAACGTACCG ATTACCTGGA GATCGTGTAT
ACTGCGGAAA ATCCTGAGCA GGCAGCTTAT ACTGTAAATA CCGTTTACCG CGAGTTTATC
CGCTATTACA GAAGCATGCG CTCAGAAAGA TCTGTTGAAA GTGTTGAATC TTTCGATCAG
CTGGCTGTTC AGAAGAAAGC CGAACTCGAT AAGAAAGTAG AAGCACTGCG TGCCTATAAA
GCATCTGAAG GACTGCTGAA CGTTGAAACA GCCAGTGGTA ACGAACTGGA CCTGATCAAA
CAGTTTGAAA AAGGACTGTC TGATGAACAG GCGAATTATA ATACCATTAC CTCTTCCCTG
GAAAGTGTAA ATGGTCGCCT TGCCAGTGCC AATGCGGGGA AAACTGTTTA TACCAATGCT
AACAGTGAGA TCATCGAATT ACGCAAACAG ATCAATGATC TGAATGACGA CCTGACTCAG
AAAGGAGGTA ATGATGATGC CATGCGCACT AAACTGGCTG GCCTTCGTTC CCAACTGCAA
AAGAAACTGG GCGCTGCTAC CTCCGGCACA CAGAGCGCTA CCACCAAAGA TGCGCTCATT
CAGGAGAAAG CCAACCTGGA AGCACAACAG AACGCTTCCC GGCTGAACAT GAAAAATCTC
CAGAGCCAGA TCTATAAACT AAGAGGTTCT GTAGGTTCCT ATGCCAATAA AGAAGCAACT
GTAAGTAGCT TACAGTCTGA AGTGGACATG GCACAGGAAG AGTACAATAA ACTCAAGGAA
AAACTGAATG CGGCACAGGA CAACCAGACT ACACCTGACC TGAACTTCAA ACAGGTACTG
AAAGGTCAGC CGGCATTTAA ACCGGAGTCT TCCAAACGCG TAATCATTAT GGGTATGGCA
GGTATTTCCG TATTCCTGCT GACGTCCTTT ATTGTATTGC TGCTGGAATT CCTGGACGGT
TCCCTGAAGT CTCCTTCCGT ATTTGAGAAA CACCTGGACC TTCGTCTGAT CAGCAGTGTC
AACCATGCTG ACCTCAATAA ATACAGCATC CTGGAAGTAT TACAGCGCAC CACGTTGCCT
GATAAAACAA CCAAACAACG TCAGAACACT TTCCGTGAAC TGCTGCGTAA ACTGCGCTAT
GAAGTAGAAA GCAGCGGTAA GAAGTCATTC CTTATCACGA GTACGGAATC CCGCCAGGGT
AAGACCACGC TGACACAGGC CCTTGCCTAC AGTTTAAGTC TGAGCAACAA GAATGTACTG
GTGATAGATA CCAACTTCTG TAACAACGAT ATCACGGTAC AGATGGAAGC TGCGCCAACA
CTGGAATCAT TCTCTGTACC GCCAACTGAG TTGAGTATAG ACAAGGTGAA GGAAATCGTA
ACAACCTATG CAGTATCTGG TATTGAAGTA ATCGGTTGTA AGGGTGGCGA CTATACCCCA
TCTGAAATAC TGCCGAAAAA TAATCTCTTA AATTACCTCC CCTTCCTCAC ACATCATTAT
GATTTCATAC TGCTGGAAGG CGCTCCGCTC AACGACTATA CCGACAGTAA AGAGCTGGCG
CAATACGTAG ATGGCGTAAT CGCTGTGTTT TCGTCCAAGT TGTCCCTTAC CCAGATAGAT
CGTGAATCCA CACAGTTTTT TGAGACACTT GGTGATAAAT TCGTGGGAGC CGTACTTAAT
AACGTACAGG AAGAATACCT CGAATTATAA
 
Protein sequence
MDIIYFVKAL LKKKWWIIVS TIIAIVAAFV FTLGKPRMYA SVTQIATGFT VNDQVKLRDE 
NVNIFEADVK FDNAIETINS PVVIGMLSYN LLLHDLTTTK PYVQLEANDL KSEAYRKVDK
QAAVTILRNK LDSLQVLSSY NPVERDILGY LKLYGYDYEA IRKHLNAGRV QRTDYLEIVY
TAENPEQAAY TVNTVYREFI RYYRSMRSER SVESVESFDQ LAVQKKAELD KKVEALRAYK
ASEGLLNVET ASGNELDLIK QFEKGLSDEQ ANYNTITSSL ESVNGRLASA NAGKTVYTNA
NSEIIELRKQ INDLNDDLTQ KGGNDDAMRT KLAGLRSQLQ KKLGAATSGT QSATTKDALI
QEKANLEAQQ NASRLNMKNL QSQIYKLRGS VGSYANKEAT VSSLQSEVDM AQEEYNKLKE
KLNAAQDNQT TPDLNFKQVL KGQPAFKPES SKRVIIMGMA GISVFLLTSF IVLLLEFLDG
SLKSPSVFEK HLDLRLISSV NHADLNKYSI LEVLQRTTLP DKTTKQRQNT FRELLRKLRY
EVESSGKKSF LITSTESRQG KTTLTQALAY SLSLSNKNVL VIDTNFCNND ITVQMEAAPT
LESFSVPPTE LSIDKVKEIV TTYAVSGIEV IGCKGGDYTP SEILPKNNLL NYLPFLTHHY
DFILLEGAPL NDYTDSKELA QYVDGVIAVF SSKLSLTQID RESTQFFETL GDKFVGAVLN
NVQEEYLEL