Gene Cpin_4707 details

Gene Information

Locus tag: Cpin_4707
Symbol:
ID: 8360883
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Chitinophaga pinensis DSM 2588
Kingdom: Bacteria
Replicon accession: NC_013132
Strand:
Start bp: 5864838
End bp: 5865974
Gene Length: 1137 bp
Protein Length: 378 aa
Translation table: 11
GC content: 51%
IMG OID: 644966860
Product: glycosyl transferase group 1
Protein accession: YP_003124345
Protein GI: 256423692
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG0438] Glycosyltransferase
TIGRFAM ID:
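
The coordinate and length fields above are related by simple arithmetic. The sketch below shows one way to check that they agree. It assumes the start and end positions are 1-based and inclusive, and that the gene length includes the terminal stop codon (the listed gene sequence ends in TAA). The function names are hypothetical and are not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Number of residues, assuming an unspliced CDS that ends in one stop codon."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon encodes one residue except the final stop codon.
    return gene_length // 3 - 1


length = gene_length_bp(5864838, 5865974)
print(length)                     # 1137, matching the Gene Length field
print(protein_length_aa(length))  # 378, matching the Protein Length field
```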


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value: 0.903735
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 23
Fosmid unclonability p-value: 0.103608
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAGATAG GTATTGTATC CATTATTAAA GAGCCATGGG GCGGGAGCGA AGAGCTCTGG 
GCCGATATGG CGGCAGAAGC GCTGAAGGAA GGGCATGAGG TGTTTGTATC CGCCCTGAAA
TGTGGTCCTC CGCATCCCAA AACCGTCAAC CTGCTGAAAA ATGGAGCCAA ACTTTTCTAT
CGCCGTGGAT TTGTACAACC GGGAATCCCA TTCCTGCAAC GCATCTTCCG CAAGGGCCTG
ATCATTCTCG CCAATAAATT ACAGAATCCT TTTCATCGTC TCTTCAAAGA GCGTCCTGAC
GTGATCTTTT ACGTGGGTAC TTCCTATTCC ATCGGAGAAG ACGTCCTCCT GCTCAAAGCA
CTGGACAAAA CACCCGCAGC CCTTTATATC AACTGTAACC TGAACCATGA CGTACGTGGT
TTCGGCGGTA CGCGCTATGA CCTGATCAAG GCCGCTTATC AGCGTGCAAG GAAGGTCCTG
TTTGTATCCG AAGGGAATCT GGAAATAGCC CGCAGACATC TCTGTTCCAA CATCGACAAT
GCAATGGTGA TCCGGAATCC GGTGAACCTC GCCAATATCG GTATACTGCC CTTCCCCGAG
GAGGAAACCG TGAATTTTGC AATGGTAGGT ATACTGGTAA CCGATCATAA AGGACAGGAC
CTGGTACTGG GCGCCCTGCA CCAGGAACAA TGGCGCGCCC GTAAATGGCA TCTGAACATC
TACGGTGCCG GTCTGGATGA AGGATATCTC AAACAGCTGA CCGCCTTCTA CGGACTGAAT
GACCGCGTTA CCTTTCACGG TAAAGTCAAC GATATCAAGG AAGTGTGGAA AAAAAACAGC
CTTTTATTGA TGCCCTCCCG CCAAGAAGGC ATGCCGCTTG CTGTAGTGGA GGCCATGGTG
TGTGGACGCC CCTCCGTACT GACGGATGTT GGCGGACACC GGGAATGGGT AACAGAAGGA
CAGCAGGGTT TTATAAGCCC CGGCGCTACC GTCAGGTCTA TGGCGGATGC CATGGAAAGA
GCCTGGGAGC AACGCGCTGA CTGGAAGCAA ATGGGTATTG CTGCCAACAC TGCCGCCATG
GAACGATATA ATCCCGTACC TGGTAAAACG ATACTGAAAC TATTATTAAC TGCATAA
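
A GC content figure like the one reported in the Gene Information fields can be recomputed from a sequence laid out in this style. The following is a minimal sketch. It assumes the sequence is plain text in 10-base groups separated by spaces and line breaks, and it counts only unambiguous A, C, G, and T characters. The function name `gc_fraction` is hypothetical.

```python
def gc_fraction(sequence: str) -> float:
    """Fraction of G and C among the A/C/G/T characters in `sequence`.

    Spaces, line breaks, and any other characters are ignored, so the
    grouped sequence text can be passed in unchanged.
    """
    bases = [base for base in sequence.upper() if base in "ACGT"]
    if not bases:
        raise ValueError("no nucleotide characters found")
    gc_count = sum(1 for base in bases if base in "GC")
    return gc_count / len(bases)


# Usage: the first three 10-base groups of the gene sequence above.
# Pass the full sequence text to compare against the reported GC content.
first_groups = "ATGAAGATAG GTATTGTATC CATTATTAAA"
print(f"{gc_fraction(first_groups):.0%}")
```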
 
Protein sequence
MKIGIVSIIK EPWGGSEELW ADMAAEALKE GHEVFVSALK CGPPHPKTVN LLKNGAKLFY 
RRGFVQPGIP FLQRIFRKGL IILANKLQNP FHRLFKERPD VIFYVGTSYS IGEDVLLLKA
LDKTPAALYI NCNLNHDVRG FGGTRYDLIK AAYQRARKVL FVSEGNLEIA RRHLCSNIDN
AMVIRNPVNL ANIGILPFPE EETVNFAMVG ILVTDHKGQD LVLGALHQEQ WRARKWHLNI
YGAGLDEGYL KQLTAFYGLN DRVTFHGKVN DIKEVWKKNS LLLMPSRQEG MPLAVVEAMV
CGRPSVLTDV GGHREWVTEG QQGFISPGAT VRSMADAMER AWEQRADWKQ MGIAANTAAM
ERYNPVPGKT ILKLLLTA