Gene Cpin_5045 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCpin_5045 
Symbol 
ID8361221 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChitinophaga pinensis DSM 2588 
KingdomBacteria 
Replicon accessionNC_013132 
Strand
Start bp6291565 
End bp6292833 
Gene Length1269 bp 
Protein Length422 aa 
Translation table11 
GC content49% 
IMG OID644967194 
Productglycosyl transferase group 1 
Protein accessionYP_003124679 
Protein GI256424026 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0438] Glycosyltransferase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value0.022612 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGGACGGA GAATTGCATT TATCAGCGAG CATGCCTCGC CACTGGCTGC TTTGGGCGGT 
ACTGATGCGG GAGGACAAAA TGTATATGTA GGGGAGGTAG CCAGGCAACT GGCCCATAAA
GGATATCAAA TCGATATTTT TACCCGGCGG GATGAGAAGA AGCTACCAAA GGTTGTCATT
TTCAGCGACA TGATCCGGAT CGTACATGTA GATGCAGGTC CGGCAGAAGA TATTCCCAAA
GAAACGCTCT TGCAATATAT GCCTGCTTTT ACCAGCGATA TGTTGCAGTT TATACAGGAA
GAACAGCTGT CTTATGAGCT GGTCCACGCA AATTTCTTTA TGTCAGCTCT CGTAGCGATG
GACATAAAGG CGGCACTCGA TATACCGTTT GTGGTCACCT TTCATGCATT AGGTCACATC
CGCCGTATTC ACCAGGGTAA TAATGACGCC TTTCCAGAGG AGCGGATTGC CATTGAAGAA
AGGGCAGTAA GAGAGGCTAG TCTGATTATT GCCGAATGCC CGCAGGACAG AGATGATCTT
ATCAACTATT ATCATGCACC GGTAGATAAA ATTACCATCA TACCTTGTGG GGTCAATACG
GAAGAATTCT ACCCGCTGAA TAAGTCCGTT GCACGTTCCC TGCTGAAGCT ATCACAGGAT
GAGCGCATTC TGTTGCAGCT GGGAAGAATG GTACCGAGAA AAGGGGTAGA TAATGTGATC
GTGGCCCTGT CCAAACTGAA ATTCAGGGAC CTGAGAATGA AACTACTCAT AGTAGGAGGA
GACGCTGATA CGGTGAATGA ACTGCACAGA CTACGTTCGC TGGCGGAAGA GCTGAATGTC
AGTGAGCGGG TTGTTTTCGT CGGACAGAAG GAGCGGGAAG AGCTTAAATA CTATTACGCG
GCAGCCGATC TGTTTATTAC GACACCCTGG TACGAGCCAT TTGGTATTAC GCCATTGGAG
GCGATGGCCT GTGGAACGCC GGTCATCGGA AGTAATGTAG GCGGTATTAA ATTCAGCGTG
CTGGAAGGAA AGACAGGAGC GCTTGTACCG CCCAAAGATG CGGATGCACT GGCTGCAAAG
ATCAATTCAT TGTTGCGTTC TCCGGTGAGA CTACGGGAGA TGAGTGCGAA TGCGGTCCGG
AGGATCAACA AACTGTTTAC CTGGGAACTG GTAGCCCAGG ATATGCAGGC GGTATATGAA
AAGATCATCG GGCGTAGCCG TGTTGTCTAC GCAAAAAATT CCGGTAAAGG CGAAAGGCTC
AGTTTATGA
 
Protein sequence
MGRRIAFISE HASPLAALGG TDAGGQNVYV GEVARQLAHK GYQIDIFTRR DEKKLPKVVI 
FSDMIRIVHV DAGPAEDIPK ETLLQYMPAF TSDMLQFIQE EQLSYELVHA NFFMSALVAM
DIKAALDIPF VVTFHALGHI RRIHQGNNDA FPEERIAIEE RAVREASLII AECPQDRDDL
INYYHAPVDK ITIIPCGVNT EEFYPLNKSV ARSLLKLSQD ERILLQLGRM VPRKGVDNVI
VALSKLKFRD LRMKLLIVGG DADTVNELHR LRSLAEELNV SERVVFVGQK EREELKYYYA
AADLFITTPW YEPFGITPLE AMACGTPVIG SNVGGIKFSV LEGKTGALVP PKDADALAAK
INSLLRSPVR LREMSANAVR RINKLFTWEL VAQDMQAVYE KIIGRSRVVY AKNSGKGERL
SL