Gene Cpin_1172 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCpin_1172 
Symbol 
ID8357288 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChitinophaga pinensis DSM 2588 
KingdomBacteria 
Replicon accessionNC_013132 
Strand
Start bp1393516 
End bp1394574 
Gene Length1059 bp 
Protein Length352 aa 
Translation table11 
GC content47% 
IMG OID644963328 
Productvon Willebrand factor type A 
Protein accessionYP_003120871 
Protein GI256420218 
COG category[R] General function prediction only 
COG ID[COG4245] Uncharacterized protein encoded in toxicity protection region of plasmid R478, contains von Willebrand factor (vWF) domain 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones29 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones33 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGAAGGT TACCCATTTA TTTCCTGATA GACGTTTCTG AGTCTATGGT AGGAGAACAA 
ATTCAGTTTG TTGAGGAAGG ACTTGCAGCC ATCATAAAAG AGTTGAAATC TGACCCATAT
GCACTGGAAA CTGCATGGGT ATCTATCATC GTGTTTGCCG GTCAGGCAAA AACAATAGTA
CCATTGCAGG AAGTGATCAG CTTCTATCCG CCTAAGTTCC CAATCGGCGC AGGTACTTCC
CTCAGTAATG GCCTGGGACA CCTGATGTAT GAAATGCGTA AGAATACCAT ACACACCACC
GCTACACAGA AAGGCGACTG GAAACCGATC GTATTCCTTT TCACAGACGG TACACCAACT
GATGATACAA GCGCTGCTGT CCGTGAATGG AAGCAGAACT GGCAAAACAA ATCAAACCTG
ATTGCAATCT CCTTCGGAGA TGAAAATAAT CTGAGTGCCC TGAAGGAACT CACAGAGACG
GTATTGTTAT TTAAAAACGC AACGCCACAG TCCTACAAAG AATTCTTCAG ATGGGTGACT
GCATCCATTA AATCCAGCAG TGTGAGCGTC AGCAACAACG GTTCTGGTAT AGAGCTAGCC
AAGCTGAGCG ACGGTGTGAT TGAAAAGGTG GATGTGGATA AGCTGCCACC TGTAAAGCCT
GGAGTCATAG ATACCAATTA CGCCATCTTT GCCGCCAAGT GCCAGAACAC CGGCAAACCT
TATCTGATCA AGTATAAGAA AGGGTTGCGT CCTGTAAATA TCAGCGGGCT CGATATGCAG
TCTATGGGAT ACCGCCTCAA CGGTGCATTT CCTGTAGATA ATATGTATTT TGAATTGTCT
GCCGAAGGCG GTGTGCTCAA CAGCAAGGTG AGCACTGATG AACTGGAAGG ATTTCCTACC
TGTCCGTGCT GCGGTAACCA GTACGGTTTC AGTGTCTGTT CCTGCGGAAA GGTACATTGC
ATAGGACAGG AGGAGGTAAG TACCTGCTCC TGGTGTGGTA AACAAGGCAA ATACGGCTTT
GGTGACGGGC ACCTGGATAT TAACAGAACT CAGGGATAA
 
Protein sequence
MRRLPIYFLI DVSESMVGEQ IQFVEEGLAA IIKELKSDPY ALETAWVSII VFAGQAKTIV 
PLQEVISFYP PKFPIGAGTS LSNGLGHLMY EMRKNTIHTT ATQKGDWKPI VFLFTDGTPT
DDTSAAVREW KQNWQNKSNL IAISFGDENN LSALKELTET VLLFKNATPQ SYKEFFRWVT
ASIKSSSVSV SNNGSGIELA KLSDGVIEKV DVDKLPPVKP GVIDTNYAIF AAKCQNTGKP
YLIKYKKGLR PVNISGLDMQ SMGYRLNGAF PVDNMYFELS AEGGVLNSKV STDELEGFPT
CPCCGNQYGF SVCSCGKVHC IGQEEVSTCS WCGKQGKYGF GDGHLDINRT QG