Gene Ccur_03840 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCcur_03840 
Symbol 
ID8374592 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCryptobacterium curtum DSM 15641 
KingdomBacteria 
Replicon accessionNC_013170 
Strand
Start bp449453 
End bp451276 
Gene Length1824 bp 
Protein Length607 aa 
Translation table11 
GC content49% 
IMG OID644993308 
Productputative collagen-binding protein 
Protein accessionYP_003150790 
Protein GI256826831 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG4932] Predicted outer membrane protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones98 
Fosmid unclonability p-value0.173471 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAACGAA AACTCTCCTA CCTCATGAGC GTACTGCTTG TTGCAGGGCT TATGATTCCC 
GCGGGGTCGC TCCAGGCGTT CGCGGACGAT ACGACTACTG AGTCGAGCAC ACCGGCTGTA
TCAGTCGACG CGGTAGCTCC TACATCAACT GGTGTAGTGG ATACGACAGC AGCGCCTGCT
GCTGCAGATT CCACCGAGGC AACTGATGCC ACTGCTACTA TCAACAGCAC AGATTCTGTA
ACTGAAGAAA CCCCTACTGC TGACACGTCG TCAGCATCTT CAGCAACAGC CGATATGCAG
GCTACTGTGA GTGAAGATGC GACCACGAAT AGCCTTGATG TTGCAACGCC ACTGGCTGTA
TCGAGTCTTG CGACAGATGT ATCTGATGCT CTGTTGACCG CCGGCTTAAC CCGTGCGGGT
CCACTGCAGC GTCCTGGTGC TGGACAAGAA GTCTCTGTCA ACCTGACAGG TTTCAAGTTC
AAAAATATGG ATCATAACGA CACCACTGAA ATCCGTGCCA ATTACAACTT CTTTTTAAGT
ATGGATTGGG ATGCTAGTGG TAAATCGATT CATGAGGGCG ACTATTTCGA TATCAAATTG
CCCGATCAGA TGAAATTCCC TGAGGGCAAT TCCAACTTTG ACATCACCGA TAGCGACGGC
AATGTTATTG CGCATGGCGA AATTCAACCT GGACCAGACA ACAATGGTGG CAAGGTCCAT
GTTACTTTCA CCAACTACGT GAACAACCGT TATGATATCA AGGGTAACAT CCAGGTTAAA
ACGCGCTTCA ATACTTCAAG GATTACCCCT AACGAGAATA ATAACTTCAC TATTGAGACC
GGCGGCAAAA CGAAGACAAG TACGGTTAAA GTGATTGACC CATGGAAGCT TGTTGACGAA
ACCGTTGGTA AGTGGGGCGA ACAGGATGGC AGCGATCCAA ATACCACGGT CTGGTATGTT
CGTATCAACC ACATGAAAGA TAACCTGACT AATGTCGTGA TAAGCGATCA TCTCACCTCA
GATCAAGGGC TCGATGGTAT TCACTACATA GCGGGTTCAT TTGAATTGCA ACATGTCACC
ATGGACGAGT TTGGGCATAC GACATCGGTC ATTTCACGAA ATGATATCAA CAGCTCCGTT
CAGATTTCTG CCGATGGTAC TTCTTTTACA TACCCGATGG GCAACATCAA TGGTGACCAG
TACGTGCTGC GCTACAAGAC CACGTACAAG CCCGGTATGA AGCTGTTGAA TCACATTAAG
CTTGTTTCAA CTGAAAAGAC CAAGGAAGCC AGTTCAAGCT ATCATTACAC CGGCTCAAAC
GGTAACGGTG AAGGCAGCCT GTTCAGTAAG ATCAAAATCA TTAAGGTTTC TGCATCTGAC
GAGACGGTAA AACTTGCCGG TGCTGTATTC ACGATTACGA GCGTGTCAGA TCCGACAAGG
ACCTACAGCT TGACAACCGA TGAAAACGGC GAAGCAATTA CGGACAGACT GGTAGCAGGT
CAGTACACGA TCAAAGAGGT TACTGCGCCT GAGGGATATC TCGTAAACGA TGAAACGTAT
ACAGTAACCG TAACTTCGAA TGCGGCCACG ATTCAGACCG TGAAGGACGA ACAGAAGCCT
GAGGTTCCAG ACAATCCCAA TCCTGGTCCT AACCCAGAAC CGTCACCGAA CCCCGAACCT
TCTCCTAGCC CGGAGCCTCC GACACCCGAG TCAAGTGTAC TTCCGCTGAC AGCCGATGGA
TTTATGCCAC TGGCCGGCAT GCTTGGGTTT GCTGCTCTGG CTGGAGCGGG TCTTGCCGTG
CGGGCTTGGC GTTGCAGCCG ATAA
 
Protein sequence
MKRKLSYLMS VLLVAGLMIP AGSLQAFADD TTTESSTPAV SVDAVAPTST GVVDTTAAPA 
AADSTEATDA TATINSTDSV TEETPTADTS SASSATADMQ ATVSEDATTN SLDVATPLAV
SSLATDVSDA LLTAGLTRAG PLQRPGAGQE VSVNLTGFKF KNMDHNDTTE IRANYNFFLS
MDWDASGKSI HEGDYFDIKL PDQMKFPEGN SNFDITDSDG NVIAHGEIQP GPDNNGGKVH
VTFTNYVNNR YDIKGNIQVK TRFNTSRITP NENNNFTIET GGKTKTSTVK VIDPWKLVDE
TVGKWGEQDG SDPNTTVWYV RINHMKDNLT NVVISDHLTS DQGLDGIHYI AGSFELQHVT
MDEFGHTTSV ISRNDINSSV QISADGTSFT YPMGNINGDQ YVLRYKTTYK PGMKLLNHIK
LVSTEKTKEA SSSYHYTGSN GNGEGSLFSK IKIIKVSASD ETVKLAGAVF TITSVSDPTR
TYSLTTDENG EAITDRLVAG QYTIKEVTAP EGYLVNDETY TVTVTSNAAT IQTVKDEQKP
EVPDNPNPGP NPEPSPNPEP SPSPEPPTPE SSVLPLTADG FMPLAGMLGF AALAGAGLAV
RAWRCSR