Gene Ccur_13730 details

Gene Information

Locus tag: Ccur_13730
Symbol:
ID: 8375578
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Cryptobacterium curtum DSM 15641
Kingdom: Bacteria
Replicon accession: NC_013170
Strand:
Start bp: 1559043
End bp: 1561934
Gene Length: 2892 bp
Protein Length: 963 aa
Translation table: 11
GC content: 48%
IMG OID: 644994289
Product: putative collagen-binding protein
Protein accession: YP_003151730
Protein GI: 256827771
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG4932] Predicted outer membrane protein
TIGRFAM ID: [TIGR01167] LPXTG-motif cell wall anchor domain
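
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the gene length includes the stop codon; neither convention is stated in the record itself. The function names are illustrative only.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 1559043
END_BP = 1561934
GENE_LENGTH_BP = 2892
PROTEIN_LENGTH_AA = 963


def span_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Number of codons in the CDS minus one for the stop codon (assumed included)."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


print(span_length(START_BP, END_BP))            # 2892
print(expected_protein_length(GENE_LENGTH_BP))  # 963
```

Under these assumptions both results agree with the listed gene length (2892 bp) and protein length (963 aa).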


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 120
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGAAAAGAA AGTTATGTTG CCTCATAAGT GCTGTACTGG CATTCAGTAT CTGTATGCCT 
GCGGGAGCCT TGGAGGCGGT TGCCCTTGAT AACGAAGGAA GCACAAGCTC TGCGAGTGCA
ACGCAGGAAA GTCCTGCCAC TTCTATAGAG AACAGTGAAA ACAGCCGTGC AGTTGTAAAT
GCTGAGGCTC CCATGGCACA AGATGAGTGG CAAGCGAGTC AATCTGATAC AGCTAACGGA
AGCAAACAGA CATCATCTAC GGGCACCGTT GACTTGACAA GGATGATCGA GCCGGTAGAT
CTTAATACTT CTCACAAGGT GTCTAAGCGT GCTGCTTCTG GATCAAGTGG CGTTGAGCTT
CCTAATGTTG TTACGAGTGC CAAAGTTACT ACCGCTACTG GCACGACACC GGTTAATGTG
GGGGCGTGGC AAGCATTCAA GCTTAATTTC CATTATGCAC TTCCAAATCT TGGCGTGCAT
GCGGGCGACA CAACTACGAT AGAGCTTCCA GCCGGGTTCA AAAGCGCTCC GCCAACTGAC
TTTGTCATTC AAGACGGATC AGGCAATATT ATTGCGCGCG GTAAGACTGA CCCGACCAAC
ACCAAGTTTA TTATTACATA TACGGACTAC GCTGAGGGTA AATCGGATAT CTCTGGAGAC
TTCTCGGTTA ATGTTCAGAT CGATAATGAT GTACACACTC ATTCTGGCGT CTTACCAGTT
AATCCGGTGA TAAGCGGTGA AACAGTTCCG GCTGGCAATG TCAACTACAC GGTACATACT
GAAACAGCTG TGCCTATTAT TAAATCTGGC TGGGCTAACG CTTTCGATAC CACTAAGGGC
GTTTGGCAGG TAAAGATTAA TCAAGATGGC AAGGCGTATA CGAATGCAGT ACTTGATGAC
AGCCTTTTAA CTCCAGGCGT GTCGTTTATT CCTGGTACAC TTGAAGTATT CGAAGGAACC
TGGCAGCTTC AAGGCACGAG CTATAAACTT GTTGGGCAGA CCAATGTTAC AAGTCAGTAT
GCTTCAAAAA TTACTTATAA CGGAACAAAC TTTAAGCTGA ATCTGGGAAA TATCCCTGCC
ACTAAAGGGC TATTGGTACG TTTCCAAACC AAGATTAACT ACACACCTCT TCCCGGTGAA
AAGTTCGAAA ATAAAGCAAG TCTCACTGAT AACGGCGTAA CTAAAGAAAG TAAGGCCTAT
TACCTTCTAC CAACCTCTGG TGGCACTGGT GAAGGTTACA AGTACAAGAT TAATATCAAA
AAGACTGACG AATCAGGAAA TCCACTTGCC GGAGCAGTTT TTGATATTGT TCGTGCTCGT
TCTGGCGCGG TTGTTGGCCA AGTAACAACA AATGCGTCTG GCGAAGCAAG CCTGGGCGGT
CTTTTGCGTG ATGGTTATAT CATTAAGGAA ACTACTCCGC CTTCTGGATA TCTTGCTGCA
GCTGATCAGA CGATTGCCGA TACTGATTTC TCAACAGCCA CTCAAGACGT CACGCGCACG
TTTGTCGATA AAGCCATTCC GCCTACTGTT AATGTCTCCG TTGAAAAGAC GTGGAGCGAT
GCCGACAATC AAGACGGAAT GCGTCCGACT TCGGTGACGG TTCATTTATA TGCTGATGGT
GTCGATACCG GTAAGACAGT AACGCTTGAT GGAAGCAATT CGTGGAAAGA TACTTTCGCG
AGCCTCGATA AGAAGAATGC AGCTGGTAAC GATATCGTTT ATACCGTGGC TGAGGATCCA
ACTCCGTCAG GGTATATCGC TGCTGTTACT GGTTCTGCGG TCGCAGGCTT TACCATCACC
AACACCCATA CCCCTGAAAC CATCAATATC CCAGTAACCA AGAAGTGGGT TGGGCCAGAA
GGCTCATCAG TTACTGTCAA GCTTTTGGCT GATGGGGTAG ATAGTGGCAA GTCTGTCACG
CTTTCCTCAG CTAATAGCTG GAGCGATACG TTTACTAACC TGCCTAAGTA CAAAAATGGT
ACTGCTATTA CCTACACCGT TGATGAATCA TCAGTTACTG GTGTGGATGC AACCAAGTAC
ACAACAGCTA TAAGCGGCAG TGCTACAGCG GGATACACTA TCACTAACAC CAATAAAGAA
AAGATTGACA TCTCAGGAAC AAAGACCTGG AATGATGATG GCAATCGTGA TGGAGCGCGT
CCGTCTTCTA TCACCATCAA CCTTCTGGCC GATGGCACCC AAGTAGACTC AAAGGCAGTC
ACCCCGGATG CATCAGGTGC ATGGAGCTAT AGCTTTGCTG GTCTTGCTAA GTATTCTGCA
ACCGATGGTC ACCAGATTGC CTACACGATC ACTGAGAATG CTGTTGCTGA TTATTCAACA
ACCATTACTG ATTATGATGT CACAAACACT CATACACCAG CTCAAACATC GCTGACAGTT
ACTAAAGCAT GGAGTGATGA TAATGACCGC GATGGCGTGC GTCCTTCCTC TGTGGAAGTA
GTGCTCTATG CCAATGGAGT AGCAAAGGGA ACACCTGTTA CTCTCAATGC TGCCAATAAT
TGGTCATATA CGTGGACTGG CCTTGACCAG AAGGACAATG GCACCAACAT TGTCTACACG
GTCGATGAGC CCACTGTTCC CACTGGATAC ACCAAAGAGG TAACGGGGGA TGCCACCAGT
GGCTTTACCA TCACCAACAC CCATACCCCC ACTCCTCCAG AGCCGGGCCC AAATCCTGCT
CCCGAGCCCG ATCCAACACC AGCACCGAGC CCAGATCCTG ATCCAGACCC CAATGGCAAA
GGACCAGCTT CCATCCTTCC CAAAACCGCT GATGAAGGAA CTCTGTTTGC TGGAGCTGCT
GGTTTAGCTA TTCTCTCTGC TGTCGGTGGA GCGATTGCTG TGACTGCTCG CCGTCGCGAA
GAGCAGGATT AG
 
Protein sequence
MKRKLCCLIS AVLAFSICMP AGALEAVALD NEGSTSSASA TQESPATSIE NSENSRAVVN 
AEAPMAQDEW QASQSDTANG SKQTSSTGTV DLTRMIEPVD LNTSHKVSKR AASGSSGVEL
PNVVTSAKVT TATGTTPVNV GAWQAFKLNF HYALPNLGVH AGDTTTIELP AGFKSAPPTD
FVIQDGSGNI IARGKTDPTN TKFIITYTDY AEGKSDISGD FSVNVQIDND VHTHSGVLPV
NPVISGETVP AGNVNYTVHT ETAVPIIKSG WANAFDTTKG VWQVKINQDG KAYTNAVLDD
SLLTPGVSFI PGTLEVFEGT WQLQGTSYKL VGQTNVTSQY ASKITYNGTN FKLNLGNIPA
TKGLLVRFQT KINYTPLPGE KFENKASLTD NGVTKESKAY YLLPTSGGTG EGYKYKINIK
KTDESGNPLA GAVFDIVRAR SGAVVGQVTT NASGEASLGG LLRDGYIIKE TTPPSGYLAA
ADQTIADTDF STATQDVTRT FVDKAIPPTV NVSVEKTWSD ADNQDGMRPT SVTVHLYADG
VDTGKTVTLD GSNSWKDTFA SLDKKNAAGN DIVYTVAEDP TPSGYIAAVT GSAVAGFTIT
NTHTPETINI PVTKKWVGPE GSSVTVKLLA DGVDSGKSVT LSSANSWSDT FTNLPKYKNG
TAITYTVDES SVTGVDATKY TTAISGSATA GYTITNTNKE KIDISGTKTW NDDGNRDGAR
PSSITINLLA DGTQVDSKAV TPDASGAWSY SFAGLAKYSA TDGHQIAYTI TENAVADYST
TITDYDVTNT HTPAQTSLTV TKAWSDDNDR DGVRPSSVEV VLYANGVAKG TPVTLNAANN
WSYTWTGLDQ KDNGTNIVYT VDEPTVPTGY TKEVTGDATS GFTITNTHTP TPPEPGPNPA
PEPDPTPAPS PDPDPDPNGK GPASILPKTA DEGTLFAGAA GLAILSAVGG AIAVTARRRE
EQD
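
The gene sequence begins with GTG, yet the protein begins with M. This is consistent with translation table 11, where GTG is one of the permitted initiation codons and an initiation codon is read as methionine. The following is a minimal sketch, not the database's own pipeline: it builds the codon table by hand (table 11 shares its amino acid assignments with the standard code), and `translate_cds` is a hypothetical helper written for this illustration. It translates the first 60 bases and the last 12 bases of the gene sequence listed in this record.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G; third position varies fastest).
# Table 11 uses the same amino acid assignments as the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: aa
    for (b1, b2, b3), aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Initiation codons permitted by translation table 11.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def translate_cds(dna: str, has_start: bool = True) -> str:
    """Translate a CDS fragment; whitespace in the input is ignored.

    If has_start is True and the first codon is a table 11 initiation
    codon, it is translated as M. Translation stops at the first stop codon.
    """
    seq = "".join(dna.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = []
    for i in range(0, len(seq), 3):
        codon = seq[i:i + 3]
        if i == 0 and has_start and codon in START_CODONS:
            aa = "M"
        else:
            aa = CODON_TABLE[codon]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First line of the gene sequence (60 bases) and its final 12 bases.
head = "GTGAAAAGAA AGTTATGTTG CCTCATAAGT GCTGTACTGG CATTCAGTAT CTGTATGCCT"
tail = "GAGCAGGATT AG"

print(translate_cds(head))                   # MKRKLCCLISAVLAFSICMP
print(translate_cds(tail, has_start=False))  # EQD
```

The two outputs match the first 20 residues and the last 3 residues of the protein sequence above; the final codon TAG is a stop codon and contributes no residue.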