Gene Xcel_1948 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagXcel_1948 
Symbol 
ID8649478 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameXylanimonas cellulosilytica DSM 15894 
KingdomBacteria 
Replicon accessionNC_013530 
Strand
Start bp2104758 
End bp2106590 
Gene Length1833 bp 
Protein Length610 aa 
Translation table11 
GC content73% 
IMG OID 
ProductCollagen triple helix repeat protein 
Protein accessionYP_003326525 
Protein GI269956736 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value0.584968 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGCACCA CCACGTCGAG CCCGACCCGT CACGGCGCAG GCGGGGCGCA CCCGGTCGGC 
TGCGGCTGCG GGGGCGGCGG CCAGTGCACC TACGGCCCGT TCACCCGCAA CAGCTACTGG
TACGGCAAGC TGATGCTGCC GCAGGACTTC ATCGACGAGC AGCAGTACGT GCGCGACAAG
ATCCGCCACC ACAACCAGCG GCTGCACGGC TGCGGCGTCG CGTGCGGGCT CGTCGTCGAG
CAGGACACGG CGCCGGACTG CCGGGACCGG ATCGTCGTCA TCACGCCCGG GACCGCTATC
GACTGCTGCG GCAACGAGAT CATCGTGACC GACCGCTACC GGCTCGAGCT GGCGACCCTG
CCCGCACTCG CGGCACTGTC CGGACCTGGA GCCGATCCCG AGGCCGTGCA CGAAGTGCGG
CTGTGCCTGC GCTACCGCGA GTGCGACACC GATCCGGTGC CGGTGCTGTA CGACGACTGC
GGGGGCGACG ACGGCCAGAC CGCCCCGAAC CGGGTGCTCG AGTCCTGGGA CGTCGACGCC
GTCGTCCTCC CGCCCGGGCC GGACGAGCCT GCGCCGGACG AGCCTGCGCC GGACGAGCCT
GCGCCGGACG AGCCTGCGCC GGACGAGCCT GCGCCGGCGG AGGCTGGGCT GGAGGAGCCC
GGGCTGGACG AGCCCGGGCT GGACGAGCCC GGGCCCGAGC CCGAGCCCGA CGTTCCGGCG
ACGGGCGCCT GCACGGAGCA CTGGAACACG CTGCCCGGGT GCCCGATCTG CGAAGACTCG
GCGTGCTCGT GCGTCGTGCT GGCCACCATC CACGGGTACC GGCCCGGCTT CGTGGTCCTC
GACGCCGACG CCGAGGCGAC CGCCGAGGCC GACCTGGCGG CCCAGATCGC CCGCATCGAC
AACCACGCAG GCCGGAGCGT GCTGCGCAGC ACGCAGGTGA TCAGCGAGAC CGTCGAGTGC
CTGCTGGAGC ACGGCGGCAC GGGAGGGGAG ACCGGGCCTG CGGGGCCAGA GGGACCAGTG
GGGCCGGCGG GGCCAGAAGG GCCAGAAGGG CCAGAAGGGC CGGCGGGGCC AGAGGGGCCC
GCGGGGGAGA AGGGCGACAC GGGTGAACCC GGGCCGCAGG GTCCACCGGG GGCCGCCGGT
GCCGCCGGTG CAGACGGTGC GACGGGTTCC GCAGGGGTGC CCGGCCCACA AGGGCCCGCA
GGCCCGGGGC TGGAGGCGGG CCTGGTCCAG ATCGCCGCCC TGAGCTGGAC GCACGCCGAC
ATGATCCTGG TCCAGGACCT GGAGACGCTG GAGATCGACG GCCGGCGGCG CCGGGGCGTG
ACGATCCAGT TCACCGAGAC CGTGCACCTG GCCCCGGAGG GGTTCACGCT GCCTGACGTC
CAGCACGTGC TCACGGTCGA GGCGCCGCAC GTGCGGTACA CGTTCCCGCC ACGAGAGGCG
GAGGACCGGG AGGTGGCCGC GAAGCAGGCC GAGCTCGACG CCTTCTACCG GTGCCGCTGT
CACGTGCTCG GCCAGGTGGT GCTGACCGAG GTGCAGGCGA TCGACGCGAC CGGCCGGATC
ACGTTGGCGA CGGACAAGAC GAGCGAGCCC GACGCCCTGT CGTTCGTCTT CCATGAGCGC
TTCCTCGACG CCTTGTTCGG CGCACTGGGG GACCCGCGCG GTGTCGACCT GTGGGTCAAG
CTCCGCGGCG AGTTCGTGCT GGACGCGCGA GAACGCGCGG TGGACGCGGA GTTCGCGCGC
GCCGGTCTCC CCACGGGCGA CCGCCCGAAG GGCGAGAAGC ACGGCATCCA GGGCGGCACC
TTCGAGAGCT GGCTCTACCC GGTGCTCGAC TGA
 
Protein sequence
MSTTTSSPTR HGAGGAHPVG CGCGGGGQCT YGPFTRNSYW YGKLMLPQDF IDEQQYVRDK 
IRHHNQRLHG CGVACGLVVE QDTAPDCRDR IVVITPGTAI DCCGNEIIVT DRYRLELATL
PALAALSGPG ADPEAVHEVR LCLRYRECDT DPVPVLYDDC GGDDGQTAPN RVLESWDVDA
VVLPPGPDEP APDEPAPDEP APDEPAPDEP APAEAGLEEP GLDEPGLDEP GPEPEPDVPA
TGACTEHWNT LPGCPICEDS ACSCVVLATI HGYRPGFVVL DADAEATAEA DLAAQIARID
NHAGRSVLRS TQVISETVEC LLEHGGTGGE TGPAGPEGPV GPAGPEGPEG PEGPAGPEGP
AGEKGDTGEP GPQGPPGAAG AAGADGATGS AGVPGPQGPA GPGLEAGLVQ IAALSWTHAD
MILVQDLETL EIDGRRRRGV TIQFTETVHL APEGFTLPDV QHVLTVEAPH VRYTFPPREA
EDREVAAKQA ELDAFYRCRC HVLGQVVLTE VQAIDATGRI TLATDKTSEP DALSFVFHER
FLDALFGALG DPRGVDLWVK LRGEFVLDAR ERAVDAEFAR AGLPTGDRPK GEKHGIQGGT
FESWLYPVLD