Gene Xcel_0038 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagXcel_0038 
Symbol 
ID8647533 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameXylanimonas cellulosilytica DSM 15894 
KingdomBacteria 
Replicon accessionNC_013530 
Strand
Start bp41725 
End bp42936 
Gene Length1212 bp 
Protein Length403 aa 
Translation table11 
GC content75% 
IMG OID 
Productproteinase inhibitor I4 serpin 
Protein accessionYP_003324637 
Protein GI269954848 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones27 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCTGGTCG CCGACGGGGT CGAGGCCCGT CAGGTCGCGG TGAGCGACGC CGGTGCCGCC 
GCCGACGTCG TCGGTGCCAC CCGCCGCATC GGTGCCGCCC TGCTGGTGGC GTCACCCGCC
GATGCGAACG CCGTCGTCTC GCCGTCGAGC GTGGCCGTCG CCCTGTCGAT GCTGGCGGAC
GGCGCCCGGG GCGGCACCCT CGCCGAGCTC GACCAGGTGC TCGGCGCCAC GGGGGAGGAC
CGGCGCGACG CCGTCGCCGC CCTGCGCGGC ACGCTCCTTC GCCACGACGG CGACCCCGCC
GTCGTCCGCG ACGAGGAGCT GCCCGACGAT CCGGTGGTCC ACCTCGCCGC GCAGGTGGTC
GTGGACGACC AGCTCACGCC CGACGACGCC TACCTGACCA CGCTCGCCGA CGTGTACGGC
GCGGGCGTGC AGCGCGTCGA CCTGGGCTCG GACGACGGCA AGTCCGCTCT CGACGCCTGG
GTGCAGCACC ACTCGGGCGG GTTGGTCGAG GAGTCGGCGA TCACACCGAA AGACTCGCTG
CGCCTCGTGC TGCAGGACGC GGTGGTGCTC GCGGCTCGCT GGTACACCCC GTTCCCCGGG
CACGCCACGG GCGACCGGCC GTTCACGACC GCGGACGGCA CCGAGGTGTC GGTGCCCACC
ATGAGCGGCG AGGCGCCCCG GGCGTACGCC GAGGTGGACG GCTGGCGCGC GGCCCGGCTG
CCGTACACCG GGCATGAGCT GCACGCCGAC GTGATCCTGC CGCCCGACGG CGTCAACCCG
GCCGCCGCGC ACCCGGAGCT GCTGGCCGCG CTGGCGGCCG CCCTCGACGA CGCCGAGGAT
CAGCCGGTCC GGGTCACGCT GCCCGTCCTC GATCTGCGTC CCGACCCCCT CGACCTGCGC
GACGCGATCG CCACCCTCGG GGCGCCGACC GTGCTCGATC CGGGTGCGGC GGATCTCACC
GGGATCGGCA CCGACGACGC CGGCGAGAGG CTGTACCTGG GCCAGGCGAT GCAGCAGGCC
GTGCTGCAGC TCGACGAGGA GGGCACCCGG GCGGCCGCCG TGACCGAGCT GGGAGCCGAG
GCCGGCTCCG CCCCCGTGGA CCGCCCCGTC GAGCTGACCC TCGACCGCCC GTTCCTCATA
GAGATCGCCC ACACGTCGAC GTCCTGGCCG CTCTTCCAGG CCGCGATCCG CGACCCCCGC
CCCGGTGGTT GA
 
Protein sequence
MLVADGVEAR QVAVSDAGAA ADVVGATRRI GAALLVASPA DANAVVSPSS VAVALSMLAD 
GARGGTLAEL DQVLGATGED RRDAVAALRG TLLRHDGDPA VVRDEELPDD PVVHLAAQVV
VDDQLTPDDA YLTTLADVYG AGVQRVDLGS DDGKSALDAW VQHHSGGLVE ESAITPKDSL
RLVLQDAVVL AARWYTPFPG HATGDRPFTT ADGTEVSVPT MSGEAPRAYA EVDGWRAARL
PYTGHELHAD VILPPDGVNP AAAHPELLAA LAAALDDAED QPVRVTLPVL DLRPDPLDLR
DAIATLGAPT VLDPGAADLT GIGTDDAGER LYLGQAMQQA VLQLDEEGTR AAAVTELGAE
AGSAPVDRPV ELTLDRPFLI EIAHTSTSWP LFQAAIRDPR PGG