Gene Tcr_0020 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTcr_0020 
Symbol 
ID3760385 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameThiomicrospira crunogena XCL-2 
KingdomBacteria 
Replicon accessionNC_007520 
Strand
Start bp23428 
End bp24573 
Gene Length1146 bp 
Protein Length381 aa 
Translation table11 
GC content44% 
IMG OID637784726 
Productpolysaccharide deacetylase 
Protein accessionYP_390291 
Protein GI78484366 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG0726] Predicted xylanase/chitin deacetylase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones36 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTCAGCAG CACGGAAACT ACGTTCAATC GTTACGAGTA AACCCATCCT ATTTTTAAGT 
GGCATCGTCA CTCTGGTTGC AATCATTTCC GCATCAATCT TTTTAACGTC TGGCGCAACC
CCTCCGCCAA AAAAAGCGGT TTCAAGCCAA TCGTCTGTTG AAAACTCAGA CAGCGCGGTG
ATTTTAATTT ACCACCATTT TGGGAAAGAT GAATATCCCA GCACCAATAT TCGCTTAGCG
CAACTGGATG CTCAACTCAA CTACCTTGAA CAAAACCATT TTACTGTCTG GTCGTTATCC
CAATTAGTCA ACACATTAAA AAGCCGAGCA CCTATTCCAA ATAAAACCGT GGTTTTTACC
ATTGATGATG CTTGGTCCAG CGTTTATACA GAAGCCTTCC CACGGTTTAA AAAACGAGGC
TGGCCCATGA CGATTTTTGT AAATACCGAT GCGATCGATA AAGGTTACCA ATCGAATATG
ACTTGGGAAC AAATGCGAGA AATGCAGCAA TATGGTGCGG AATTCGCCAA TCATGCTAAA
ACGCATCAAA AATTGGTGCG ACAGCCAGAT GAATCTCATG AGGCTTGGCA GACGCGGGTC
ACACAGGAAA TTAAGGTGGC GCAACAACGC TTAAAGTCGG AACTTGGAGA AAACACCAAT
CAAACCAAAT TGTTGTCTTA TCCTTACGGC GAATACTCTG AAGCCTTAGC CAACCTTGTT
CAAAAAATGG GCTATGTTGG CATTGCTCAA AACTCGGGCG CTGTTGGATA TCAATCTGAT
CTAAGAGCCC TCATGCGCTT TCCAATGAGT GAAGTTTATG CCGACATGGA CGCCTTCAAA
TTAAAGGTCA ATACCCATGT TTTTCCGGTC AAAAAAATCA CGCCTTTTGA TCCGGTCATC
ACTGAAAACC CTCCTAAACT GATTTTAGAG TTCACCAGCC CTCCTCAGCG CAACATTCAA
TGTTTTAACC AGCATGGCGA GCCTTTGTTG CTCGATTGGG CCAGCGAAAC CAAATTAGAA
ATCACCAGTG ATTCCCCACT GGAGCCCCCT CGAAGCCGTT ATGCCTGTAC CCAAATGATG
CCCAATGGCG ATTGGCGCTG GATAAGCCAT AGTTGGGTTA TTTCCCATAC AAACAACATG
GATTAA
 
Protein sequence
MSAARKLRSI VTSKPILFLS GIVTLVAIIS ASIFLTSGAT PPPKKAVSSQ SSVENSDSAV 
ILIYHHFGKD EYPSTNIRLA QLDAQLNYLE QNHFTVWSLS QLVNTLKSRA PIPNKTVVFT
IDDAWSSVYT EAFPRFKKRG WPMTIFVNTD AIDKGYQSNM TWEQMREMQQ YGAEFANHAK
THQKLVRQPD ESHEAWQTRV TQEIKVAQQR LKSELGENTN QTKLLSYPYG EYSEALANLV
QKMGYVGIAQ NSGAVGYQSD LRALMRFPMS EVYADMDAFK LKVNTHVFPV KKITPFDPVI
TENPPKLILE FTSPPQRNIQ CFNQHGEPLL LDWASETKLE ITSDSPLEPP RSRYACTQMM
PNGDWRWISH SWVISHTNNM D