Gene Tbd_1489 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTbd_1489 
Symbol 
ID3672902 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameThiobacillus denitrificans ATCC 25259 
KingdomBacteria 
Replicon accessionNC_007404 
Strand
Start bp1577025 
End bp1578020 
Gene Length996 bp 
Protein Length331 aa 
Translation table11 
GC content56% 
IMG OID637710174 
ProductAppr-1-p processing protein 
Protein accessionYP_315247 
Protein GI74317507 
COG category[R] General function prediction only 
COG ID[COG2110] Predicted phosphatase homologous to the C-terminal domain of histone macroH2A1 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.000625886 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGGATGCCA TCGTCAACAC GGTGAATTGT GTTGGCGTGA TGGGCAAAGG CATTGCGCTC 
CAGTTCAAGA ACAAGTGGCC GGCGAATTAC AGCGCCTACG AAGCGGCGTG CAAAGCCAAA
CAAGTTCGTC CGGGAACCAT GTTTGTCTTC GACTCCGGCG GCTTGGTCAA ACCCAACTAC
ATCATCAACT TCCCAACCAA GGACCATTGG CGCGGCAAGT CACGAGTTGA GTTCATTCGT
GATGGGCTGG TTGATCTGGT TGCACAAGTC AAACGGCTTG GCATTCGCTC CATTGCCATT
CCACCGCTTG GTTGCGGCAA CGGCGGGCTG GAATGGGATG AAGTGCGCCC GCTTATCGAA
CAGGCATTTG CCGTTCTGCC AGACGTGGAA GTGCGACTAT TCGCGCCTGC TGGCGCGCCT
GATCCGAAGA GCATGGAAGT GAAAACCTCA CGTCCAAAGA TGACGGCGGG TCGCGCAGCT
ATCCTCAAGG TACTCGATAC CTATCGTTCT CTGGAATACG GTTTGTCTCG TATCGAAGTG
CAGAAGCTTG CTTACTTCCT GCAAGAGGCT GGCGAGAGCC TCAGTCTGTC ATTCGTCAAG
AATCAATATG GACCCTATTC CGATCAACTG CGCCACGCGC TGAACCGGAT GGAGGGCCAC
TTCATACGTG GCCTTGGCGA TGGCGTTGTG GATGCTGAAA TCGAACCGCT GGAAGACGCA
CTTGCCGAGG CAGAGCAGTA CGTCAGTGCA AGCGGGCACG CTGCTTTGGC CCGGCACGTT
GAGCGTGTCG CGAATCTGAT CGAGGGTTTT CAGACGCCAT ACGGAATGGA ATTACTGGCT
ACGGTTCATT GGGTTGCAAC CCATGAACCG ACCGCGCATT CATTCGATCA GGTAGTCAGC
GCCGTACATG CCTGGAATGA GCGCAAGGCA AGGATCATGC AGCCTGCGCA TGTGCGGGCT
GCATGGAGTC GGCTAGACGC TCAAGGGTGG CTATAA
 
Protein sequence
MDAIVNTVNC VGVMGKGIAL QFKNKWPANY SAYEAACKAK QVRPGTMFVF DSGGLVKPNY 
IINFPTKDHW RGKSRVEFIR DGLVDLVAQV KRLGIRSIAI PPLGCGNGGL EWDEVRPLIE
QAFAVLPDVE VRLFAPAGAP DPKSMEVKTS RPKMTAGRAA ILKVLDTYRS LEYGLSRIEV
QKLAYFLQEA GESLSLSFVK NQYGPYSDQL RHALNRMEGH FIRGLGDGVV DAEIEPLEDA
LAEAEQYVSA SGHAALARHV ERVANLIEGF QTPYGMELLA TVHWVATHEP TAHSFDQVVS
AVHAWNERKA RIMQPAHVRA AWSRLDAQGW L