Gene Tbd_1868 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTbd_1868 
Symbol 
ID3673978 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameThiobacillus denitrificans ATCC 25259 
KingdomBacteria 
Replicon accessionNC_007404 
Strand
Start bp1963808 
End bp1964938 
Gene Length1131 bp 
Protein Length376 aa 
Translation table11 
GC content67% 
IMG OID637710567 
Productputative glycosyltransferase 
Protein accessionYP_315626 
Protein GI74317886 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0438] Glycosyltransferase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones25 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCTGAAG AAAAAAAACT GACCGTACTG CAACTGCTTC CCGCGCTCGA ATCCGGGGGC 
GTCGAGCGCG GGACCGTCGA AATCGCGCAG GCGCTGGTCG AACACGGCCA CCGCGCGCTC
GTCATGTCCG CCGGTGGGCG CCTCGTCGCC CCGCTGACGC AGGCGGGTGC TCTGCATTTC
ACCTGGCCGA TCGGCGTCAA GTCGGTGCGA ACGCTCGCGC TCGTCTCGCG CCTGAGAAAA
TTCCTGAGCG AACAAAAGGT CGACGTCGTC CACGCGCGCT CGCGCGTCCC GGCCTGGATC
GCTTGGCTCG CCTGGCGCCG CATGGACCCG TCGACGCGGC CGCGCTTCGT CACGACCGTA
CACGGTCTCT ACGGCGTCAA CCGCTACAGC GCGATCATGG CGCGCGGCGA GCGCGTGATC
GCAGTCTCCA ACACGGTGCG CGACTACATC CTGCGCGAAT ATCCCAAGAC CCTGCCGTGG
CGCGTCGACG TCATCCACCG CGGCGTCGAC GGCGCGCTCT ATCCCCATGG CTGGAAACCC
GATGCCGGCT GGCACGCTGC ATTCTTCGGT CAGTTCCCGA ATGCGGCGGG CAAGCTGCTG
CTCACCCTGC CCGGTCGCAT CACGCGCCTC AAGGGACATG AGTCCTTCAT CGAACTCGTC
GCCCGGCTGA AGCGCCGCGG ACTGCCCGTG CATGGCCTGA TCGTCGGCGG CGCGGCAGCG
TCCAAGCAGC GTTATTTGCA GAAGTTGCGC TACCGCGTGC GCAGCATGGG GCTCGAAGCC
GACATCAGTT TCACGGGCCA GCGCGACGAC CTGAAAAACA TCCTTGCTAT GTCGAACCTC
GTGCTCTCTC TGTCGACCCA GCCCGAATCC TTCGGCCGCA CGACGCTCGA AGCGCTGCGG
CTCGGCGTGC CGACCGCGGG CTTCGATCAC GGCGGCGTAG GGGAAATCCT GCGTACGGTC
TATCCGGCCG GCTTGCTGCC GATGGACCGC ATCGACGAGG CCTGCCAGCG CATCGCGCAC
CTGCTGCAGG AGCCCGACGC GGTGCCTGAG GGCGACTTCT TTCCGTTGAA GGCGATGATC
GAGCGCACGC TCGCGCTCTA CGAACAGCTC GCGCGGGCGC CGCGGCGTTA G
 
Protein sequence
MSEEKKLTVL QLLPALESGG VERGTVEIAQ ALVEHGHRAL VMSAGGRLVA PLTQAGALHF 
TWPIGVKSVR TLALVSRLRK FLSEQKVDVV HARSRVPAWI AWLAWRRMDP STRPRFVTTV
HGLYGVNRYS AIMARGERVI AVSNTVRDYI LREYPKTLPW RVDVIHRGVD GALYPHGWKP
DAGWHAAFFG QFPNAAGKLL LTLPGRITRL KGHESFIELV ARLKRRGLPV HGLIVGGAAA
SKQRYLQKLR YRVRSMGLEA DISFTGQRDD LKNILAMSNL VLSLSTQPES FGRTTLEALR
LGVPTAGFDH GGVGEILRTV YPAGLLPMDR IDEACQRIAH LLQEPDAVPE GDFFPLKAMI
ERTLALYEQL ARAPRR