Gene Tbd_1923 details


Gene Information

Locus tag: Tbd_1923
Symbol:
ID: 3674033
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Thiobacillus denitrificans ATCC 25259
Kingdom: Bacteria
Replicon accession: NC_007404
Strand:
Start bp: 2020198
End bp: 2021358
Gene Length: 1161 bp
Protein Length: 386 aa
Translation table: 11
GC content: 66%
IMG OID: 637710622
Product: putative transporter
Protein accession: YP_315681
Protein GI: 74317941
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG2814] Arabinose efflux permease
TIGRFAM ID:
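
The length fields above can be cross-checked against the coordinates. The following is a minimal sketch, assuming that Start bp and End bp are 1-based and inclusive and that the last codon of the CDS is a stop codon not counted in Protein Length. The function names are illustrative and not part of the source database.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length for 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for a CDS whose final codon is a stop codon (assumption)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length_bp(2020198, 2021358)
print(length, protein_length_aa(length))  # 1161 386
```

Both results agree with the Gene Length and Protein Length values listed in the record.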


Plasmid Coverage information

Num covering plasmid clones: 32
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 14
Fosmid unclonability p-value: 0.598404
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAATGCCG TCCCCTACTG GCGCCTGTCC GGTTTCTACT TCTTCTACTT CGCTTTCGTC 
GGCGCGATGT CGCCGTTCTG GGGCCTCTAC CTGAAGTCGC TCGCGTTCGA CGCGGTGCAG
ATCGGCGTGC TGATGTCGCT GTTGCAGGTG ATGCGGATTT TCGCGCCGAA CATCTGGGGC
CACGTTGCCG ACCGCCTCGG CCGGCGCACG GCAATCGTGC AGGTCGCAGC GCTGGCCAGC
GTCGTCGTGT TCGCGGGCGT CTTCGTCGAC GACGGCTTCT GGTGGCTGTT CGCCGTCATG
GCGGGCTTGA GCTTTTTCTG GAGCGCTTCG CTGCCACTGG TCGAGGCGAT GACGCTCTCC
CATCTCGGCG AGCGCGCCTC GGCCTACGGC CGCATCCGCC TCTGGGGCTC GGTCGGCTTC
ATCCTGATGG TCGTCGGCCT GGGCTACGCC TTCGACCACG TCTCGATCGC CTGGCTGCCG
TGGGCGGTTC TCGTCGTGAT GCTCGGGATA CTGGCGTGCG CGCGCGTGAT TCCCGAGGCC
GTGATACCGC TGCACTCCCC TGACCATCGT TCGGTATGGG ACATCGTCAG GCGGCCGGAA
GTCGCCTCGC TGCTCGCCGG CTGCATGCTC ATGTCGGTGA CCCACGGCCC GTATTACACG
TTTTATTCGA TCTATCTGGT CGACCACGGC TACGACAAGT CGACGGTCGG CTGGCTCTGG
GCGCTCGGCG TGGCCTGCGA GATCGGCATT TTTCTGCTCG TGCCGCGCAT CTTCGCGCGC
ATGGCGCCGC CGCGTCTTCT CCTGTTGAGC TTCGCGCTCG CGGTGCTGCG TTTCCTGCTG
ATTGCCTGGG GTGTCGAATC GGCCTGGCTC GTGTGGGGTG CGCAAACCCT GCACGCCTTC
ACCTTCGGCA CCTATCATGC GGCCGCGGTC GCGCTGATCC ACCTGCATTT CCGCGGGCGT
TACCAAGCGC GCGGCCAGGC CTTGTACACG AGCCTGTCGT ACGGCGTCGG CGGCACGATC
GGCGGGCTCG CGAGCGGACT GACCTGGGAC GCGATCGGCT CGGCCTGGAC CTTCACGCTC
GCCGCCGCGA GCGCGGCCCT TGCCTGGCTC ATTTACGCGC TGTGGGGACA AACAAAACCC
GTGCCAGCGC AGGCGCGATA A
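
The GC content reported for this gene can in principle be recomputed from the sequence as displayed. The sketch below assumes the sequence is pasted with its blocks of ten bases and line breaks intact; the helper names `clean_sequence` and `gc_fraction` are hypothetical. Only the first displayed line is used here for brevity, so the printed fraction applies to that line, not to the whole gene.

```python
def clean_sequence(text: str) -> str:
    """Drop spaces and line breaks from a block-formatted sequence and uppercase it."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First line of the gene sequence as displayed (six blocks of ten bases).
first_line = "ATGAATGCCG TCCCCTACTG GCGCCTGTCC GGTTTCTACT TCTTCTACTT CGCTTTCGTC"
seq = clean_sequence(first_line)
print(len(seq), f"{gc_fraction(seq):.1%}")
```

Passing all lines of the gene sequence to `clean_sequence` gives the input needed to compare against the GC content field.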
 
Protein sequence
MNAVPYWRLS GFYFFYFAFV GAMSPFWGLY LKSLAFDAVQ IGVLMSLLQV MRIFAPNIWG 
HVADRLGRRT AIVQVAALAS VVVFAGVFVD DGFWWLFAVM AGLSFFWSAS LPLVEAMTLS
HLGERASAYG RIRLWGSVGF ILMVVGLGYA FDHVSIAWLP WAVLVVMLGI LACARVIPEA
VIPLHSPDHR SVWDIVRRPE VASLLAGCML MSVTHGPYYT FYSIYLVDHG YDKSTVGWLW
ALGVACEIGI FLLVPRIFAR MAPPRLLLLS FALAVLRFLL IAWGVESAWL VWGAQTLHAF
TFGTYHAAAV ALIHLHFRGR YQARGQALYT SLSYGVGGTI GGLASGLTWD AIGSAWTFTL
AAASAALAWL IYALWGQTKP VPAQAR
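
The protein sequence can be reproduced from the gene sequence by codon-wise translation. The sketch below is an illustration, not the database's own method. It relies on the fact that NCBI translation table 11 assigns the same amino acids to codons as the standard code and differs only in the permitted start codons. The names `CODON_TABLE` and `translate` are hypothetical, and the function assumes the input contains only A, C, G and T.

```python
BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order (TTT, TTC, TTA, TTG, TCT, ...).
# Table 11 uses the same codon-to-amino-acid assignments as the standard code.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(cds: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # remove block spacing and line breaks
    protein = []
    for pos in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE[cds[pos:pos + 3]]
        if aa == "*":
            break
        protein.append(aa)
    # Note: an alternative start codon (e.g. GTG or TTG) would need to be read
    # as Met at position 1; that case is not handled because this gene starts with ATG.
    return "".join(protein)


# First 30 bases of the gene sequence.
print(translate("ATGAATGCCG TCCCCTACTG GCGCCTGTCC"))  # MNAVPYWRLS
# Last displayed line of the gene sequence (internal codons followed by the stop codon).
print(translate("GTGCCAGCGC AGGCGCGATA A"))  # VPAQAR
```

The two outputs match the first ten and last six residues of the protein sequence shown above.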