Gene Tpau_1939 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTpau_1939 
Symbol 
ID9156094 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameTsukamurella paurometabola DSM 20162 
KingdomBacteria 
Replicon accessionNC_014158 
Strand
Start bp2023155 
End bp2024402 
Gene Length1248 bp 
Protein Length415 aa 
Translation table11 
GC content69% 
IMG OID 
Productglycosyl transferase family 2 
Protein accessionYP_003646891 
Protein GI296139648 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.314426 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACAACCG AGACCGCACC CGACGGTACG CGCTTGGCCG CGGCCGAATC CCGACATGCG 
GCATCGCCCG CCGCCCCCGT CCTCGACATC GTGATCCCCG TGTACAACGA GGCGCACACC
ATCGCCCACT GCGTGGAGAC CTTGCACGCC TACCTCACCG ACACCCTGCG GGTCCCTGCG
CGCATCACCA TCGCCGACAA TGCGAGCACC GACGAAACCC TGCGCGTGTC CCACTCCCTG
GCGAGCGCCA TCGACGGAGT CCGCGTGGTG CACCTGGACG CGAAAGGCCG TGGCCGGGCG
CTGCGCCGAG TGTGGTCGGA GTCCGATGCG CAGGTGCTCG TGTACATGGA CGTCGACCTG
TCCACTGACC TCAACGCCCT GCTCCCGTTG GTCGCTCCCC TCATCTCCGG ACACAGCGAC
CTCGCGATCG GCACCCGGCT GGGCCGTGGT GCCCGAGTGC GACGGGGCCC CAAGCGGGAA
TTCATCTCCC GCGGCTACAA CGTGCTGTTG CACACCGCGC TGCGCGTGCG CTTCTCCGAC
GCCCAGTGCG GATTCAAGGC GATCCGCACC GACGTCGCGC GGGAGTTGCT ACCCCTGGTG
GAGGACGGTG AATGGTTCTT CGACACCGAA CTACTGGTGC TGGCGGAGCG CGCCGGACTG
CGCATCCACG AGGTCCCGGT CGATTGGACC GACGATCCGG ACAGCCGGGT CGACATCGTC
GATACCGTGG CCAAAGATCT GCGAGGCATG GCCCGTGTGG GTCGGGCGCT GGCGGCCGGG
CGGCTGCCAC TCGACGACGT ACGTCGTGCG GTCGGACGCG ACGAACCCCG GATCGCCGGC
GTGCCGCACG GGATGATCGG TCAGCTCGCC CGGTTCGCGG TCGTCGGTTT GGCGAGCACG
GTGGCCTACG CAGTGCTGTA TCTGGCGCTG CACTCGGCGA TCGGCGCACA GGCCGCGAAC
TTCGCAGCGC TCCTCATCAC CGCCGTGGGC AACATCGCCG CGAACCGCGC ATTCACCTTC
GGTGTGCGAG GCCGGCGCGG CGCGATGCGG CACCACACGC AGGGCCTGGT GGTCTTCCTC
GTCACGTGGG CACTCACCGC CGGAAGTCTG GCACTGCTGG CGACGGCGGC GCCCGCAGCA
TCCCGGGAGC TGCAGCTCGC GGTGCTGGTG ATCGCGAATC TGGTGGCGAC GGTGCTGCGC
TTCGTGGGCA TGCGGCTGAT CTTCCGGTCC CCCGGGGCCG CCCCGTGA
 
Protein sequence
MTTETAPDGT RLAAAESRHA ASPAAPVLDI VIPVYNEAHT IAHCVETLHA YLTDTLRVPA 
RITIADNAST DETLRVSHSL ASAIDGVRVV HLDAKGRGRA LRRVWSESDA QVLVYMDVDL
STDLNALLPL VAPLISGHSD LAIGTRLGRG ARVRRGPKRE FISRGYNVLL HTALRVRFSD
AQCGFKAIRT DVARELLPLV EDGEWFFDTE LLVLAERAGL RIHEVPVDWT DDPDSRVDIV
DTVAKDLRGM ARVGRALAAG RLPLDDVRRA VGRDEPRIAG VPHGMIGQLA RFAVVGLAST
VAYAVLYLAL HSAIGAQAAN FAALLITAVG NIAANRAFTF GVRGRRGAMR HHTQGLVVFL
VTWALTAGSL ALLATAAPAA SRELQLAVLV IANLVATVLR FVGMRLIFRS PGAAP