Gene Tpau_2681 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTpau_2681 
Symbol 
ID9156842 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameTsukamurella paurometabola DSM 20162 
KingdomBacteria 
Replicon accessionNC_014158 
Strand
Start bp2787143 
End bp2788273 
Gene Length1131 bp 
Protein Length376 aa 
Translation table11 
GC content71% 
IMG OID 
Productglycosyl transferase group 1 
Protein accessionYP_003647620 
Protein GI296140377 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCCGCGCA CGCTGCTGGT CACCAATGAC TTCCCGCCCC GCCCCGGCGG CATCCAGACG 
TACCTACAGG CCTTCGTCTC GCAGCTGCCG GCCGATGAGC TCGTCGTCTA CGCCTCGCGC
TGGAAGGGCA GCGAGGAGTA CGACGCAGCG CAGCCCTACG AGGTAGTGCG GCATCCCACC
TCGCTGCTCG TGCCGAGCCC CGATGCGCTC GGCCGCGCCC GGGATCTGGT GCGCAGCCAT
GACATCGAGA CGGTGTGGTT CGGTGCCGCC GCTCCGCTCG CGCTCCTCGC CGCACCGTTG
GAGGCGGCCG GTGCGCAGCA CAGCGTGGCG TGCACCCACG GCCACGAGGT GGGCTGGTCG
ATGCTGCCGC CCTCGCGGGC ATGTCTGCGC CGGATCGGCG ACACCACCGA CGTGATCACG
TACGTCAGCA AGTACACGCG GGGCCGGTTC GCCGCCGCCT TCGGCGCGCA GGCCGCCTTG
GAACACCTGC CGCCCGGCGT CGACACCGAC GTCTTCAAGC CCGACGCAGC CGCCCGCTCG
GAGCTGCGTG CCCGGTACGG CCTGGGCGAC GACGAGCCCG TGGTGCTGTG TCTCTCGCGC
CTGGTGCCGC GGAAGGGACA GGACATGTTG ATCCGGGCGC TGCCGAAGAT CCGGGCGCAG
GTGCCCGGCG CGAAACTGGT GATCGTCGGT GGCGGTCCCT ACTCCCAGAC TCTGCACAAG
CTGGTTCGCA GCACCGATGT CGAGGAGGCC GTGATCTTTA CCGGCGGCGT CTCCGCCGGT
GAGCTGGCGG CGCACCACAA TCTCGGGGAC GTCTTCGCCA TGCCCTGCCG TACCCGCGGT
GCCGGGCTCG ACGTCGAGGG TCTCGGCATC GTCTTCCTGG AGGCCTCCGC AACCGGAAAG
CCGGTCGTGG CGGGCGATTC CGGCGGCGCG CCGGAGACAG TGTGGGAGGG CGAGTCGGGT
CACGTCGTGC CAGGACGTGA TGTCGACGCG ATCGCAGATG CGGTCGCGGG CCTGCTCGCC
GATCCTGATC GCGCAGCCGC CTTCGGCGCC CGCGGCCGGG AATTGGTGGG GGAGCACTAC
AACTGGCGCC GCCTCGGCCA CCGCCTGCAG ACCCTGCTGA GCCCGTACTA G
 
Protein sequence
MPRTLLVTND FPPRPGGIQT YLQAFVSQLP ADELVVYASR WKGSEEYDAA QPYEVVRHPT 
SLLVPSPDAL GRARDLVRSH DIETVWFGAA APLALLAAPL EAAGAQHSVA CTHGHEVGWS
MLPPSRACLR RIGDTTDVIT YVSKYTRGRF AAAFGAQAAL EHLPPGVDTD VFKPDAAARS
ELRARYGLGD DEPVVLCLSR LVPRKGQDML IRALPKIRAQ VPGAKLVIVG GGPYSQTLHK
LVRSTDVEEA VIFTGGVSAG ELAAHHNLGD VFAMPCRTRG AGLDVEGLGI VFLEASATGK
PVVAGDSGGA PETVWEGESG HVVPGRDVDA IADAVAGLLA DPDRAAAFGA RGRELVGEHY
NWRRLGHRLQ TLLSPY