Gene Tpau_0550 details

Gene Information

Locus tag: Tpau_0550
Symbol:
ID: 9154686
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Tsukamurella paurometabola DSM 20162
Kingdom: Bacteria
Replicon accession: NC_014158
Strand:
Start bp: 577465
End bp: 580653
Gene Length: 3189 bp
Protein Length: 1062 aa
Translation table: 11
GC content: 70%
IMG OID:
Product: tRNA-guanine transglycosylase, various specificities
Protein accession: YP_003645529
Protein GI: 296138286
COG category:
COG ID:
TIGRFAM ID:
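
The coordinate and length fields above are related by simple arithmetic. The following is a minimal sketch of that relationship, assuming the coordinates are 1-based and inclusive and that the gene length includes the stop codon; both assumptions are consistent with the values in this record but are not stated by it. The function names are hypothetical.

```python
def gene_length(start_bp, end_bp):
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_length_bp):
    """Number of amino acids, assuming the final codon is a stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("gene length is not a multiple of 3")
    return gene_length_bp // 3 - 1


length = gene_length(577465, 580653)
print(length)                  # 3189
print(protein_length(length))  # 1062
```

Both results agree with the Gene Length and Protein Length fields.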


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a

Sequence

Gene sequence
ATGACGGCGA CGACGCTGGA GAACCAATCA CCCGAGCAGG CCGCCGGCGT GCCGAGCGGA 
CCACTGACGC ATAAGCAGAT CCTCTACATC CTCGCCGGAC TCATGACCGG CATGTTCCTG
GCAGCACTGG ACCAGACCAT CGTCTCGACG GCGATCCGGA CCATCGCCGA CGATCTCAGC
GGCTACCAGT TGCAGGTCTG GGTGACCACG GCGTACCTGA TCACGTCGAC CATCGTGACG
CCGCTCTACG GCAAGCTCTC CGACATCTAC GGACGCAAGC CGTTCTTCAT GTTCGCGATC
ACGGTCTTCG TGCTCGGGTC ACTGCTGTGC AGCTTCTCCA CATCGATGTA CGAGCTGGCC
GCGTTCCGCG CCGTCCAGGG ACTCGGCGCC GGCGGCTTGA TGTCGTTGGC GCTCACCATC
GTCGGCGATA TCGTGCCGCC GCGCGAACGC GCCCGCTACC AGGGCTACTT CCTCGCGGTG
TTCGGCACCA GCTCGGTGCT CGGCCCCGTT CTGGGCGGCG TCTTCTCCGG CGCCGACAGC
ATCCTGGGTA TCACCGGCTG GCGCTGGGTG TTCCTGGTGA ACGTGCCGAT CGGCGTGATT
GCGCTGCTCG TGGTGTATCG CGTCCTTCAG CTGCCGCACA CCCGCCGCGA GGGGGTGCGG
ATCGACTGGG TGGGCGCCTT CATGCTCGCC GTCGGCATCG TGCCGTTGCT GATCGTGGCC
GAGCAGGGCC GCGAATGGGG TTGGGGCTCG ACCAAGTCCA TCACCTGCTA CGCCGTCGGG
GCAGGTGGCG TGCTCGCGTT CATCATCATC GAGAAGATGA TGGGCGAGGA CGCGATCATC
CCGCTGCGCA TCTTCAGGAA CCGGATCTTC GCGCAGGGCG TGCTGATCTC GGTGGTGGTG
GGCGCTGCGA TGTTCGGCGG GATCTCGCTA CTGCCGCAGT ACTTCCAGGT GGTGCGCGGT
GCCAGCCCGA CGGTGGCCGG TTTCATGATG CTGCCCATGG TGCTGGGACT GATGATGGGC
TCGATCCTGG CCGGCCAGAT GATCTCCCGC ACGGGGCGCT ACCGGATCTA CCCGATCATC
GGAGCCGTGC TGCTCACCGT GGGCATGTGG GCGCTGCACT ACGTGACCGC CGATGTGCGC
CTCGCGGCCG TGATGGCGGG CGCCGCACTC ATCGGCTTCG GTCTCGGCAA TCTGATGCAG
CCGTTGACCC TGGCGATGCA GAACATCCTT CCGCCCCAGG ACATGGGCGT CTCCACCGCC
GCCGCCACCT TCTTCCGCCA GATCGGCGGC ACACTCGGCG TCGCCGTCTT CCTCTCCGTG
CTCTTCTCGG AGATGCCGAA GAACATGCAG TCGCGACTGA CCGATGCCGC GGCTGATCCC
GACTACACGG CAGCACTCCA GCGCGCCGCC GCGGGCCACG ACGGCCAGGC CGCGCAGGAG
TTCGCGAACA AGCTCGCCGC GCGCGATTCG AGTGCCGTGA GCGGGATCAT GTCCGATACC
TCGGCCCTGC AGAAGCTCCC GTCGACCCTG GTTCACCCGT TCAAGGCCGG CTTCGCCGAC
TCGATGGACA CCGTCTTCCT GGCGGTCACG GTGCTCTCCG CGATCGGTCT GCTCCTGGTG
CTGTTCTGGA AGTCGGTGCC GCTGCGTACC GCACCCGCCA TGGAGACGAT TGAGGAGGAG
GTCGTCGGCG CCGTGCTCGC CGGTGGCCCC GTTGCCGAGC CCGAACCCGC CGCCGCGGCG
GATACGGCGC CCGTCTCCCC GGCTCCCGTC GGCGCCCACC CACCCGGGAC GACCGGTGTC
GGCACCGCAT CGCGCGCGGT CGCGAGGCAG ACACGGGCGC CCGGGCCGCG TACCGTCGAA
CCCGTGCCGC CCGACGACGA CCGCACCGCA CCCGTCCGTG TCGATCCGGC GCTCGCCGAG
ATCGCCGAAC TGCGCAGCCA GAACGCGGAG CTCGCCGCGG AGCTCGCCGA ACTGCGCACC
GGATACCGCG CGATCGCCGG CCACACCCGG CTGCGTGCCG TCGGCGCCTC CGATCCCACC
GGATTCCGGA TCGAGGTGCA GGGCCCGGGC GCCGCGCGCA CCGGCACGTT GAGCACCCCG
CACGGCGAGA TCGCCACGCC CGCCTACCTG CCCGTCGCCG CTCGTGGCGC CGTTGCCGGC
ATCACCCCCG AGATGCTCGC CGCGCTCGGT GTGCAGGCCG TCACCGTCGA TCTGCACGAG
CTGTACCTGC AACCCGGCAC CGCGATGATC GAGGGTGCCG GCGGGATCGG TCGGGCCATG
GCGTGGGCCG GTCCGGTCCT CGGCGACACC GGCATCACCG CCGTCGCCGC CGCGAAGAAG
ACCCGGGTCA CCGACGAGGG CATCGCCTTC CGTAGCCGCC TCGACGGCGG TGCGCATCGC
TGGGACCCCG AGGAAGCGGT GCGCGTCGCG CACCGGATCG GCGTCGATAT CGCGTTCGCC
ATGGCCGATC CGGTTGCCGC CACCGCACCC CGTGCCCAGC TGGAGGCCGG CGTCAACCGC
AGCCAGCAGT GGGCGGCCCG TGCGCTGGTC GAACACTCGT GGCAGACCGC CGACCGCGGT
GACCGGCAGT CCCTGTGGGC CGTCGTCACC GGTGGCGCCG ATGCCGGGCT GCGCCGCGCC
GCGGCGCGAG GCCTGCGTCG CCTGTCCGAA CAGGATGTGC AGTCGGGCGG GCTCGGGTTC
GGCGGCTACC GCATCGATGG AGTGCGGTCG GCGGCGGCGG AAGCTCTGCA ATTGCGTGCT
GCCACCGAGG AACTCGACCG GGAACGGCCA CGACACCTCG CCACCGTCGC CACCCCCGCG
GACCTGTTCG CAGCCATCGA GGCCGGTATC GACCTGTTCG ACGGAACCGC TGCGGCGGAG
GCGGGCCGGG AAGGTCGCGT CTTCACCCGT GACGGTGTCC TGGACCTCAC GGACCCCCAG
CTGCGTGCCG ACTTCCGGCC GATCGACCCC GGCGCAGATC GTCGCGGCCC CGCCAACCCG
GCCGACGAGT TCACTCGGGC GTACGTCAAT CATCTGTTCG CCGCGAACGA GGGCCTGGCA
GTCACGCTGT GCACCCTGCA CAACGAGCAC TTCTTCGCGA CCCTCGCGGC GGACGCCCGG
CGTGCACTGA CGGACGGCCG GTACCCGGCC TTCACCGAAT CCTTCCTGCG GCGCTTCGCT
AAGGTCTAA
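
The gene sequence is displayed in blocks of 10 bases separated by spaces, so it needs to be joined before any computation. The sketch below shows one way to do that and to compute a GC fraction that can be compared with the GC content field. The helper names `clean_sequence` and `gc_content` are hypothetical, and the example runs on the first block only rather than the full sequence.

```python
def clean_sequence(text):
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_content(text):
    """Fraction of G and C bases in a sequence (0.0 to 1.0)."""
    seq = clean_sequence(text)
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# First 10-base block of the gene sequence, split to mimic the page layout.
print(clean_sequence("ATGA CGGC\nGA"))  # ATGACGGCGA
print(gc_content("ATGACGGCGA"))         # 0.6
```

Passing the full gene sequence to `gc_content` would give the value to compare against the reported 70%.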
 
Protein sequence
MTATTLENQS PEQAAGVPSG PLTHKQILYI LAGLMTGMFL AALDQTIVST AIRTIADDLS 
GYQLQVWVTT AYLITSTIVT PLYGKLSDIY GRKPFFMFAI TVFVLGSLLC SFSTSMYELA
AFRAVQGLGA GGLMSLALTI VGDIVPPRER ARYQGYFLAV FGTSSVLGPV LGGVFSGADS
ILGITGWRWV FLVNVPIGVI ALLVVYRVLQ LPHTRREGVR IDWVGAFMLA VGIVPLLIVA
EQGREWGWGS TKSITCYAVG AGGVLAFIII EKMMGEDAII PLRIFRNRIF AQGVLISVVV
GAAMFGGISL LPQYFQVVRG ASPTVAGFMM LPMVLGLMMG SILAGQMISR TGRYRIYPII
GAVLLTVGMW ALHYVTADVR LAAVMAGAAL IGFGLGNLMQ PLTLAMQNIL PPQDMGVSTA
AATFFRQIGG TLGVAVFLSV LFSEMPKNMQ SRLTDAAADP DYTAALQRAA AGHDGQAAQE
FANKLAARDS SAVSGIMSDT SALQKLPSTL VHPFKAGFAD SMDTVFLAVT VLSAIGLLLV
LFWKSVPLRT APAMETIEEE VVGAVLAGGP VAEPEPAAAA DTAPVSPAPV GAHPPGTTGV
GTASRAVARQ TRAPGPRTVE PVPPDDDRTA PVRVDPALAE IAELRSQNAE LAAELAELRT
GYRAIAGHTR LRAVGASDPT GFRIEVQGPG AARTGTLSTP HGEIATPAYL PVAARGAVAG
ITPEMLAALG VQAVTVDLHE LYLQPGTAMI EGAGGIGRAM AWAGPVLGDT GITAVAAAKK
TRVTDEGIAF RSRLDGGAHR WDPEEAVRVA HRIGVDIAFA MADPVAATAP RAQLEAGVNR
SQQWAARALV EHSWQTADRG DRQSLWAVVT GGADAGLRRA AARGLRRLSE QDVQSGGLGF
GGYRIDGVRS AAAEALQLRA ATEELDRERP RHLATVATPA DLFAAIEAGI DLFDGTAAAE
AGREGRVFTR DGVLDLTDPQ LRADFRPIDP GADRRGPANP ADEFTRAYVN HLFAANEGLA
VTLCTLHNEH FFATLAADAR RALTDGRYPA FTESFLRRFA KV
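
The protein sequence corresponds to the gene sequence translated with translation table 11. As a minimal sketch, the code below builds a codon table from the compact amino-acid string used for the standard genetic code, which table 11 shares for its codon assignments; table 11 differs in its set of alternative start codons, which this sketch does not handle. That is not a problem here because the gene starts with ATG. The function name `translate` is hypothetical.

```python
from itertools import product

# Codons enumerated in T, C, A, G order with the third base varying fastest.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna):
    """Translate a DNA sequence codon by codon, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks and final nine bases of the gene sequence above.
print(translate("ATGACGGCGA CGACGCTGGA GAACCAATCA"))  # MTATTLENQS
print(translate("AAGGTCTAA"))                         # KV
```

The two outputs match the first ten and last two residues of the protein sequence.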