Gene Tpau_2271 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTpau_2271 
Symbol 
ID9156427 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameTsukamurella paurometabola DSM 20162 
KingdomBacteria 
Replicon accessionNC_014158 
Strand
Start bp2364302 
End bp2365663 
Gene Length1362 bp 
Protein Length453 aa 
Translation table11 
GC content67% 
IMG OID 
Productexopolysaccharide biosynthesis polyprenyl glycosylphosphotransferase 
Protein accessionYP_003647218 
Protein GI296139975 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGCGGGA ACGGACGGTC GGGCAATCGC GCTCGTCATC GCCGGCGTCA CACCTGGCTG 
CTCGCCGTGC TGGACGCCGT CGTCGTGGCC GGATTCCTCG GCGCGACCTA CGACCCCGGT
AGCGATCCGG CGGCACACGT GACCAAGGTG CTCGCCGGGA CCGCACTGAC GATCGGGACC
TTCTACCTGT TCGGGCTCTA TCGCGCCCGA ATCACGCTCA GCGCGCTGGA CACCCTGCCG
CAGTTGGTGA TCGCGGCGTG GCTTCTCGCC CCGCTGGCGA TGGCGATGCA CTGGGACGAC
GACCACACAT TCATCCCCCA GGTGCTGATG TGCTGGGTGG CACTACTGAT GCTACGGGTG
GTGTACTACG CGGTGGTGCG CCATCGCCGG GCCGCACATC CGGAGAACGG CGCCCGGACC
CTGGTGATCG GTGGCGGCAA GGTCGCCGAC GAGCTGGTGC GCGCGATGTC CCATTACCCG
GTGTACGGCC TGCGTCCAGT GCTGGTGATG GATGACGATC CGCTCGACCC GACCCTCTTC
CCCACGGAGG TGATCCCCCG CCGCCACGAC CTGGCAGGTC TCATCGAGGA GCGCGATATC
GAGACGGTGA TCGTGGCCTT CTCCCGCGAC CGGGATTCCA CCCTGGTGGA CCCGCTGCGC
GAGTGCGACA CCCTGGATTG CGAGATCTAC GTCGTGCCGC GGCTGTACGA ATTCGTGCAT
CAGGACCGCG ATATGGACCG GATCCACACC ATCCCGCTCG TCCTGGTGCG CCGCCTCGCC
CTGCGCACCA GCCATTGGCG GGTCAAGCGG ATCACCTCGC TGATCACCTC GAGTGTGGGA
TTGCTGCTGT TCTCTCCCCT GCTGGCCCTC GTCGCGGCCG CGATCAAACT GCAGGATCCG
AGGGCGCCGG TATTGTTCCG TCAGACCCGG GTGGGCCAGG ACGGTCACCT GTTCACGCTG
TACAAGTTCC GCACCATGCG CCCGGTTCCC GATGAAGTGG CCGACGCCGA CTGGTCGACC
GACCGGACCA CACGTCTTGG CTCCTTCCTG CGCCGGTACT CGATCGACGA GTTGCCGCAA
CTGTGGAACG TGGTGTGCGG CGACATGTCG ATGGTGGGCC CGCGGCCCGA GCGCCCGCAC
TTCGTGGACC GGTTCCGCGC CGACATTCCC GCGTACAAGT CCCGGCACCG TGTGCACGCC
GGCCTCACCG GCTGGGCGGC GATCCACGGC CTGCGCGGCG ACACCGACAT CCGCGACCGC
GCGTCGTACG ACAACTACTA CATCGAGAAC TGGAGCCTGT GGCTCGACAC CAAGATTCTC
CTGATGACCG CGGCTTCGGT CCTCCTCGGT CGCGGTCGCT GA
 
Protein sequence
MSGNGRSGNR ARHRRRHTWL LAVLDAVVVA GFLGATYDPG SDPAAHVTKV LAGTALTIGT 
FYLFGLYRAR ITLSALDTLP QLVIAAWLLA PLAMAMHWDD DHTFIPQVLM CWVALLMLRV
VYYAVVRHRR AAHPENGART LVIGGGKVAD ELVRAMSHYP VYGLRPVLVM DDDPLDPTLF
PTEVIPRRHD LAGLIEERDI ETVIVAFSRD RDSTLVDPLR ECDTLDCEIY VVPRLYEFVH
QDRDMDRIHT IPLVLVRRLA LRTSHWRVKR ITSLITSSVG LLLFSPLLAL VAAAIKLQDP
RAPVLFRQTR VGQDGHLFTL YKFRTMRPVP DEVADADWST DRTTRLGSFL RRYSIDELPQ
LWNVVCGDMS MVGPRPERPH FVDRFRADIP AYKSRHRVHA GLTGWAAIHG LRGDTDIRDR
ASYDNYYIEN WSLWLDTKIL LMTAASVLLG RGR