Gene Tpau_1839 details


Gene Information

Locus tag: Tpau_1839
Symbol:
ID: 9155989
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Tsukamurella paurometabola DSM 20162
Kingdom: Bacteria
Replicon accession: NC_014158
Strand:
Start bp: 1922434
End bp: 1924125
Gene Length: 1692 bp
Protein Length: 563 aa
Translation table: 11
GC content: 69%
IMG OID:
Product: type III restriction protein res subunit
Protein accession: YP_003646796
Protein GI: 296139553
COG category:
COG ID:
TIGRFAM ID:
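
The start, end, and length fields in this record can be cross-checked with simple arithmetic. The sketch below assumes the coordinates are 1-based and inclusive, and that the final codon is a stop codon that does not contribute an amino acid. The record does not state these conventions, but both are consistent with the reported values. The function names are illustrative only.

```python
# Coordinates copied from the record above.
# Assumption: 1-based, inclusive coordinates on the replicon.
START_BP = 1922434
END_BP = 1924125


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given inclusive start and end coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Amino acid count for a CDS whose last codon is a stop codon (assumption)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 1692
print(protein_length(length_bp))  # 563
```

Both results agree with the Gene Length (1692 bp) and Protein Length (563 aa) fields.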


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.036346
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGCGGGTT CACTTCGTGT CTGGCAGCGC CGTGCGCTCA CGAAGTACCT GACCGCGAAG 
CCGCAGGACT TCCTGGCCGT GGCTACTCCG GGCGCCGGAA AGACCACGTT CGCCCTGCGT
GTGGCAGCCG AGCTGTTGGC CGATCGCACC GTGGAGCGGG TCACCGTGGT CGCCCCCACT
GAGCACCTGA AGTACCAGTG GGCCGAAGCG GCCGCGCGGA ACGGCATCAA CCTCGACCCG
AACTTCACCA ATTCGGGCGG CAGCACTTCG TCCGACTTCG ACGGTGTCGT GATCACCTAC
GCACAGGTGG GGATGCACCC GTACAAGCAC CACGCGCGCA CCACCGCCTA CAAGACCCTG
GTGATCCTTG ATGAGATCCA CCACGCCGGT GACGCGAAGA GCTGGGGCGA GGGCGTGCGC
GAGGCCTTCG AGGGTGCGAC CCGCCGTCTG GCGCTCACCG GCACCCCGTT CCGCAGCGAC
GACAACCCGA TCCCGTTCGT CACCTACGAG CCGGAGTTCG GCGGGGGACA GCGGTCGAAG
GCCGACCACG TCTACGGCTA TTCCGACGCA CTCGCCGACG GCGTGGTGCG GCCTGTGGTC
TTCCTCGCCT ACTCGGGCCA GGCGAGCTGG CGCACCAGCG CCGGTGAGGA ATTCACCGCC
CGCCTCGGCG AACCGCTGAG TAAGGAGCAG ACCGCCCGGG CGTGGCGCAC GGCGCTCGAC
CCGCACGGCG ATTGGATCCC CGCCGTGCTG CACGCCGCGA ACACCCGTCT CGATCAGCTG
CGCCGGACGA TGCCCGATGC CGGCGGCCTG GTGATCGCGA CCGATCAGAG CACCGCCCGC
GACTACGCCG AGCTGCTGCA CGACATCACC GGCGAGAAGG TCACGGTGGT GCTGTCCGAC
GATCCGACCG CCTCGAAACG GATCAGCGAG TTCTCGTCGA GCCGGGACAA GTGGATGGTC
GCGGTGCGGA TGGTGTCCGA GGGCGTCGAT GTTCCGCGTC TCGCGGTGGG TGTGTACGCC
ACCAGCGCCT CGACGCCGCT GTTCTTCGCG CAGGCGATCG GCCGCTTCGT CCGGTCGCGC
GCGCAGGGCG AGACCGCCAG CGTGTTCCTG CCCTCGGTCC CGGTTCTGCT CGACCTCGCG
TCGAAGCTGG AGGAACAGCG TGACCACGTC CTGGGCAAGC CGCATCGCGA GTCCGACGGG
CTCGACGACG CGCTGCTGAT CGACGCGAAC AAGCAGAAGG ACGAGCCCGG CGAGGAGGAG
AAGGCCTTCG TCTCACTGCA CGCCGACGCC GAACTCGACC AACTCATCTA CGACGGATCG
TCCTACGGGA CCGCAACCTT CGCGGGCAGC GACGAGGAGG CCGACTACCT GGGGCTGCCC
GGCCTGCTCG ATGCGGAGCA GATGCGGGCG TTGCTCAAAC AGCGGCAGAA GGAACAGGTG
CAGACCCGCA CCGTCGAGGC CGAGCGGCCG GCTCCGCCGG TCGAGGTCGA ACAGCGGGCC
ACCGCCGGTG AGCAGTTGTC GGCGTTGCGT CGCGAGCTGA ACTCGCTGGT CGCCATGCAT
CATCACCGCA CCGGCAAGCC ACACGGTGTG GTCCACAACG AGCTGCGCAG CCGCCTGGGC
GGCCCGGTCA CTGCGATGGC GTCCGCCGAG CAACTGCGCG AGCGGATCGC CGCTCTGCGC
ACCTGGCGCT AG
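
The gene sequence is printed in space-separated blocks of 10 bases, so the whitespace has to be removed before any computation. The sketch below is one way to do that and to compute a GC fraction. The helper names are illustrative, and the demonstration uses only the first row of the sequence, so its result is not the whole-gene GC content reported in the record.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(sequence: str) -> float:
    """Fraction of bases that are G or C."""
    seq = clean_sequence(sequence)
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First row of the gene sequence above (60 bases), used only as a demo input.
first_row = "ATGGCGGGTT CACTTCGTGT CTGGCAGCGC CGTGCGCTCA CGAAGTACCT GACCGCGAAG"
print(len(clean_sequence(first_row)))    # 60
print(round(gc_fraction(first_row), 3))  # 0.633
```

Passing the full 1692 bp sequence to gc_fraction would give the value to compare against the GC content field.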
 
Protein sequence
MAGSLRVWQR RALTKYLTAK PQDFLAVATP GAGKTTFALR VAAELLADRT VERVTVVAPT 
EHLKYQWAEA AARNGINLDP NFTNSGGSTS SDFDGVVITY AQVGMHPYKH HARTTAYKTL
VILDEIHHAG DAKSWGEGVR EAFEGATRRL ALTGTPFRSD DNPIPFVTYE PEFGGGQRSK
ADHVYGYSDA LADGVVRPVV FLAYSGQASW RTSAGEEFTA RLGEPLSKEQ TARAWRTALD
PHGDWIPAVL HAANTRLDQL RRTMPDAGGL VIATDQSTAR DYAELLHDIT GEKVTVVLSD
DPTASKRISE FSSSRDKWMV AVRMVSEGVD VPRLAVGVYA TSASTPLFFA QAIGRFVRSR
AQGETASVFL PSVPVLLDLA SKLEEQRDHV LGKPHRESDG LDDALLIDAN KQKDEPGEEE
KAFVSLHADA ELDQLIYDGS SYGTATFAGS DEEADYLGLP GLLDAEQMRA LLKQRQKEQV
QTRTVEAERP APPVEVEQRA TAGEQLSALR RELNSLVAMH HHRTGKPHGV VHNELRSRLG
GPVTAMASAE QLRERIAALR TWR
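
The protein sequence can be reproduced from the gene sequence by codon translation. The sketch below builds the codon table by hand. Translation table 11 assigns the same amino acids to codons as the standard code and differs in which codons may act as start codons. This gene begins with ATG, so the sketch does not handle alternative starts. The function names are illustrative.

```python
from itertools import product

# Codon-to-amino-acid assignments in TCAG order (shared by tables 1 and 11).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(sequence: str) -> str:
    """Translate a DNA sequence in frame 1, stopping at the first stop codon."""
    seq = "".join(sequence.split()).upper()
    protein = []
    # Step through complete codons; a trailing partial codon is ignored.
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First row of the gene sequence above, used as a demo input.
first_row = "ATGGCGGGTT CACTTCGTGT CTGGCAGCGC CGTGCGCTCA CGAAGTACCT GACCGCGAAG"
print(translate(first_row))  # MAGSLRVWQRRALTKYLTAK
```

The output matches the first 20 residues of the protein sequence listed above.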