Gene Tpau_1032 details


Gene Information

Locus tag: Tpau_1032
Symbol:
ID: 9155172
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Tsukamurella paurometabola DSM 20162
Kingdom: Bacteria
Replicon accession: NC_014158
Strand:
Start bp: 1058078
End bp: 1059538
Gene Length: 1461 bp
Protein Length: 486 aa
Translation table: 11
GC content: 70%
IMG OID:
Product: condensation domain protein
Protein accession: YP_003646004
Protein GI: 296138761
COG category:
COG ID:
TIGRFAM ID:
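
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon of the CDS is a stop codon. The function names are hypothetical and are not part of the database.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 1058078
END_BP = 1059538


def gene_length(start_bp, end_bp):
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp):
    """Amino acid count for an unspliced CDS whose last codon is a stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp, protein_length(length_bp))  # prints "1461 486"
```

Under these assumptions the result agrees with the listed Gene Length (1461 bp) and Protein Length (486 aa).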


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCAGTTCA CCGAGATCGA CGCCTACCCG CTGCGCCCCG GCGAGCTCCG CAATTGGGTC 
CCCACGACCG GGCCGGCGGC GCAGTGGCGC GATGACCCGC GGGCCACCTC GCACGTGCAC
GAGGCGCATC TGCGGGGCGC CGAGGCGGTG CTGCGGCGCC GGCACATCGA CGGCGGCCGC
GAGTCCTGGT TGGGCCTGGG CGTCGAATTC GACGAGCCGC TGTCGATCCC GGGCCTGCGC
ACCGCGCTGC GGGCCTGGAT CGACCGGCAC GAGGTGCTGC GCACGCACGT CTTCCTGGAG
GACGGTGCGC CGCGCCGGCG CAGCGCGCAG ATCGACACCG TCGATCTGAA GGTGCAGACG
GTGGGCACGT ACAACTCCAC CGAACCGCTC GCGGAACAGC TGCTCGGCGA GTTCGACCGG
GCGACGGCCC CACTGACCTG GCCCGCATAC ATGTTCTCCA CCGTGGCGCG CGAGGACTCG
TTCACCCTCT TCTTCGCCGC CGATCACTCT CTGCTGGACG GCTATTCGCT GATCCTGAGC
CCGTACGAAC TGCGCGAACT GTATCGGCAG GCGGTGCACG GCGACGAACC GAAGCTGATC
CCCGCGGGTA GCTACGTCAA CTATTCGGAC ACCGACCGGC AGTCCGCCGA CCGGGCCACC
GCCGCCCATC CCGCCGCGAT GCTCTGGGAC GAGTACCTCA CCGAGACCGG CTATCAGCCC
TCACCGTTCC CGATGCCACT GCTGCCGCCC AGCGCCAAGG TGCTCGCCGA CGAGACCCTC
GCCGCGCTCA CCCTCGATCC CGACGGTGTG CCCCGGATGC CGCAGAACGC CCTGAACTTC
CACCTGCTCG ACGACGAGCG GGCCAACAAC TTCACCCGGG TCTGCTCCGA GGCCGGTGCC
AGCCTGGTGA CCGGCGTCCT CGCCTGTATG GCGAAGATCA ACACCGACCT CGGCTTCGGT
CCGATCTTCC GCTGCGCGGT CACCCGGCAC ACCCGTGACG CCGAGCAGTG GATCGCGGCG
CTCGGCTGGT TCGTGGGCAT CGCGCCGTTC CGGCTCGACA CCACGGGCGC TCGCACCTTC
GGCGAGCTGG CCCAGCGGGC ACAGGAGCAG TGGCGGCATT CGAAGACCGG CAGCACCCTG
CCCTACCTAC GGATCGGCGA GGTGCTGGCG CAGGAGCCCG GTCATGCCCA GGCCCCGCCG
CCGCGTTTCG TCGTGTCCTT CATGGACACC CGGTCGGCAC CCGGATCGGC GATCAACGAT
GCCGGCGGTG CGAGTGCGCT GCGCTCACGC GACTACTCCC TCGATGACGT CTATCTGTGG
ATGCTGCGCA CCCCGTCGGG ACTGCACGTC GCGGCCCGGT TCCCCGGCTT CGACACCGCG
CGCCGCAGCC TCACGCTGTA CCTCGGCGCG CTACGGTCGA TGTTGACCGA GATCGGCGAC
GCAACGGTTA CCCGTGGGTA A
 
Protein sequence
MQFTEIDAYP LRPGELRNWV PTTGPAAQWR DDPRATSHVH EAHLRGAEAV LRRRHIDGGR 
ESWLGLGVEF DEPLSIPGLR TALRAWIDRH EVLRTHVFLE DGAPRRRSAQ IDTVDLKVQT
VGTYNSTEPL AEQLLGEFDR ATAPLTWPAY MFSTVAREDS FTLFFAADHS LLDGYSLILS
PYELRELYRQ AVHGDEPKLI PAGSYVNYSD TDRQSADRAT AAHPAAMLWD EYLTETGYQP
SPFPMPLLPP SAKVLADETL AALTLDPDGV PRMPQNALNF HLLDDERANN FTRVCSEAGA
SLVTGVLACM AKINTDLGFG PIFRCAVTRH TRDAEQWIAA LGWFVGIAPF RLDTTGARTF
GELAQRAQEQ WRHSKTGSTL PYLRIGEVLA QEPGHAQAPP PRFVVSFMDT RSAPGSAIND
AGGASALRSR DYSLDDVYLW MLRTPSGLHV AARFPGFDTA RRSLTLYLGA LRSMLTEIGD
ATVTRG
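
The sequences in this record are printed in space-separated blocks of ten characters, so they need to be joined before any computation. The following is a minimal sketch of joining the blocks and computing a GC fraction of the kind reported in the GC content field. The helper names are hypothetical, and only the first line of the gene sequence is used here as sample input.

```python
def clean_sequence(block_text):
    """Join a sequence printed in whitespace-separated blocks into one uppercase string."""
    return "".join(block_text.split()).upper()


def gc_fraction(dna):
    """Fraction of G and C bases in a DNA string (0.0 for empty input)."""
    if not dna:
        return 0.0
    return sum(base in "GC" for base in dna) / len(dna)


# First line of the gene sequence from this record, used as sample input.
first_line = "ATGCAGTTCA CCGAGATCGA CGCCTACCCG CTGCGCCCCG GCGAGCTCCG CAATTGGGTC"
dna = clean_sequence(first_line)
print(len(dna), round(gc_fraction(dna), 3))
```

Passing the complete gene sequence through the same two functions gives a value that can be compared with the reported GC content.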