Gene Tpau_4103 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTpau_4103 
Symbol 
ID9158291 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameTsukamurella paurometabola DSM 20162 
KingdomBacteria 
Replicon accessionNC_014158 
Strand
Start bp4227826 
End bp4229463 
Gene Length1638 bp 
Protein Length545 aa 
Translation table11 
GC content68% 
IMG OID 
ProductAMP-dependent synthetase and ligase 
Protein accessionYP_003649011 
Protein GI296141768 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGGAACGA GCCTGGCCTC CGATCTGCTT CCTCAGGGTC ACGCACTCTT GCGCATGATC 
CAGCGACGGG TCGTGGACCC CGCTCGCCCG GACGTCGCTC TGCGTGCCCT CGGCTACAAC
CGGCAGTACG GCCCGCAGGC CGCGCTGGTG ATCAAGGGGG CCGCGGAGAA CCCGGACCGG
GCCGCGATCG TCGACGAGCA CGGCACCCTC ACCTACGCCC AGTACGAGGC GCAGTCGAAT
GCGCTGGCCC GCGGCCTGCG GTCGACCGGC CTCAAGGCGG GCGATGTGAT CGCGGTGCTG
GCGCGCGATC ACCGCGGGCT CATGCTGATC ATCAGCGCCG CGGCTCGGGC CGGGCTACGG
CTGGCCATGA TGAACACCGG CTTCGCCAAG CCCCAGTTCG CCGAGGTGTG TGCGCGCGAG
AAGGTGCAGG CGGTGTTCCA CGACAGCGAG TTCACCTCGC TGCTGGACGC GCTGCCCGAC
GATATGCCCC GTTACCTCAC CTGGGTCGAC GACACCGACA CGATCCCCGA GGGCGCGCAG
ACCATCGACC AGCTCGCCGC GGGCCGCGAG ACCAAGCGGG TGCCCCCGCC GGCGCAGCAG
GGCGGCTTCA TCATCCTCAC CTCCGGCACC ACCGGCCTGC CGAAGGGCGC CACCCGGAGC
AAGGTGCCCT CGCTCGCGAC CGCGATGCTG GTCGATCGCA TTCCGTTCCA GCGCCGCGGC
ACCGTGGTGA TCGCCTCGCC GATCTTCCAC TCCACCGGCT TCGCGATGTG GTCGGCGGGA
ATGTCGGTGG GCTGCACCAC CGTCACCATG CGCCGTTTCG ATCCCGAGAA CACGCTCAAG
CTGATCGCCG ACAACAAGGC CGACATGCTG GTCGCGGTAC CCACGATGCT GACCCGCATG
CTCTCGCTCC CCGCCGAGAC CCTGGCGAAA TACGACACCA GCTCGCTGAA GTCGGTAGTG
GTCGCGGGTT CGGCTGTCTC ACCGGAGCTT TCGGAGCGAT TCCAGGACAC GTTCGGTGAC
GTGCTCTACA ACGTCTACGG TTCCACCGAG GTCGCCGTGG CCACCGTGGC GACGCCGCAG
AACCTGCGGA CCGCGCCGGG CACCGTCGGT AAGCCGCCGG TCCTGACCAC GGTGCGGCTG
TACGACGAGA ACGATCGCCT GGTCGAGGGA GTCGGTGTGC GCGGCCGCGT GTTCGTCCGC
GCCGGTGCGC CCTTCGAGGG CTACAGCGAC GGCCGCACCA AGCAGATCAT CGACGGCCAT
CTCTCGTCGG GCGATATGGG CCACTGGGAC GGAAACGGCC TGCTGCACAT CGACGGTCGT
GATGACGACA TGATCGTCTC CGGCGGCGAG AACGTGTATC CACTCGAGGT GGAGAACCTT
CTGGTGACCC GCGACGACGT CGTCGAGGCT GCGGTGATCG GTGTGCCCGA TGAGGAGTTC
GGTCAGCGGC TGCGCGCCTT CGTGGTGCTG TCCGACGGTG CACCCGAGGG CGATGGCGAG
GAGCTGACCA AAGACCTCAA GGACTTCGTC CGCGGGAATC TGGCGCGGTT CAAGGTGCCG
CGCGACGTCG TCTTCCTCGA CACGCTCCCC CGCAACCCCA CCGGCAAGAT CGTGCGCCGG
GAACTCCCCA AGGACTGA
 
Protein sequence
MGTSLASDLL PQGHALLRMI QRRVVDPARP DVALRALGYN RQYGPQAALV IKGAAENPDR 
AAIVDEHGTL TYAQYEAQSN ALARGLRSTG LKAGDVIAVL ARDHRGLMLI ISAAARAGLR
LAMMNTGFAK PQFAEVCARE KVQAVFHDSE FTSLLDALPD DMPRYLTWVD DTDTIPEGAQ
TIDQLAAGRE TKRVPPPAQQ GGFIILTSGT TGLPKGATRS KVPSLATAML VDRIPFQRRG
TVVIASPIFH STGFAMWSAG MSVGCTTVTM RRFDPENTLK LIADNKADML VAVPTMLTRM
LSLPAETLAK YDTSSLKSVV VAGSAVSPEL SERFQDTFGD VLYNVYGSTE VAVATVATPQ
NLRTAPGTVG KPPVLTTVRL YDENDRLVEG VGVRGRVFVR AGAPFEGYSD GRTKQIIDGH
LSSGDMGHWD GNGLLHIDGR DDDMIVSGGE NVYPLEVENL LVTRDDVVEA AVIGVPDEEF
GQRLRAFVVL SDGAPEGDGE ELTKDLKDFV RGNLARFKVP RDVVFLDTLP RNPTGKIVRR
ELPKD