Gene Tpau_4202 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTpau_4202 
Symbol 
ID9158390 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameTsukamurella paurometabola DSM 20162 
KingdomBacteria 
Replicon accessionNC_014158 
Strand
Start bp4330731 
End bp4332785 
Gene Length2055 bp 
Protein Length684 aa 
Translation table11 
GC content65% 
IMG OID 
Productprotein of unknown function DUF839 
Protein accessionYP_003649109 
Protein GI296141866 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCGCCTGC CCAACCTGCG TCTGCTCGTC GATCACGACG GCCAGTCCGC GCGCTCGCAC 
ACCACCTGCA AGTACAAGTG CGGCGAGCAG TGTTTCCGCC CCCACGACAA CACGTCCGAC
AACGAGTACT TCGGCGATCT GACCCGCCGC ACCATTCTCA AGGGTGCGGG CATCGCGGCC
GTGGGCGTGG GCGCCACCAC GGTGCTCGCC GCCTGTGGCG ACGCGAGCAC CGGCTCCGCC
GGTTCGTCCT CAGCGGGTTC GGCAGCAGGT GCCGATATCA AGGCTCCCGG ACTGAATTTC
ACCTCGGTGG CCGCGAACAG CGAAGACCGC GTCGTGATTC CGGAAGGTTA CGAGCAGGCC
GTGGTGATCG CCTGGGGTGA TCCGGTGCTG CCGGGCGCGC CGAAGTTCGA CTTCGACAAC
CAGACTCCGG AGGCGCAGGC CCAACAGTTC GGCTACAACA ACGATTTCAC CGATCTGATC
GAGGTGGACG GTAAGCCGGA CACGTATCTG ATGGTGTGCA ACCAGGAGTA CACCACCGGA
AAGCACATGT TCAAGGGCTA CGACGAGAAG AACCCCACCG AGAATCAGGT GCGGGTCGAG
ATGGCCGCGC ACGGCCTCAC TGTCGTGGAG GTGAAGGGCG AGCCGGGCTC CGGCAAGCTG
ACTCCTGTTC TGGGCGAATA CAATCGGCGC GTCACCGCAT CGACGGAGTT CGAGCTGCGC
GGCCCCGCCG CCGGTTCGTG GTTCGTCAAG ACCGCCGCCG ATCCCACGGG CACCCGGGTA
CTGGGCACCA TCGCGAACTG CTCCGGCGGC ATCACCCCCT GGGGCACCAT GCTGTCCGGC
GAGGAGAACT ACACCAACTA CTTCTCCGGC GCCGACACCC CGCCGATCGA CGGCCTCAAG
GAGGGTTGGA AGCGGTACGG ACTGGGCAAG GACTCGGACT ACCACAACTG GGGTAAATAC
GAGGACCGGT TCAACCTGGC CAAGGAGATC AACGAGGCCA ACCGATTCGG CTACATCGTC
GAGCTCAATC CCCACGATCC GCGCTCCACG CCGGTGAAGC ACTCGGCCAT GGGCCGGTTC
AAGCACGAGG GCTCCAACAT CCATGTGACC TCGAACGGCA CTGTGGTGGC GTATTCGGGT
GACGACTCGA AATTCGAGTA CATCTACAAG TTCGTCTCGT CGCGGAAGAT CCAGCAGGGC
AAGGGGCCCG CCGCAATGGA GTCGAACATG CGGATCCTCG ACGAGGGCAC TCTGTACGTG
GCCAAGTTCG AGGGCCCGAA GGTCGAGGAC GGCAAGGTTC CGGCGGGCGG GTACAAGGGA
ACCGGAAAGT GGATCGCCTT GCTCACGGTG GACGCATCCG GCAAGGCGAC CTCGCACATC
GACGGCATGC CGCCGGAGAA CGTCGCGGTG TACACCCGGG TGGCCGCCGA TAAGGCCGGC
GCCACCAAGA TGGACCGCCC GGAGGACATC CAGCCGCACC CGAAGACCGG GAAGGTGTAC
TGCGCGCTGA CCAACAACAG CGACCGCGGT ACCGAGGGCA AGGCCGGTAT CGACGCGGCC
AACCCCCGGG TGAAGAACAA GAACGGCCAG ATCCTGGAGA TCACCGACGA TCACACCGGC
ACCGAGTTCT CCTGGGACCT GCTGCTGGTG TGCGGCGATC CCGCCGCGGC CGACACCTAC
TACGGCGGTT ACGACAAGAG CAAGGTGTCG CGCATCAGCT GCCCCGACAA CGTGGCCTTC
GACTCGTTCG GCAACCTGTG GATCTCGACC GACGGCACCC AGGACACCTT CAAGAGCAAC
GACGGCCTGT TCGGCGTGGT GCTGGAGGGC AAGGACCGGG GCCTGACGAA GCAGTTCCTC
ACTGTGCCCT ACGGCGCCGA GACATGCGGT CCCGTCATCC GAGACAACCG CGTTCTTGTC
GCAGTTCAGC ATCCGGGTGA GACGGACGAC GGAGGCGCCG GCAGCCCCAC CTCGCACTGG
CCCGACGGCG GCACGAACGA GCCGCGACCT GCGGTGGTCG CCGTTTGGAA GACGAGCGGC
AACATCGGCA CGTAG
 
Protein sequence
MRLPNLRLLV DHDGQSARSH TTCKYKCGEQ CFRPHDNTSD NEYFGDLTRR TILKGAGIAA 
VGVGATTVLA ACGDASTGSA GSSSAGSAAG ADIKAPGLNF TSVAANSEDR VVIPEGYEQA
VVIAWGDPVL PGAPKFDFDN QTPEAQAQQF GYNNDFTDLI EVDGKPDTYL MVCNQEYTTG
KHMFKGYDEK NPTENQVRVE MAAHGLTVVE VKGEPGSGKL TPVLGEYNRR VTASTEFELR
GPAAGSWFVK TAADPTGTRV LGTIANCSGG ITPWGTMLSG EENYTNYFSG ADTPPIDGLK
EGWKRYGLGK DSDYHNWGKY EDRFNLAKEI NEANRFGYIV ELNPHDPRST PVKHSAMGRF
KHEGSNIHVT SNGTVVAYSG DDSKFEYIYK FVSSRKIQQG KGPAAMESNM RILDEGTLYV
AKFEGPKVED GKVPAGGYKG TGKWIALLTV DASGKATSHI DGMPPENVAV YTRVAADKAG
ATKMDRPEDI QPHPKTGKVY CALTNNSDRG TEGKAGIDAA NPRVKNKNGQ ILEITDDHTG
TEFSWDLLLV CGDPAAADTY YGGYDKSKVS RISCPDNVAF DSFGNLWIST DGTQDTFKSN
DGLFGVVLEG KDRGLTKQFL TVPYGAETCG PVIRDNRVLV AVQHPGETDD GGAGSPTSHW
PDGGTNEPRP AVVAVWKTSG NIGT