Gene Tpau_3562 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTpau_3562 
Symbol 
ID9157741 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameTsukamurella paurometabola DSM 20162 
KingdomBacteria 
Replicon accessionNC_014158 
Strand
Start bp3669238 
End bp3670548 
Gene Length1311 bp 
Protein Length436 aa 
Translation table11 
GC content66% 
IMG OID 
Productcondensation domain protein 
Protein accessionYP_003648480 
Protein GI296141237 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.369084 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAATGCGG CCGAATTGAC CGATAACGAA CCGTCTTTCA TTCAAGAGGA TCACCTTCGG 
AGCGGCACCG ACGAGGAACA GTCCATTCAG CGACTCGTCT ACGTCACTTT CACGATGAAC
GATGTGTACG ACGCGAAGGC GATGGAGCAG GCCTTCACCG AATACGCGCG CCAGCACAGC
AGCTATCACG CGCAGTACGT CAAGTGCGAC TGCGGCTACC GAGCGCGGTA CATCCGCCCC
GAAGACATGG AGCTCGAGGT GGTGGCCACC TCCGAGACCA CGGGCCCCGG TGTCACCGCG
AAGGACTATC TGCAAACCGC CCTCCCGGAT CTCGACGAGT GGTCCTCGTT CGCCTTCGGA
GTCACCGGTA TCGAGGCGGC GCAAGACGAC TCGCCGGCCC CGCAGTTCAC CGTGCTGATC
GCGGCCGACC ACCTGTTCAC CGATGCGATC TCGCTGTCGA TCTCCTTCTA CGAACTGGTC
TCCCGCTACG CCGCGATCCG GGCGGGCCGG GAGTACGTGG CTCCCCCCGT ACGCTCCTAC
CGCGATTTCA GCGCGGAGCA GCGGGAACGG GCACGGGGCC TACACCCGGA CCATCCCGAG
GTGACGCGCT GGCGCGAAAT CGTCCGGCGA GCCGGTGGGA TGCCCCGCTT CCCGCTTCCG
CTGGGCCTCG AATCGGGTCG CGGCATCGCA CCGCAGATCG CCGTGGAAAC GGCGTTCCTC
GACCCCGCGC GGGTGGCCGC CTTCGCCGCC ACCGCGAAGC AGTGCGGTGG CGCCATGGGC
AGCGCCCTGT TGGCAGTGCA GGCCGAACTC GAACGCGAGC TCACGGGCAG CGATCTCTTC
ACCATGATGG CGCCACGGTC GCACCGCCCC GACCCCACGG ACCTCATGGC CGTCGGGTGG
TACATCACGC TGGTCCCGGT GCAGTTCTCC ACACGAGGCG ACTTCCCCGA CCTGGTCAAG
GCGGCTCAGC AGGCCCTGAC CACCGCGCGG GAACTCGAGC GGCTCCCGGT GTTCCCGGTG
ATCGACGTGC TGCGCGACGA CCCGGACTTC CCGGTGGACC ACGGTTTCGA TGCCCCGATG
CTGTCGTATA TCGATATCAC GCGAACTCCG GGTGCCGAAT TGGCGCGCAG TCACGACGTT
TCCATATTCG CGAATGAGAC GCCGATGCGC GAGGTGTACA TGTGGATCAA TCGCGACGCC
GACGGGCTCG ATTTCCGGGC GATGTACCCC GGCAATCCTA CCGCGGAGGC ATCGGTGCGG
ACCTATTTCT CCCTGCTCCG CGATCGCCTG GACCAGCTCT CACACGTCTG A
 
Protein sequence
MNAAELTDNE PSFIQEDHLR SGTDEEQSIQ RLVYVTFTMN DVYDAKAMEQ AFTEYARQHS 
SYHAQYVKCD CGYRARYIRP EDMELEVVAT SETTGPGVTA KDYLQTALPD LDEWSSFAFG
VTGIEAAQDD SPAPQFTVLI AADHLFTDAI SLSISFYELV SRYAAIRAGR EYVAPPVRSY
RDFSAEQRER ARGLHPDHPE VTRWREIVRR AGGMPRFPLP LGLESGRGIA PQIAVETAFL
DPARVAAFAA TAKQCGGAMG SALLAVQAEL ERELTGSDLF TMMAPRSHRP DPTDLMAVGW
YITLVPVQFS TRGDFPDLVK AAQQALTTAR ELERLPVFPV IDVLRDDPDF PVDHGFDAPM
LSYIDITRTP GAELARSHDV SIFANETPMR EVYMWINRDA DGLDFRAMYP GNPTAEASVR
TYFSLLRDRL DQLSHV