Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tpau_3562 |
Symbol | |
ID | 9157741 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Tsukamurella paurometabola DSM 20162 |
Kingdom | Bacteria |
Replicon accession | NC_014158 |
Strand | + |
Start bp | 3669238 |
End bp | 3670548 |
Gene Length | 1311 bp |
Protein Length | 436 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | |
Product | condensation domain protein |
Protein accession | YP_003648480 |
Protein GI | 296141237 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.369084 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAATGCGG CCGAATTGAC CGATAACGAA CCGTCTTTCA TTCAAGAGGA TCACCTTCGG AGCGGCACCG ACGAGGAACA GTCCATTCAG CGACTCGTCT ACGTCACTTT CACGATGAAC GATGTGTACG ACGCGAAGGC GATGGAGCAG GCCTTCACCG AATACGCGCG CCAGCACAGC AGCTATCACG CGCAGTACGT CAAGTGCGAC TGCGGCTACC GAGCGCGGTA CATCCGCCCC GAAGACATGG AGCTCGAGGT GGTGGCCACC TCCGAGACCA CGGGCCCCGG TGTCACCGCG AAGGACTATC TGCAAACCGC CCTCCCGGAT CTCGACGAGT GGTCCTCGTT CGCCTTCGGA GTCACCGGTA TCGAGGCGGC GCAAGACGAC TCGCCGGCCC CGCAGTTCAC CGTGCTGATC GCGGCCGACC ACCTGTTCAC CGATGCGATC TCGCTGTCGA TCTCCTTCTA CGAACTGGTC TCCCGCTACG CCGCGATCCG GGCGGGCCGG GAGTACGTGG CTCCCCCCGT ACGCTCCTAC CGCGATTTCA GCGCGGAGCA GCGGGAACGG GCACGGGGCC TACACCCGGA CCATCCCGAG GTGACGCGCT GGCGCGAAAT CGTCCGGCGA GCCGGTGGGA TGCCCCGCTT CCCGCTTCCG CTGGGCCTCG AATCGGGTCG CGGCATCGCA CCGCAGATCG CCGTGGAAAC GGCGTTCCTC GACCCCGCGC GGGTGGCCGC CTTCGCCGCC ACCGCGAAGC AGTGCGGTGG CGCCATGGGC AGCGCCCTGT TGGCAGTGCA GGCCGAACTC GAACGCGAGC TCACGGGCAG CGATCTCTTC ACCATGATGG CGCCACGGTC GCACCGCCCC GACCCCACGG ACCTCATGGC CGTCGGGTGG TACATCACGC TGGTCCCGGT GCAGTTCTCC ACACGAGGCG ACTTCCCCGA CCTGGTCAAG GCGGCTCAGC AGGCCCTGAC CACCGCGCGG GAACTCGAGC GGCTCCCGGT GTTCCCGGTG ATCGACGTGC TGCGCGACGA CCCGGACTTC CCGGTGGACC ACGGTTTCGA TGCCCCGATG CTGTCGTATA TCGATATCAC GCGAACTCCG GGTGCCGAAT TGGCGCGCAG TCACGACGTT TCCATATTCG CGAATGAGAC GCCGATGCGC GAGGTGTACA TGTGGATCAA TCGCGACGCC GACGGGCTCG ATTTCCGGGC GATGTACCCC GGCAATCCTA CCGCGGAGGC ATCGGTGCGG ACCTATTTCT CCCTGCTCCG CGATCGCCTG GACCAGCTCT CACACGTCTG A
|
Protein sequence | MNAAELTDNE PSFIQEDHLR SGTDEEQSIQ RLVYVTFTMN DVYDAKAMEQ AFTEYARQHS SYHAQYVKCD CGYRARYIRP EDMELEVVAT SETTGPGVTA KDYLQTALPD LDEWSSFAFG VTGIEAAQDD SPAPQFTVLI AADHLFTDAI SLSISFYELV SRYAAIRAGR EYVAPPVRSY RDFSAEQRER ARGLHPDHPE VTRWREIVRR AGGMPRFPLP LGLESGRGIA PQIAVETAFL DPARVAAFAA TAKQCGGAMG SALLAVQAEL ERELTGSDLF TMMAPRSHRP DPTDLMAVGW YITLVPVQFS TRGDFPDLVK AAQQALTTAR ELERLPVFPV IDVLRDDPDF PVDHGFDAPM LSYIDITRTP GAELARSHDV SIFANETPMR EVYMWINRDA DGLDFRAMYP GNPTAEASVR TYFSLLRDRL DQLSHV
|
| |