Gene Information |
Locus tag | Tpau_2554 |
Symbol | |
ID | 9156715 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Tsukamurella paurometabola DSM 20162 |
Kingdom | Bacteria |
Replicon accession | NC_014158 |
Strand | - |
Start bp | 2648098 |
End bp | 2649972 |
Gene Length | 1875 bp |
Protein Length | 624 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | |
Product | Protein of unknown function DUF2075 |
Protein accession | YP_003647496 |
Protein GI | 296140253 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
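The coordinate and length fields in the table above can be cross-checked against each other. The following is a minimal sketch, assuming that Start bp and End bp are 1-based inclusive positions and that the CDS ends with a single stop codon that is not counted in the protein length. The variable names are illustrative and are not part of the record.

```python
# Values copied from the Gene Information table.
start_bp = 2648098
end_bp = 2649972
gene_length_bp = 1875
protein_length_aa = 624

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and contributes no amino acid.
codon_count = span // 3
expected_protein_length = codon_count - 1

print(span == gene_length_bp)                        # True
print(span % 3 == 0)                                 # True
print(expected_protein_length == protein_length_aa)  # True
```

Under these assumptions the record is internally consistent: 1875 bp is 625 codons, which is 624 amino acids plus one stop codon.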
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.0156913 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGACGTTGC TGCGGGTCAC CGCGAACCGA CTGTTGGAGG AGTACACCTC GCAGTCACTC GTGGAGAGCT TGCTGGAGCG CATGTTGTTC GAACGTGGAG TGACAGTCGC CGAGGCGGAG CAGCGTTCCT GGTCGAATAG CCTGCCCAAG CTGGCCGCAG ACCTCCGGCA GGGCGGACTC GGTGGCGTCG AGATGCTCAT CGAGTACAAG CTTCCCCTCT CATCCCGCCG CGCTGACGTC GTGCTCGCAG GCATCCATCC GCACACCGGC GAACCGTCCT ATGTCGTGGT CGAGTTGAAG CAGTGGTCGT CCGCTACCGC GTTTGAGGGT GACATTCGAA TGGTGAGCGT GCCGCACCTG GGCGGTCCGG TTTTGCACCC GGTCGCACAG GTGGCCGGCT ACTGCCAGTA CATCGGTGAC TTCGCGCGCT CGCTTCGCGA CCAGGAGGAT CCGCTGGCGG GTGTCGCGTA TCTGCACAAT GTGACACAGC GGGGTGCGAT CGAGGACCTG ATCGACTTCC CGGTGACGAA TGCGGGCCGA ATGTTCACCG GTGCCGAGAC GGACCGGCTG CTTGACTTTC TGCGCACTCG GCTCGCCCCG GATGCTCACC CAGGGCCAGC GGCCGATGCG TTGTTGAACT CCCCGGCGGC GCCGTCACAG CAGCTCCTGG CAGTCGCGGC GGAAGAGGTA CGCGAGCGCC AGGTCTTTCA CCTGCTGGGG AATCAGAAGC TCGCTGTTGA CTTGGTGCTG CACGATGTGG AACGAGCGCG CGCAGCTGAT ACGAAGCGAG TGATAGTCGT GACCGGTGGC CCCGGTAGTG GCAAGAGCGC AATCGCTCTC GCTTTGCTCG GTGACCTCGC TCGGCAAGGG CGGACGGTAC TGCACGCCAC GGGATCTCGT TCCTTCACTA CGACCCTGCG GCAGGTGGCG GGTGAACGCG CGCCGCGTGT CAAGGCGATG TTCAAGTACT TCAATCAGTT CGTCGCTGCA GAGCGGAACG GCCTCGACGT GCTGATTCTC GACGAGGCGC ACAGGATCCG GGAGACTTCT GCGAACCGGT TCACCAAAGC GATGAACAGG ACTGGCAGAC CCCAAGTCGA TGAGCTGATC GCCGCCGCGC GGGTTCCGCT GTTCCTCTTG GACGAGCACC AGGTGGTCCG CCCTGGGGAG TTGGGTTCTC TTCCACAGAT CCGTGCCTAC GCCGCATCGC TCGGACTCGA GACGATACAT ATCCACTTGG ATGACCAGTT CCGTTGCGGC GGAAGCGAGC GGTACGTCGA GTGGGTCCTC GACCTGGTGG GTCTGTCAGA GAAGAAGGCG TGGACGTGGA CCGGAGACGA AGGCTTCGAC GTCCGGGTTG CCGAGACGCC ATGGGATCTC GAACGAATGC TCGACGACAA ACGCGCGGAG GGCTACTCAG CCCGGATGAC AGCAGGCTAC TGCTGGTCGT GGAGCGACCC GACACCCGAG AAGACTCTGG TCCCAGACGT CGAGATCGGT GACTGGGCCC GCCCGTGGAA CTCCAAATCC GATCGCCGCA TCGGTGACGC GCCGCCCTCG CAGCTCTGGG CGACCGACGA CGGCGGATTC GGGCAGGTGG GGTGTGTATA CACCGCGCAG GGATTCGAGT ACGACTACAG CGGCGTAATC CTAGGCCCCG ACTTCGTTTG GCGTGATGGG CGATTCGTTG TCCAGCGTGA TCAGAACAAG GATCCGAGCC TGAAGTCGAA GAAGGGTCTC TCGGATCCGC GGTTCGATCT CTTGATCCGC AATACGTACA AGGTTCTGCT CACCCGCGGG 
ATGTCTGGAA CCGTTCTCTA CTCCACCGAT GAAGAGACTC GCGAGGCGCT CACGCGCTTC GTGCAACCGA TGTAA
|
Protein sequence | MTLLRVTANR LLEEYTSQSL VESLLERMLF ERGVTVAEAE QRSWSNSLPK LAADLRQGGL GGVEMLIEYK LPLSSRRADV VLAGIHPHTG EPSYVVVELK QWSSATAFEG DIRMVSVPHL GGPVLHPVAQ VAGYCQYIGD FARSLRDQED PLAGVAYLHN VTQRGAIEDL IDFPVTNAGR MFTGAETDRL LDFLRTRLAP DAHPGPAADA LLNSPAAPSQ QLLAVAAEEV RERQVFHLLG NQKLAVDLVL HDVERARAAD TKRVIVVTGG PGSGKSAIAL ALLGDLARQG RTVLHATGSR SFTTTLRQVA GERAPRVKAM FKYFNQFVAA ERNGLDVLIL DEAHRIRETS ANRFTKAMNR TGRPQVDELI AAARVPLFLL DEHQVVRPGE LGSLPQIRAY AASLGLETIH IHLDDQFRCG GSERYVEWVL DLVGLSEKKA WTWTGDEGFD VRVAETPWDL ERMLDDKRAE GYSARMTAGY CWSWSDPTPE KTLVPDVEIG DWARPWNSKS DRRIGDAPPS QLWATDDGGF GQVGCVYTAQ GFEYDYSGVI LGPDFVWRDG RFVVQRDQNK DPSLKSKKGL SDPRFDLLIR NTYKVLLTRG MSGTVLYSTD EETREALTRF VQPM
|
| |
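The gene sequence begins with GTG, while the protein sequence begins with M. This is consistent with translation table 11, where GTG can act as an alternative start codon that is read as methionine. The sketch below translates only the first 21 bases of the gene sequence. The codon table is deliberately partial (just the codons in that prefix), the start codon set is a subset of those allowed by table 11, and `translate_prefix` is a hypothetical helper, not part of any library.

```python
# Partial codon table: only the codons that occur in the 21-base prefix used below.
# These are standard assignments that translation table 11 shares with table 1.
CODONS = {
    "GTG": "V", "ACG": "T", "TTG": "L", "CTG": "L",
    "CGG": "R", "GTC": "V", "ACC": "T",
}

# Assumption: a subset of the start codons permitted by translation table 11.
# At the first position these are read as Met regardless of their usual meaning.
START_CODONS = {"ATG", "GTG", "TTG"}


def translate_prefix(dna: str) -> str:
    """Translate a short coding-strand prefix, treating codon 1 as a start codon."""
    dna = dna.replace(" ", "").upper()
    usable = len(dna) - len(dna) % 3  # ignore any trailing partial codon
    peptide = []
    for i in range(0, usable, 3):
        codon = dna[i:i + 3]
        if i == 0 and codon in START_CODONS:
            peptide.append("M")
        else:
            peptide.append(CODONS[codon])  # KeyError for codons outside the partial table
    return "".join(peptide)


# First 21 bases of the gene sequence as listed in the record.
prefix = "GTGACGTTGC TGCGGGTCAC C"
print(translate_prefix(prefix))  # MTLLRVT
```

The result matches the first seven residues of the listed protein sequence (MTLLRVT). For the full sequence, a complete codon table would be needed.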