Gene Information |
Locus tag | Tpau_2531 |
Symbol | |
ID | 9156692 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Tsukamurella paurometabola DSM 20162 |
Kingdom | Bacteria |
Replicon accession | NC_014158 |
Strand | + |
Start bp | 2622243 |
End bp | 2624069 |
Gene Length | 1827 bp |
Protein Length | 608 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | |
Product | hypothetical protein |
Protein accession | YP_003647474 |
Protein GI | 296140231 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
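The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes the Start bp and End bp values are 1-based and inclusive, and that the reported protein length excludes the terminal stop codon; neither convention is stated in the record itself. The variable names are illustrative only.

```python
# Values copied from the Gene Information fields above.
start_bp = 2622243
end_bp = 2624069
reported_gene_length = 1827     # bp
reported_protein_length = 608   # aa

# Assumption: coordinates are 1-based and inclusive, hence the +1.
gene_length = end_bp - start_bp + 1

# Each codon is 3 bp. Assumption: the last codon is a stop codon
# and is not counted in the protein length.
codons, remainder = divmod(gene_length, 3)
protein_length = codons - 1

print(gene_length, remainder, protein_length)  # prints: 1827 0 608
print(gene_length == reported_gene_length,
      protein_length == reported_protein_length)  # prints: True True
```

Under these assumptions the computed values agree with the reported Gene Length (1827 bp) and Protein Length (608 aa).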
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.41472 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGTACACCC GCGAGGCATA CATCCACAAC AGCGGACCGA TCAAGGATCT TTATGTCAAG TTTGAGGTCG ACGAGAACGA CCGGCCTATC CCGACCTTGT TTGTAGGACG CAACGGCGCA GGCAAGACGA ACCTTCTCTC AACGTTGGCC GAGCCTCTAT TACTAGGTGC GAGCAAAGTT TACGACGACG TACTTACCCT AAAAGGGTTT GGACGAAGCT ACTTCCGCAT TCTAGGTGCT AGCACAGTTA CCACCGGCGC CAGCAACGGG TTTGCGATAG TCAAGTACAC AAACGGAGCC GAATCTGTCT TCTACCGGGA GAATGCCGGA GATATTTCTG TCGACGATGC TAAATCACTA ATTCCGGACA CGCTCCTCGA AGGCGCAAAC TGGGACGCCA AAGAATCCTC AAAAGAGATC ATCCTCAACG ATGAGGCGGC GCGGCAGGTG TTCGGGAACG GCGCCTACAT ATTTTTCCCA TCAAACAGGT CAGAGACTCC GTACTGGTTC AACACCGAGG CGATTCCCGA TCAAAAATTT GATACCTCGG ATCGATTTAG AAACAATCTC AATAACCCTA TGTACGTTGA GCGAAGCCTC GACGAGTTCG CGCAATGGCT ATTGGGCGTC CTCGCAGAAT CGCGACTCCC GGTGCTCGAA GCGGCATACC TAGCGAAATC CGAAGAAAGC AAAGTTGTGG TGACGTGCGA CACAACGCAG TTCATGGTCG GTCAGACGCC ACTCATTCAA GCCAACAACA TACTTAAGGC TATCACCAGA TCAAGCGATG CCGCCTTCTA TTGGTCCAGC CGCCACGCCA GTTCCAAGGT TGGAATTCAC AAGGGCGGCG TCAACCTATG GTCGGGCCTT AAGAGCCTGT CAGCGGGCGA AGGCACCCTC CTATCCGTGT TCGGCTCACT ACTTCGTCGC AGCGACCAGC TCAGGATAAC TCCCGAGGAA TTGACCGGAG TCGCCATCAT CGACGAACTG GACGCGCATA TCCATATCGA ACTTCAAATG AGCGCACTTC CTGAGCTAAT CGCCATGTTC CCACGTATCC AATTCATTAT TTCCAGCCAC TCGCCGTTCT TCGCATTGGG CATGGAGTCC AAGTTTCCAG GCAAGATCAA TATCGTCGAC CTACCTTCGG GTCAGAACAT CAACGCGGAA TCATACACAG AATTCAAGGC CGCGCTTGAG ACTTTTTACG ACACTCGTAG GTTTGAGGAA ATCGTAGACC AAAGACTCCG GGAATCGAAT ACGCCAACAA TCTTGGTCGG GGGCACTACT GACCGAGACT ACTTTAAAGC AGCTTCTATT GCCCTTGGTT ACGATGATCT GGCCGACCTA TTTGAGTGGG TTGGGGAACC TGGAGGATCA GGCGGTGGCC GAAACACAGG CGACTCAAGC CTGCAGAAGG CTGAATCATT CATCCGAGCC AACCCTTCAC TCGTCTCCAA GGATGTCGTA ATTCTATACG ATTGCGACGC CAAGGCCACG TCCAGTAGCT CCGGCAAGCT ACACATCGTG CCGATAGAAC AAATACCCGG TGCTCGCTGC AGCAAAGGCG TAGAGAACCT GCTACCCGAC TCAGTCTTCA CCGAGGACGC GTATGCCATC CAAGAGAAGC CGGACAGCTA CGGTGGCGGA GCGACGGTCA AATCGATCAA GAAGACCTAT CTATGCGAGC AGGTCTGTGC CCAACCGAAG AAATCAGCAG AAATTTTTGA GAACTTCAGG CCGACTTTGG ACGCCATATC GTCACTTCTC GGCACGACGA GGCCCCAATC CGATTCCCAA 
ACTACCCCCA CCCCCGCAGC GGAGTGA
|
Protein sequence | MYTREAYIHN SGPIKDLYVK FEVDENDRPI PTLFVGRNGA GKTNLLSTLA EPLLLGASKV YDDVLTLKGF GRSYFRILGA STVTTGASNG FAIVKYTNGA ESVFYRENAG DISVDDAKSL IPDTLLEGAN WDAKESSKEI ILNDEAARQV FGNGAYIFFP SNRSETPYWF NTEAIPDQKF DTSDRFRNNL NNPMYVERSL DEFAQWLLGV LAESRLPVLE AAYLAKSEES KVVVTCDTTQ FMVGQTPLIQ ANNILKAITR SSDAAFYWSS RHASSKVGIH KGGVNLWSGL KSLSAGEGTL LSVFGSLLRR SDQLRITPEE LTGVAIIDEL DAHIHIELQM SALPELIAMF PRIQFIISSH SPFFALGMES KFPGKINIVD LPSGQNINAE SYTEFKAALE TFYDTRRFEE IVDQRLRESN TPTILVGGTT DRDYFKAASI ALGYDDLADL FEWVGEPGGS GGGRNTGDSS LQKAESFIRA NPSLVSKDVV ILYDCDAKAT SSSSGKLHIV PIEQIPGARC SKGVENLLPD SVFTEDAYAI QEKPDSYGGG ATVKSIKKTY LCEQVCAQPK KSAEIFENFR PTLDAISSLL GTTRPQSDSQ TTPTPAAE
|
| |
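The gene sequence begins with GTG while the protein sequence begins with M, which is consistent with translation table 11, where GTG is one of the alternative start codons. The following is a minimal sketch of that translation step in plain Python, not the method used to produce this record. The function name `translate_cds` and the constants are hypothetical, and the start-codon set is taken from the NCBI definition of table 11 as an assumption.

```python
from itertools import product

# Codon-to-amino-acid assignments in TCAG order. Table 11 uses the
# same assignments as the standard code; it differs in start codons.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Assumption: start codons permitted by NCBI translation table 11.
START_CODONS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}


def translate_cds(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    dna = dna.replace(" ", "").upper()
    usable = len(dna) - len(dna) % 3  # ignore a trailing partial codon
    protein = []
    for index in range(0, usable, 3):
        codon = dna[index:index + 3]
        aa = CODON_TABLE[codon]
        if index == 0 and codon in START_CODONS:
            aa = "M"  # an initiator codon is read as methionine
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bp and last 12 bp of the gene sequence above.
print(translate_cds("GTGTACACCC GCGAGGCATA CATCCACAAC"))  # prints: MYTREAYIHN
print(translate_cds("GCAGC GGAGTGA"))                     # prints: AAE
```

The first call reproduces the first ten residues of the protein sequence above, and the second reproduces its last three residues before the TGA stop codon.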