Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tpau_0432 |
Symbol | |
ID | 9154567 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Tsukamurella paurometabola DSM 20162 |
Kingdom | Bacteria |
Replicon accession | NC_014158 |
Strand | - |
Start bp | 453967 |
End bp | 455967 |
Gene Length | 2001 bp |
Protein Length | 666 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | |
Product | YhgE/Pip C-terminal domain protein |
Protein accession | YP_003645412 |
Protein GI | 296138169 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.347032 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCCGACA AGAAGTCCCC GAAGCGGAGC AACACCGTTC CGATCGCCGC CGGCCTCGCC TTCGGCTCCG AGATCAAGCG CTTCGGCCGC AGCCGGATGA CGCGCGCCGC GATCGTGGTG CTCATGCTGC TGCCGCTGGT GTACGGCGCG CTGTACCTGT GGGCCTTCTG GGACCCGTTC GGCAACGTCA ACAAGATGCC GGTGGCGCTG GTGAACGCCG ACAAGGGCAC TATCGTCAAG GATCAGAAGA TCAACGCGGG CGCCGAGGTC GAGAAGTCGC TCAAGGACGA CAAGAGCCTG GACTGGAACA TCGTCTCGCC GGCGGAGGCG CGGCAGGGAG TCAAGGACGG CACGTACTAC TTCATGCTGG AGCTGCCGGA GAACTTCTCC TCGGCGATCG CCTCGCCGAT GACGGGGAAG CCCGAGAAGG CGCAGCTGCG GGCCGTCTAC AACGATGCCA ACAACTACAT CAGCTCGACC ATCGGACAGA CCGCGATGGC GCAGGTGCTC AACGCGGTCT CGACCAGGGT GTCCGGGCAG GCCGTCAACC AGGTGCTGTC CATGATGATC ACGGCCGGAG CGGGGATCGA GCAGGCCGCC GACGGGGCGG GGAAGCTGGC CGACGGGCTG GTGACCGCGA ACGACGGTGC CGCCAAGCTC GCCGACGGCC TGGCGACTGC GAAAACGGGA TCGGCCCAGT TGAAGACGGG GTCGCAGCAG CTCTCGGACG GCATCAACCA GGCCACCGAT CCGATCCTCG CGGTGACTCG TGCGCTCTCC GGCCTGGGTG GGAACACCGC CGAGTTGCAG CAGAGCGCTA CGGCATTGGT CGAGGCCGCC GATAAGGCCG GGGTGTTCGT CAAGACGCAG GACACGGCGC TCGCCGGTAT CGATGCCGCG ATCCGTCAGC TCGCCGCGAG CCCCGACCCA GCTGCCCGAC AGGCCGCGGA CGGCCTACGC GCGATACGCG GGCAGCTGGC CCAGGCGTCG GTGACCGAGC AGACGAAACA GCAGATCATC GCCGCCCAGA ACGGGGCGAT CGCGCTCACC GAACAGCTGC GCACCCCGGG CAGCCCGCTC CAGACCGTAC TGTCGAAGGT GGGCGGCACG GGCGACGATC TGACCGCCAA GCTCACCCAG TTGCGCGACG GTGCCGCCCA GGTGGCCGCC GGTAACGCCC AGCTGGACAA CGGCATCACG CAGCTCTCGT CGGGCGCCGG CCAGCTCGCA ACGGGCACCG TGCAACTGAA GGACGGCAGC CGGGAGCTCG CGACGAAGCT CGGGGAAGGG GCGCAGTCCG TCCCGAAGTG GTCGCAGGAG CAGACCTCCC AGGTGGCTTC GACCATCGGC GGCCCGGTGC AGCTCGACTC GACGCACGAG AACCGGGCCC CGAACTTCGG TACCGGTATG GCGCCGTTCT TCCTCACGCT CGCATTGTTC TTCGGGGCGC TCGTGCTGTG GATGGTGTTC CGGCCCTTGC AGAACCGCGC CATCGCGGCG GAGGTGCTGG CGATCCGGGT GGTGATGGCG AGCTATCTCC CGGCGGCGGC GATCGCGCTG TGCCAGGCCG TGATCCTGTA CTGCGTGGTG CGGTTCGCCC TCGGACTGGA GGTCGCCCAC CCGGTGGGCA TGTTCTTGTT CATGTTGGGC GTCTCCCTCG CCTTCGTGGC GTTCACCCAG GCGGTGAACG CCCTCGTGGG GCCGGCGGTG GGCCGCGTGC TGATCATGGC GCTGCTCATG CTGCAACTGG TGAGTTCCGG CGGCATGTAT CCGGTGGAGA CCACCGCCCG ACCCTTCCAG GTGCTGCACA AGTACGACCC GATGACCTAT GGCGTGAATG GATTGCGCCA GTTGGTCTCC GGCGGAATCG ACCACCGGCT CTGGCAGGCG GTCGCCGTGT TGGTGTTCCT GTGGGTCGTC TCCGTGACCG TCTCGTGCCT CTCCGCGAGA CGGAACCGGT TGTGGAACCT CGACCGGTTG ATGCCCGCGA TCAAGATCTG A
|
Protein sequence | MSDKKSPKRS NTVPIAAGLA FGSEIKRFGR SRMTRAAIVV LMLLPLVYGA LYLWAFWDPF GNVNKMPVAL VNADKGTIVK DQKINAGAEV EKSLKDDKSL DWNIVSPAEA RQGVKDGTYY FMLELPENFS SAIASPMTGK PEKAQLRAVY NDANNYISST IGQTAMAQVL NAVSTRVSGQ AVNQVLSMMI TAGAGIEQAA DGAGKLADGL VTANDGAAKL ADGLATAKTG SAQLKTGSQQ LSDGINQATD PILAVTRALS GLGGNTAELQ QSATALVEAA DKAGVFVKTQ DTALAGIDAA IRQLAASPDP AARQAADGLR AIRGQLAQAS VTEQTKQQII AAQNGAIALT EQLRTPGSPL QTVLSKVGGT GDDLTAKLTQ LRDGAAQVAA GNAQLDNGIT QLSSGAGQLA TGTVQLKDGS RELATKLGEG AQSVPKWSQE QTSQVASTIG GPVQLDSTHE NRAPNFGTGM APFFLTLALF FGALVLWMVF RPLQNRAIAA EVLAIRVVMA SYLPAAAIAL CQAVILYCVV RFALGLEVAH PVGMFLFMLG VSLAFVAFTQ AVNALVGPAV GRVLIMALLM LQLVSSGGMY PVETTARPFQ VLHKYDPMTY GVNGLRQLVS GGIDHRLWQA VAVLVFLWVV SVTVSCLSAR RNRLWNLDRL MPAIKI
|
| |