Gene Information |
Locus tag | Namu_4090 |
Symbol | |
ID | 8449713 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nakamurella multipartita DSM 44233 |
Kingdom | Bacteria |
Replicon accession | NC_013235 |
Strand | + |
Start bp | 4516118 |
End bp | 4518055 |
Gene Length | 1938 bp |
Protein Length | 645 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | 645043137 |
Product | Xylose isomerase domain protein TIM barrel |
Protein accession | YP_003203369 |
Protein GI | 258654213 |
COG category | [E] Amino acid transport and metabolism [R] General function prediction only |
COG ID | [COG3185] 4-hydroxyphenylpyruvate dioxygenase and related hemolysins |
TIGRFAM ID | [TIGR01263] 4-hydroxyphenylpyruvate dioxygenase |
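The coordinate and length fields above are mutually consistent, and the check can be sketched in a few lines. This is a minimal illustration, assuming that Start bp and End bp are 1-based inclusive coordinates and that the CDS length includes the stop codon; both assumptions agree with the values in this record but are not stated by it. The function names are hypothetical and do not belong to any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int, includes_stop: bool = True) -> int:
    """Number of residues encoded by an unspliced CDS.

    Assumes the CDS is a whole number of codons and, by default,
    that the final codon is a stop codon that encodes no residue.
    """
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = cds_length_bp // 3
    return codons - 1 if includes_stop else codons


# Values taken from the record for Namu_4090.
length = gene_length_bp(4516118, 4518055)
print(length, protein_length_aa(length))
```

With the record's coordinates this reproduces the listed Gene Length of 1938 bp and Protein Length of 645 aa.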
|
|
Plasmid Coverage information |
Num covering plasmid clones | 31 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 9 |
Fosmid unclonability p-value | 0.00816269 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | GTGGCAACCG CCACATCCCG TCCGCCGCTG CGTACCGCGA TCGCCACCGT TTGCATCTCC GGCACGCTGG AGGACAAGCT CGCCGCCGCC GCGGCGGCCG GCTTCGACGG CGTGGAGATC TTCGAACCGG ACTTCGTGGT GTCCTCGTCG TCGGCGCGCG AGGTGCGGCA GCGGTGCGCC GACCTGGGCC TGTCGATCGA TCTGTACCAA CCGTTCCGGG ACTTCGATTC CACCGACCCG GCGCAGGTGG AGCTCAACCT GCGCCGCGCG GACCGCAAGT TCGACGTGAT GGAAGCCCTG GGCACCGACC TGATCCTGGT CTGCTCGGCG GTCTCCCCGA CCGCCGTCGA CGACAACGCG GTGATCGCCG AGCAACTGCA CCGGCTGGCC GAACGGGCTC GGCAGCGGGG CATGCGCATC TCCTACGAGG CGCTGGCCTG GGGCACCAAG GTCAACACCT ACGACCGGTC CTGGGACATC GTCCGGGCCG CCGATCACCC GGCGCTGGGC GTCTGCCTGG ACAGCTTCCA CATCCTGTCC CGCGGGTCGG ACCCGGCCGG CATCGAGCAG ATCCCGGGCG AGAAGATCTT CTTCCTGCAG CTGGCCGACG CGCCGTTCAT GAACATGGAC GTGTTGCAGT GGAGCCGGCA CCACCGGCTC TTCCCCGAGC AGGGCACGTT CGACCTGCCC GCCTTCCTGG GCCATGTGTT GACCGCCGGT TACACCGGTC CGCTCTCGCT CGAGGTGTTC AACGACGTGT TCCGCCAGTC CGACCCGGGC CGGGCGGCGG TGGACGCGCA CCGCTCGCTG CTGGCCCTGT ACGAGTCGAC CGTGGGCAGC GTGCCGCTGG ACCGGGCCGG CTCGGTCACC GGCGATTCCG GCGCGGTCGG GACCTCGGGG TCGGACCGGC AGGTGCCGCC GGCCCCGGAA CTGGGCGGCT TCGCCTTCGC CGAGTTGGCC GTCGACGACG AGAGCGGCCC GGCGGTCGCC GCCACCCTGT CCTGCCTGGG CTTCGTGCAC TCCGGGCAGC ACCGCAGCAA GCCGGTGCAG CTGTGGTCGC AGGGTGATGC GCGGGTGCTG CTCAACGCCG CCCGCGGGCT CGACGAACAC CCGACCGGGG CCGCGGTAGC GGCCATCGGC TTCGAGACCC GGGACCCGGC GGCCGCGGCC GTGCGCGCCC AGGCCATGCT CGCCCCGCTG CTGCCCCGCC GGGTCGGCGC CGGCGAGGCC GACCTGTCCG CGGTGGCCGC CCCCGACGAG ACCGCGGTGT TCTTCTGCCG GACCGGGGCC GCGGACGCGA GCAGCTGGAT CGGCGATTTC GAACCGACCG CCGCCGCGGC CGGGACCTCC CCATTGGGGC TGCGCCGCAT CGATCACGTG GCCCTCACCC AGCCCTACGA CCGGTTCGAC GAGGCCAGCC TGTTCTTCCG CTGCGTGCTG GGCCTGCCGA CCCGGCACAG CTCGGAGATC GCCGCGCCGT TCGGCCTGGT CCGCAACCGG ACCGCGGCCA ACGTCGACGG CTCGGTCCGG ATCGGGATGA CCGTCTCGGT GCTGCGCCGC GGCGGCCAAT GGGCGCCCGG GGTGACCGAC CCGCAACATG TGGCGTTCGC CACCGACGAC ATCGTCGCGG CCGCTCGAGC GGCCGTCGCG GCCGGCGCCC CGGTACTGCC CGTGCCGGCC AACTACTACG ACGACCTGGA CGCCCGGCTG GCCCTGCCGG CCGAGCAACT GGCCGCGCTG CGCGAGCTGA ACCTGTTGTA CGACCGCACA TCTGACGGCG AGTTCTGGCA CTTCTACACC 
GCCGTCCTGG GCGGGCGGGT GTTCTTCGAG GTGGTCCAGC GGATCGGCGA CTACCAGGGC TACGGCGAGG TCAACTCGCC GGTACGGATG GCCGCGCACC GTCGGCAACG ACGCGCCACC ACGTCCATCT CGTCCTGA
|
Protein sequence | MATATSRPPL RTAIATVCIS GTLEDKLAAA AAAGFDGVEI FEPDFVVSSS SAREVRQRCA DLGLSIDLYQ PFRDFDSTDP AQVELNLRRA DRKFDVMEAL GTDLILVCSA VSPTAVDDNA VIAEQLHRLA ERARQRGMRI SYEALAWGTK VNTYDRSWDI VRAADHPALG VCLDSFHILS RGSDPAGIEQ IPGEKIFFLQ LADAPFMNMD VLQWSRHHRL FPEQGTFDLP AFLGHVLTAG YTGPLSLEVF NDVFRQSDPG RAAVDAHRSL LALYESTVGS VPLDRAGSVT GDSGAVGTSG SDRQVPPAPE LGGFAFAELA VDDESGPAVA ATLSCLGFVH SGQHRSKPVQ LWSQGDARVL LNAARGLDEH PTGAAVAAIG FETRDPAAAA VRAQAMLAPL LPRRVGAGEA DLSAVAAPDE TAVFFCRTGA ADASSWIGDF EPTAAAAGTS PLGLRRIDHV ALTQPYDRFD EASLFFRCVL GLPTRHSSEI AAPFGLVRNR TAANVDGSVR IGMTVSVLRR GGQWAPGVTD PQHVAFATDD IVAAARAAVA AGAPVLPVPA NYYDDLDARL ALPAEQLAAL RELNLLYDRT SDGEFWHFYT AVLGGRVFFE VVQRIGDYQG YGEVNSPVRM AAHRRQRRAT TSISS
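The sequence fields are stored in space-separated blocks of ten characters, so they need to be cleaned before any calculation. The sketch below shows one way to compute a GC fraction and to read off the first and last codons. The helper names are hypothetical, and how the database rounds its reported GC content is not stated in the record. The gene starts with GTG, which is a permitted start codon under translation table 11 and is read as methionine at the initiation position, matching the leading M of the protein sequence.

```python
def clean_sequence(raw: str) -> str:
    """Remove the block spacing and line breaks used in the record."""
    return "".join(raw.split()).upper()


def gc_fraction(raw: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    seq = clean_sequence(raw)
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


def terminal_codons(raw: str) -> tuple[str, str]:
    """Return the first and last three bases of the sequence."""
    seq = clean_sequence(raw)
    if len(seq) < 6:
        raise ValueError("sequence too short to hold two codons")
    return seq[:3], seq[-3:]


# Illustration only: the first two blocks and the last block of the
# gene sequence above, not the full 1938 bp sequence.
fragment = "GTGGCAACCG CCACATCCCG CGTCCTGA"
print(gc_fraction(fragment))
print(terminal_codons(fragment))
```

Passing the complete Gene sequence field to `gc_fraction` would give the value to compare against the reported GC content.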