Gene Namu_4090 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNamu_4090 
Symbol 
ID8449713 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNakamurella multipartita DSM 44233 
KingdomBacteria 
Replicon accessionNC_013235 
Strand
Start bp4516118 
End bp4518055 
Gene Length1938 bp 
Protein Length645 aa 
Translation table11 
GC content72% 
IMG OID645043137 
ProductXylose isomerase domain protein TIM barrel 
Protein accessionYP_003203369 
Protein GI258654213 
COG category[E] Amino acid transport and metabolism
[R] General function prediction only 
COG ID[COG3185] 4-hydroxyphenylpyruvate dioxygenase and related hemolysins 
TIGRFAM ID[TIGR01263] 4-hydroxyphenylpyruvate dioxygenase 


Plasmid Coverage information

Num covering plasmid clones31 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.00816269 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
GTGGCAACCG CCACATCCCG TCCGCCGCTG CGTACCGCGA TCGCCACCGT TTGCATCTCC 
GGCACGCTGG AGGACAAGCT CGCCGCCGCC GCGGCGGCCG GCTTCGACGG CGTGGAGATC
TTCGAACCGG ACTTCGTGGT GTCCTCGTCG TCGGCGCGCG AGGTGCGGCA GCGGTGCGCC
GACCTGGGCC TGTCGATCGA TCTGTACCAA CCGTTCCGGG ACTTCGATTC CACCGACCCG
GCGCAGGTGG AGCTCAACCT GCGCCGCGCG GACCGCAAGT TCGACGTGAT GGAAGCCCTG
GGCACCGACC TGATCCTGGT CTGCTCGGCG GTCTCCCCGA CCGCCGTCGA CGACAACGCG
GTGATCGCCG AGCAACTGCA CCGGCTGGCC GAACGGGCTC GGCAGCGGGG CATGCGCATC
TCCTACGAGG CGCTGGCCTG GGGCACCAAG GTCAACACCT ACGACCGGTC CTGGGACATC
GTCCGGGCCG CCGATCACCC GGCGCTGGGC GTCTGCCTGG ACAGCTTCCA CATCCTGTCC
CGCGGGTCGG ACCCGGCCGG CATCGAGCAG ATCCCGGGCG AGAAGATCTT CTTCCTGCAG
CTGGCCGACG CGCCGTTCAT GAACATGGAC GTGTTGCAGT GGAGCCGGCA CCACCGGCTC
TTCCCCGAGC AGGGCACGTT CGACCTGCCC GCCTTCCTGG GCCATGTGTT GACCGCCGGT
TACACCGGTC CGCTCTCGCT CGAGGTGTTC AACGACGTGT TCCGCCAGTC CGACCCGGGC
CGGGCGGCGG TGGACGCGCA CCGCTCGCTG CTGGCCCTGT ACGAGTCGAC CGTGGGCAGC
GTGCCGCTGG ACCGGGCCGG CTCGGTCACC GGCGATTCCG GCGCGGTCGG GACCTCGGGG
TCGGACCGGC AGGTGCCGCC GGCCCCGGAA CTGGGCGGCT TCGCCTTCGC CGAGTTGGCC
GTCGACGACG AGAGCGGCCC GGCGGTCGCC GCCACCCTGT CCTGCCTGGG CTTCGTGCAC
TCCGGGCAGC ACCGCAGCAA GCCGGTGCAG CTGTGGTCGC AGGGTGATGC GCGGGTGCTG
CTCAACGCCG CCCGCGGGCT CGACGAACAC CCGACCGGGG CCGCGGTAGC GGCCATCGGC
TTCGAGACCC GGGACCCGGC GGCCGCGGCC GTGCGCGCCC AGGCCATGCT CGCCCCGCTG
CTGCCCCGCC GGGTCGGCGC CGGCGAGGCC GACCTGTCCG CGGTGGCCGC CCCCGACGAG
ACCGCGGTGT TCTTCTGCCG GACCGGGGCC GCGGACGCGA GCAGCTGGAT CGGCGATTTC
GAACCGACCG CCGCCGCGGC CGGGACCTCC CCATTGGGGC TGCGCCGCAT CGATCACGTG
GCCCTCACCC AGCCCTACGA CCGGTTCGAC GAGGCCAGCC TGTTCTTCCG CTGCGTGCTG
GGCCTGCCGA CCCGGCACAG CTCGGAGATC GCCGCGCCGT TCGGCCTGGT CCGCAACCGG
ACCGCGGCCA ACGTCGACGG CTCGGTCCGG ATCGGGATGA CCGTCTCGGT GCTGCGCCGC
GGCGGCCAAT GGGCGCCCGG GGTGACCGAC CCGCAACATG TGGCGTTCGC CACCGACGAC
ATCGTCGCGG CCGCTCGAGC GGCCGTCGCG GCCGGCGCCC CGGTACTGCC CGTGCCGGCC
AACTACTACG ACGACCTGGA CGCCCGGCTG GCCCTGCCGG CCGAGCAACT GGCCGCGCTG
CGCGAGCTGA ACCTGTTGTA CGACCGCACA TCTGACGGCG AGTTCTGGCA CTTCTACACC
GCCGTCCTGG GCGGGCGGGT GTTCTTCGAG GTGGTCCAGC GGATCGGCGA CTACCAGGGC
TACGGCGAGG TCAACTCGCC GGTACGGATG GCCGCGCACC GTCGGCAACG ACGCGCCACC
ACGTCCATCT CGTCCTGA
 
Protein sequence
MATATSRPPL RTAIATVCIS GTLEDKLAAA AAAGFDGVEI FEPDFVVSSS SAREVRQRCA 
DLGLSIDLYQ PFRDFDSTDP AQVELNLRRA DRKFDVMEAL GTDLILVCSA VSPTAVDDNA
VIAEQLHRLA ERARQRGMRI SYEALAWGTK VNTYDRSWDI VRAADHPALG VCLDSFHILS
RGSDPAGIEQ IPGEKIFFLQ LADAPFMNMD VLQWSRHHRL FPEQGTFDLP AFLGHVLTAG
YTGPLSLEVF NDVFRQSDPG RAAVDAHRSL LALYESTVGS VPLDRAGSVT GDSGAVGTSG
SDRQVPPAPE LGGFAFAELA VDDESGPAVA ATLSCLGFVH SGQHRSKPVQ LWSQGDARVL
LNAARGLDEH PTGAAVAAIG FETRDPAAAA VRAQAMLAPL LPRRVGAGEA DLSAVAAPDE
TAVFFCRTGA ADASSWIGDF EPTAAAAGTS PLGLRRIDHV ALTQPYDRFD EASLFFRCVL
GLPTRHSSEI AAPFGLVRNR TAANVDGSVR IGMTVSVLRR GGQWAPGVTD PQHVAFATDD
IVAAARAAVA AGAPVLPVPA NYYDDLDARL ALPAEQLAAL RELNLLYDRT SDGEFWHFYT
AVLGGRVFFE VVQRIGDYQG YGEVNSPVRM AAHRRQRRAT TSISS