Gene Namu_2139 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNamu_2139 
Symbol 
ID8447750 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNakamurella multipartita DSM 44233 
KingdomBacteria 
Replicon accessionNC_013235 
Strand
Start bp2358163 
End bp2359323 
Gene Length1161 bp 
Protein Length386 aa 
Translation table11 
GC content66% 
IMG OID645041262 
ProductUBA/THIF-type NAD/FAD binding protein 
Protein accessionYP_003201506 
Protein GI258652350 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG0476] Dinucleotide-utilizing enzymes involved in molybdopterin and thiamine biosynthesis family 2 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.000312689 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.0185629 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGACATTGC CACCGCTGGT GGAGCCGGCC GAGTCGCTGA CCAAGGCCGA GGTGGAACGC 
TATTCGCGCC ACCTGATCAT CCCGGACGTC GGCATGATCG GTCAGAAGCG ACTGAAGAAC
GCCAAGGTGC TCGTGGTCGG AGCCGGTGGG TTGGGTTCGC CCGCGCTGCT GTACCTGGCC
GCGGCCGGCG TCGGCACGCT GGGCATTCTG GACTTCGACA CCGTCGATGA GTCGAACCTG
CAGCGGCAGG TGATCCACGG CCAGTCCGAC ATCGGCAAGT CCAAGGCGCT GTCGGCGGCC
GAGTCGATCG CCGAGGTCAA TCCGTACGTG ACGGTCAATC TGCACACCGA GCGGCTGGAC
TCCGGCAACG CGTTGGAGAT CTTCGCCCCG TACGACCTGA TCCTGGACGG GACCGACAAC
TTCGCCACCC GGTATCTGGT CAACGACGCT TGCGTGCTGC TGGGCAAGCC CTACGTGTGG
GGTTCGATCT TCCGTTTCGA AGGCCAGGTC AGCGTGTTCT GGGCCGAGTA CGGCCCCCAG
TACCGCGACC TGTACCCCGA GCCGCCGCCG CCCGGCATGG TGCCCTCGTG CGCCGAGGGC
GGCGTCCTCG GTGTGCTGTG TGCCTCGATC GGCTCGGTCA TGGTCACCGA GGCGATCAAA
CTGATCACCG GCATCGGCGA GCCGCTGCTC GGCCGGCTGA TGGTCTATGA CGCGTTGGAG
ATGACCTACC GGACCGTACG CATCCGCCGC GACCCGGCCG GTGAACCGAT CACCGGGCTC
ATCGACTACG ACGCCTTCTG CGGAACCCTG TCCGACGAGG CGGCCGAGGC CGCGATCAGC
CACACCATCT CGGCGCGCGA CCTGAAGGCC AAGATGGACG CGGGCGATGA CTTCGTGCTG
ATCGACGTGC GCGAGCAGAA CGAGTACGAG ATCGTCTCCA TCCCCGGCTC GGTGCTCATC
CCCAAGGGGG ACATCATCTC CGGCGAGGCC CTGTCCTCGT TGCCGATGGA CCGGCCGCTG
GTGCTGCACT GCAAGTCCGG AGCGCGATCG GCCGAGGCGT TGGCCGTGCT GCACAAGGCG
GGCTTCGGGG ATGCGGTGCA CGTCGGTGGT GGGGTCCTGG CCTGGATCAA GCAGGTGGAT
CCGAGTCTGC CCACGTACTG A
 
Protein sequence
MTLPPLVEPA ESLTKAEVER YSRHLIIPDV GMIGQKRLKN AKVLVVGAGG LGSPALLYLA 
AAGVGTLGIL DFDTVDESNL QRQVIHGQSD IGKSKALSAA ESIAEVNPYV TVNLHTERLD
SGNALEIFAP YDLILDGTDN FATRYLVNDA CVLLGKPYVW GSIFRFEGQV SVFWAEYGPQ
YRDLYPEPPP PGMVPSCAEG GVLGVLCASI GSVMVTEAIK LITGIGEPLL GRLMVYDALE
MTYRTVRIRR DPAGEPITGL IDYDAFCGTL SDEAAEAAIS HTISARDLKA KMDAGDDFVL
IDVREQNEYE IVSIPGSVLI PKGDIISGEA LSSLPMDRPL VLHCKSGARS AEALAVLHKA
GFGDAVHVGG GVLAWIKQVD PSLPTY