Gene Namu_4105 details


Gene Information

Locus tag: Namu_4105
Symbol:
ID: 8449728
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nakamurella multipartita DSM 44233
Kingdom: Bacteria
Replicon accession: NC_013235
Strand:
Start bp: 4531634
End bp: 4532656
Gene Length: 1023 bp
Protein Length: 340 aa
Translation table: 11
GC content: 73%
IMG OID: 645043151
Product: oxidoreductase domain protein
Protein accession: YP_003203383
Protein GI: 258654227
COG category: [R] General function prediction only
COG ID: [COG0673] Predicted dehydrogenases and related proteins
TIGRFAM ID:
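
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon is a stop codon not counted in the protein length. The variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 4531634
end_bp = 4532656

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and encodes no residue.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length)     # 1023
print(protein_length)  # 340
```

Both results agree with the listed Gene Length (1023 bp) and Protein Length (340 aa).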


Plasmid Coverage information

Num covering plasmid clones: 29
Plasmid unclonability p-value: 0.416688
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 10
Fosmid unclonability p-value: 0.0136501
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGATCCGCT GGGGATTGAT CGGCGGCAGC GACATCGCCG CGACCCGGAT GATTCCGGCC 
CTGCGGGCCC TGGGGCAGTC CCCGGTGGCG GTGAGCAGCA GCAGCGCCGA GCGGGCCGAA
CTGTTCGCCG GCCGGCACGA GATCGCCCAC GCCTGCCGCG ATGTCGACGA GCTACTGGCC
CGGGACGACA TCGACGCGGT GTACATCAGC AGCCTCAACC GGTTGCACGC CGAGCACACC
ATCGCCGCCG CGGCCGCCGG CAAGCACGTG CTGTGCGAGA AGCCGGTCGC CCTGGACGTC
GCCGACGCCG CGGCGATGGT CGCCGCGTGC GATCGGGCCG CGGTGGTCTT CGCGGTCAAC
CACCATCTGC CCGCGCACAC GAGCAACACC GTGATCCGCC AGCTCGTCGC CGACGGGGCT
GTGGGCGAGG TCAGATCGAT CCGCGCGTTC TTCGCCTACG AGCTGGCCCC GCGGCTGCGC
GGCTGGCGGT TGACCGACCC GGCGGTCGGC GGCCCGATCC TCGACCTGGT CCCGCACGTG
GCGTCGGTGG TCAACAAGAT CGCCGGGACG CCGTCGTCGG CCGTCGCGAT CGCCGTCCGG
CAAGGCACCT GGGACGGGCC GGCACCCGAC GGTGCGGCAC TGCCCGAGGA CACCTGCATG
GCGGTGGTCC GCTACCCCGA CGACGTGCTC GTCCAGATCC ACGTCGGCTG GGCGACGCCG
CATGCCCGCA ACGGTCTGGA GGTCAACGGC AGCACCGGGT CCGTCGTCGG CACCGGCGTG
CTGTGGGCCG ACCCGATCGG CGCCGTGACC GTGGTGGACA GCGACGGGCG GCGCGAGATC
GCGCTCGAGC AGCACGTCGA TCCGTACCAG GAAACGCTGT CGGCCTTCGC GCGGGCGGTG
ACCGACGGCA CCCCACCGGT GGTGAGCGGC CGCGAGGCGG CCACCGCCCT GGCGCTGACC
CTGGCGGTCC GCCGGGCCGC GGCCAGCGGG ACCACGGAGC CGGTCGAGCT CGCATCCCCC
TGA
 
Protein sequence
MIRWGLIGGS DIAATRMIPA LRALGQSPVA VSSSSAERAE LFAGRHEIAH ACRDVDELLA 
RDDIDAVYIS SLNRLHAEHT IAAAAAGKHV LCEKPVALDV ADAAAMVAAC DRAAVVFAVN
HHLPAHTSNT VIRQLVADGA VGEVRSIRAF FAYELAPRLR GWRLTDPAVG GPILDLVPHV
ASVVNKIAGT PSSAVAIAVR QGTWDGPAPD GAALPEDTCM AVVRYPDDVL VQIHVGWATP
HARNGLEVNG STGSVVGTGV LWADPIGAVT VVDSDGRREI ALEQHVDPYQ ETLSAFARAV
TDGTPPVVSG REAATALALT LAVRRAAASG TTEPVELASP
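
The relationship between the gene sequence and the protein sequence can be reproduced by translating codons with translation table 11. The following is a minimal sketch, not the database's own pipeline. It assumes the spaces in the listing are display grouping only, and it uses only the first 60 bases of the gene as a sample. The names `FRAGMENT`, `translate`, and `gc_fraction` are hypothetical helpers introduced for illustration. Table 11 assigns the same amino acids to codons as the standard code; its differences concern alternative start codons, which do not matter here because the gene begins with ATG.

```python
# First 60 bases of the gene sequence listed above.
# Assumption: the spaces are display grouping and carry no meaning.
FRAGMENT = "ATGATCCGCT GGGGATTGAT CGGCGGCAGC GACATCGCCG CGACCCGGAT GATTCCGGCC"

# Codon table in NCBI order (bases T, C, A, G); '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove whitespace and upper-case a sequence string."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate DNA codon by codon, stopping at the first stop codon."""
    dna = clean(seq)
    residues = []
    for i in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[i:i + 3]]
        if amino_acid == "*":
            break
        residues.append(amino_acid)
    return "".join(residues)


def gc_fraction(seq: str) -> float:
    """Return the fraction of bases that are G or C."""
    dna = clean(seq)
    return (dna.count("G") + dna.count("C")) / len(dna)


print(translate(FRAGMENT))  # MIRWGLIGGSDIAATRMIPA
```

The printed residues match the first 20 residues of the protein sequence above. Passing the full gene sequence to `gc_fraction` gives a value that can be compared with the listed GC content.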