Gene Namu_0435 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNamu_0435 
Symbol 
ID8446016 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNakamurella multipartita DSM 44233 
KingdomBacteria 
Replicon accessionNC_013235 
Strand
Start bp471851 
End bp473371 
Gene Length1521 bp 
Protein Length506 aa 
Translation table11 
GC content55% 
IMG OID645039570 
Producthypothetical protein 
Protein accessionYP_003199844 
Protein GI258650688 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones38 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones39 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGATGCTA GCCTGCCGTC TTGGGATCAA TTGATTCTCA ATCTGGCGAG CGAGAACCTT 
GGCGACAACG AACGCCGCGT GTTTCTGACC CTGAACTATG ATGGCCTCGA ACGGCGCGCC
GAGACGATCT TGCATCTGTC AAAGAAGGCG CGCCCAGCCA AGCAGTTCGA CGAGCTGATC
CGTGACTCAC TACTGAAGGA CCCCGACGAA CTCGCGCCTG GCGAGCTCGC CGAGGGGATA
GCGCGGCTCT CTCAAGCTTA CCCACGGCGG ACAAGTATCC TGACAACCAA TTACGACTGG
ATTCTTGAAC TTGCAATCGA AGGACTCGGA ATCAAGGCAT CCGATTCACA TTCCTTTGAT
TCGTGGGATG AATGGGAGCA TCTCAATGAC GCCGACCACC GGTCGAACGT TATGCACGTT
CACGGCATGC TGTCGCGGCC ATCAGTAGGA AAACGGGAGC CGCTGGTTCT TTCAGAAAAC
GACTTCCTGA TGCACGGGCC CGAGATTCGG ACTCGCCTGC TAGATTCGCT GCGTGGAAAG
CTTGTCGTCA TGCTCGGTGT GAGCCTAACC GACAGCAACT TGCTGGCGCC TCTCCATCAA
TTGAACGGGG CCGACGGAGA GAGATACGTT GTTATAGTCC CTCCACTATG CCATAATAAC
CTGACGCAGC TCGAGTGCGC CGAGTACGCA GTCGCCCATT CTGAATCCTT AGCCAAGTAT
CTTAACGTCA GGCCCATCAT CTTGAAAAGC TTTGGGCAAG TCGCTCAGCT GATTAGTGAC
CTTGGGTTGG CGGCTCGTGA GCCAGCATTA TATCGACGGC CTAGCGGTAG TCTTCGAGTT
TCGCAATCAC TAATGTACGG GACTAGGTTC ACTGCCGCCC TAAAGGACGC CTACACTTCG
GTGGGGGCTA GCGCTCGCTC CGGCAACATG CGGGATTGGG ACGCGGTTGG CTTGAGCAAT
TTCATGCATG AACTCGCGAA TTCTCGCGGG GGGCCCGTAA AGTTCCTTGA CAAGATCCGG
CTGCATCACC GAGGGAAATG CGATGTTTCG GAGAACCTCG GGGTCTTCCT CTGGCTCCGG
GACCTCCCCA ATCGTCCGGA ATCCATATAT GGGCTGCGCC TGATTGCTAG TTCAGCGTAC
GCTCACTGGA AGTCCTGGTC GTCATTTAGG GTTGAGCCGA TTCGCAGCGA CTCGCGTCAC
GCAGCCGTTC AGGCAGTCTT CTTTAATCAC GCCCAAGGTG TCAATATCGA CCCTGATTCC
CACTCCGGAG CCTGGAAAGG GGCGTTCGCT ATCCCATTAT CAATATATGA CTATCGATCC
ACTGCAACGG TCAGAGGCTG GTCGCTTGAC AGGCTTACTA TCGGGGCATT GGCAGTAAAT
AGCGATCACT TTGTCGACGC TTCCAGTCTC GATTCTCCCA ACCCGAATCA GTTATCTGCA
TTGTCTGTTC TTACTCAACG CGAGCTCGAG GGCTTTGCAG CGTCTCTGTA CAAGATGGCG
GAGAGGATCT TCGGCGACTA G
 
Protein sequence
MDASLPSWDQ LILNLASENL GDNERRVFLT LNYDGLERRA ETILHLSKKA RPAKQFDELI 
RDSLLKDPDE LAPGELAEGI ARLSQAYPRR TSILTTNYDW ILELAIEGLG IKASDSHSFD
SWDEWEHLND ADHRSNVMHV HGMLSRPSVG KREPLVLSEN DFLMHGPEIR TRLLDSLRGK
LVVMLGVSLT DSNLLAPLHQ LNGADGERYV VIVPPLCHNN LTQLECAEYA VAHSESLAKY
LNVRPIILKS FGQVAQLISD LGLAAREPAL YRRPSGSLRV SQSLMYGTRF TAALKDAYTS
VGASARSGNM RDWDAVGLSN FMHELANSRG GPVKFLDKIR LHHRGKCDVS ENLGVFLWLR
DLPNRPESIY GLRLIASSAY AHWKSWSSFR VEPIRSDSRH AAVQAVFFNH AQGVNIDPDS
HSGAWKGAFA IPLSIYDYRS TATVRGWSLD RLTIGALAVN SDHFVDASSL DSPNPNQLSA
LSVLTQRELE GFAASLYKMA ERIFGD