Gene Namu_4941 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNamu_4941 
Symbol 
ID8450572 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNakamurella multipartita DSM 44233 
KingdomBacteria 
Replicon accessionNC_013235 
Strand
Start bp5518328 
End bp5519506 
Gene Length1179 bp 
Protein Length392 aa 
Translation table11 
GC content75% 
IMG OID645043980 
Producthypothetical protein 
Protein accessionYP_003204204 
Protein GI258655048 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones55 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones41 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGGGCAAC GCATCACCGT CGAGCAGCGC CGGGCCCGGC TCGGCCTCCG CCATGGCCTG 
GCCGCCCGAT CGGTCGGCCG CCCGGCCGGC GCGGTTGCCG CCGACGTGCT GGTACTGCAC
GCGACCGACC CGGCCACGGT CTACCTCTCG GTGGCGGCGC GCGCGGCGGA CCTGGCTCCG
GACGACCTGG GCCGCGCGCT GTACGAGGAC CGGTCGCTGG TCCGGATGCT GGGCATGCGC
CGGACGATGT TCGTCGTGCC GTCGGACCTG GTGGCGCTGG TGCAGCGATC GTCGTCCGAC
GCGGTCGCTG CTCGGCTGCG GGCGGCGCTG ATCAAGGACC TCACGGCGGT GGTGCAGGCA
CCGGGCGCCT GGCTGGCCGA CGTGGAGGGG TCGGTGCACG AACTCGTGCG GACCACCGGC
GGGTCGCCGG CGACGGCGCT GTCCACCGCC GAGCCGCGGC TGCGGACCAA GCTCGTCTAC
GCCGAGGGCA AGGCCTACGG CGGGTCGAGC ACCATCACCA CCCGGGTGTT GAACCTGATG
GCCGCCGACG GGCTGGTCGT GCGCGGGCGC ACCAAGGGTG CCTGGACCGG TGCCCAGTAC
GAGTGGGGGC CGATCGAGAG CTGGTTCCCG AACGGCATCG CCGAGCTCGA TCCGGCGGTG
GCTCGGGCCG GGCTGGTGCG GGCCTGGCTG GCCCGGTTCG GGCCGGCCAC GGTGGCCGAC
GTCGCCTGGT GGACCGGGTG GAACGGCCGC GACACCAAGG CCGCGCTGGC CGCCGCCGGC
GCCGTCGACA TCGACCTCGA CGACGGGCCC GGGGCCGTGC TCGCCGCGGA TCTGGATCCG
GTCCCGGTCC CGGCGCCGTG GGTGGCCCTG CTGCCCGCGC TCGATCCGAC GCCGATGGGC
TGGATCGAGC GCGACTGGTA CTTCCCGCCC GAGTTCAAAC CGCTGCTGTT CGACCGCACC
GGCAACATCG GGCCGACCGT GTGGTGTGAC GGCCGGGTGG TCGGCGGCTG GGCGCAGCGA
CCCTCCGGTG AGGTGGTGAC CCGGCTGCTG ACCGACATCG GCGCGGCGGC CGGCGCCGCC
GTCGCGGCCG AGGCGGCCCG CCTGCAGGAG TGGATCGGGC CGGCCCGGGT CATCCCCAAG
TTCCGGGTGC CGCTGGACCG CGACCTGGTC GCCGGCTGA
 
Protein sequence
MGQRITVEQR RARLGLRHGL AARSVGRPAG AVAADVLVLH ATDPATVYLS VAARAADLAP 
DDLGRALYED RSLVRMLGMR RTMFVVPSDL VALVQRSSSD AVAARLRAAL IKDLTAVVQA
PGAWLADVEG SVHELVRTTG GSPATALSTA EPRLRTKLVY AEGKAYGGSS TITTRVLNLM
AADGLVVRGR TKGAWTGAQY EWGPIESWFP NGIAELDPAV ARAGLVRAWL ARFGPATVAD
VAWWTGWNGR DTKAALAAAG AVDIDLDDGP GAVLAADLDP VPVPAPWVAL LPALDPTPMG
WIERDWYFPP EFKPLLFDRT GNIGPTVWCD GRVVGGWAQR PSGEVVTRLL TDIGAAAGAA
VAAEAARLQE WIGPARVIPK FRVPLDRDLV AG