Gene Namu_5234 details

Gene Information

Locus tag: Namu_5234
Symbol:
ID: 8450865
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nakamurella multipartita DSM 44233
Kingdom: Bacteria
Replicon accession: NC_013235
Strand:
Start bp: 5835598
End bp: 5837355
Gene Length: 1758 bp
Protein Length: 585 aa
Translation table: 11
GC content: 70%
IMG OID: 645044265
Product: membrane protein
Protein accession: YP_003204489
Protein GI: 258655333
COG category: [S] Function unknown
COG ID: [COG4425] Predicted membrane protein
TIGRFAM ID:
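
The coordinates and lengths listed above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the gene length includes a single terminal stop codon; the function names `gene_length` and `protein_length` are hypothetical and not part of any database API.

```python
# Values copied from the Gene Information fields above.
START_BP = 5835598
END_BP = 5837355


def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1


length_bp = gene_length(START_BP, END_BP)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # 1758 585
```

Under these assumptions, both results agree with the reported Gene Length (1758 bp) and Protein Length (585 aa).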


Plasmid Coverage information

Num covering plasmid clones: 73
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 48
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGTCGACG TGACATCGAA CGAGGCAACC GCCGCGCAGG ATCGCGAACC GGTCGGAGAC 
GGGCGGTCGA ACGAGCCGGC CGAGCAGTCC GCACCGGCGT CCTCGGGTGC GGTTGCCGAC
CGTGGTCGGC GACCGCGGTT CCGCTATACC CTGCCCGGAT CCTGGACCGC GCTGGTGTTC
GTCTGCCTGG CGTTCACCCC GTCCCTGGTG CCCCGGCCGG GCGCGTTCCA GGGAGTGGTC
GGCGGTTTGA CCGGCGCCAT CGGGTACGGA TTGGGCGTGG CCGGTGCCTG GGTGTGGCGG
CAGTTCGCGG ACCGGCCGGC CCGCGCGGCA CGCCGCTGGT CGTGGTCGGC GTTCGGGATC
GTGGCCGGCG TCGCCCTGGT CACGTCCTAC CTTCTCGGAC AGCGGTGGCA GGACCAGATC
CGGGCGCTGG TGAACGCCGA ACCGCAGGAC CTGGGTTCTC GCCTGATCCT GCCCGTGGTA
GCCGTTTTGG TGTTCGTCGG GCTGGTCGCC GCCGGCCGGG GGATCGCAAA GGTGTACCGA
TGGGCGGCCC GGCGGCTGAG TCGATGGATG GGCGATCGAG CGGCCCGCGT CGTCGGCTGG
CTGCTGGCGG CCGGGCTGAC GGTCGGCCTG GTGTCCGGGG TGCTGGTCGA CGGGGTCCTG
GCGATCACCG ACCGAATGTT CGCCGTCCGC GACACGACGA CCAGCGACAC CGCGGTGCAG
CCGACCACCG GCCTGCGGTC CGGTGGTCCG GGATCGCTGA TCGGCTGGGA CACGTTGGGC
TACCAGGGCC GCAACTTCAC CGGTTCCGGT CCCACTCCCG GGCAGATCCA GGCGTTCATC
GGTGCTCCGG CGCCCGTACC GATTCGCGCC TACGCCGGCC TTGCGTCCGC GCAGGACGTG
CGTGACCGCG CCCGGCTGGC GGTGGCCGAC CTGCAGCGGG CCGGCGGCTT CGACCGCGGC
CACCTGCTCG TCACCGGCAC CACCGGGACG GGCTGGGTGG ATCCGGCGGC GATCGGCGCC
TTCGAATACG AGACCGGAGG TGACAGCGCC GCCGTGGCGA TCCAGTACTC GTACCTGCCG
TCCTGGGCAT CCTTCCTGGT CGACCAGGAC AAAGCCCGGC AGGCCGGCCG AGCGCTGTTC
GATGAGGTCT ATCGGGTCTG GTCCAGCCTT CCCCCCGACC ACCGGCCCAA GCTCTACGGC
TTCGGGCTCA GCCTCGGCTC GTTCATGATG GAGTCCCCGT TCGGCGGCGA CGCGGACATG
GCCAACCGGA CCGACGGCAT CCTGCTGGCC GGTTCGCCGG CGTTCAACCC GTTGAACCGG
GAATTCACCG ACCAGCGGGA CGCGTTAAGT CCGGAAGTAC AGCCGGTCTA CCGCGGCGGC
GAGACCGTCC GGTTCAGCAA CGATCCCGCG GCCTCCATCC CGCCGGACGA TGCGTCCTGG
GACGGCGCCA GGGTGCTGTA CCTGCAGCAC GCTTCGGACC CGATCGTGTG GCTGAGCCCG
GACCTGATTC TGCACCGGCC GGACTGGCTG GTCGAGCCGG CCGGACCTGA TGTCACCGAC
GAGATGATCT GGATACCGTT CGTCACTTTC TGGCAGGTCA CCCTCGACAT GCTCGAACCG
GTGGACACCC CGCCGGGGCA CGGCCACACC TACACGCTGG AGTTCGTCGA GGGCTGGGCC
TCGGTTCTGG AGCCCCCCAA TTGGTCCCCG GCCAAATCGG AGGAACTACG CGCGTTGCTG
ACAGAACTGC CGCACTGA
 
Protein sequence
MVDVTSNEAT AAQDREPVGD GRSNEPAEQS APASSGAVAD RGRRPRFRYT LPGSWTALVF 
VCLAFTPSLV PRPGAFQGVV GGLTGAIGYG LGVAGAWVWR QFADRPARAA RRWSWSAFGI
VAGVALVTSY LLGQRWQDQI RALVNAEPQD LGSRLILPVV AVLVFVGLVA AGRGIAKVYR
WAARRLSRWM GDRAARVVGW LLAAGLTVGL VSGVLVDGVL AITDRMFAVR DTTTSDTAVQ
PTTGLRSGGP GSLIGWDTLG YQGRNFTGSG PTPGQIQAFI GAPAPVPIRA YAGLASAQDV
RDRARLAVAD LQRAGGFDRG HLLVTGTTGT GWVDPAAIGA FEYETGGDSA AVAIQYSYLP
SWASFLVDQD KARQAGRALF DEVYRVWSSL PPDHRPKLYG FGLSLGSFMM ESPFGGDADM
ANRTDGILLA GSPAFNPLNR EFTDQRDALS PEVQPVYRGG ETVRFSNDPA ASIPPDDASW
DGARVLYLQH ASDPIVWLSP DLILHRPDWL VEPAGPDVTD EMIWIPFVTF WQVTLDMLEP
VDTPPGHGHT YTLEFVEGWA SVLEPPNWSP AKSEELRALL TELPH
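
The sequences above are printed in space-separated blocks of ten characters, so they need to be joined before any analysis. The sketch below shows one way to do that and to compute a GC fraction that could be compared with the reported GC content; the helper names `clean_sequence` and `gc_fraction` are hypothetical, and only the first line of the gene sequence is used as a short demonstration.

```python
def clean_sequence(blocked: str) -> str:
    """Remove all whitespace from a sequence printed in blocks."""
    return "".join(blocked.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First line of the gene sequence above, for demonstration only.
# Pass the full 1758 bp sequence to compare with the reported GC content.
first_line = "ATGGTCGACG TGACATCGAA CGAGGCAACC GCCGCGCAGG ATCGCGAACC GGTCGGAGAC"
seq = clean_sequence(first_line)
print(len(seq), round(gc_fraction(seq), 2))
```

The same `clean_sequence` helper works for the protein sequence, since it only strips whitespace and upper-cases the result.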