Gene Namu_5348 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNamu_5348 
Symbol 
ID8450981 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNakamurella multipartita DSM 44233 
KingdomBacteria 
Replicon accessionNC_013235 
Strand
Start bp5984743 
End bp5985867 
Gene Length1125 bp 
Protein Length374 aa 
Translation table11 
GC content69% 
IMG OID645044379 
Productperiplasmic binding protein/LacI transcriptional regulator 
Protein accessionYP_003204601 
Protein GI258655445 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG4213] ABC-type xylose transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones62 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones62 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGTCTGA AGTCCCCTGT CATGGTCGGA ATGGTGGCGG CCGGCGCGCT GCTGCTGGCC 
GCTTGTGGCT CCAGCTCCTC CTCATCGGGC ACCACCTCGG CGGCGGCCTC CGGGAGCGGC
ACCAGTGCGG CGGCCTCGGG TTCGGCGGCC GGCGGCGCGA CCGGCAAGGT CGGCGTCATC
CTGCCGGACA CCAAGTCCTC CGCTCGCTGG GAGACCCAGG ACCGGCCGGC GCTGGAGAAG
GCTTTCCAGG CTGCGGGTGT CGAGTTCGAC ATCCAGAACG CCAACGGTGA CAAGGCGGCC
ATGGCCACCA TCGCCGACCA GATGATCGCC AACGGGGCGA CCGTGCTGGC CATCGTGAAC
CTGGACAACG AGTCCGGCGC GGCGATCGAG AAGAAGGCCG CCAGCCAGGG CGTGCAGACC
ATCGACTACG ACCGGCTGAC CCTCGGCGGC GGCGCGGACT ACTACGTCTC GTTCGACAAC
ACCGAGGTCG GCAAGCTGCA GGGCACCGGC CTGGCCAAGT GCCTGGGCAG CGGCGACAAG
AAGATCGTCT ACCTGAACGG CTCGCCGTCG GACTCGAACG CGACCGCGTT CTCGGCCGGC
GCGCACTCGG TGCTCGACCC GATGACCAAC TACACCGTGG TCGCCGAGCA GGCCGTGCCG
GACTGGGACA ACCAGCAGGC CGGCGTGATC TACGAGCAGA TGTACACCGC GCAGGGCGGC
AAGATCGACG GCGTGCTGGC GGCCAATGAC GGCCTGGGCA ACGCGGCCAT CGCGATCAAC
AAGAAGAACG GCCTGCAGAT CCCGGTCACC GGGCAGGACG CCACCGTCCA GGGCCTGCAG
AACATCCTGG CCGGCGACCA GTGCATGACG GTCTTCAAGG ACACCAACAA GGAGGCCGCG
GCGCTGGCCA AGGTCGCCAT CGCGCTGGCC CAGGGCCAGA CCCCGCAGAC CACCGGCACG
GTCAAGGACA CCACCGGCAA CCGGGACGTG GCCGCGATCC TGGAGACCCC GGAGGCCATC
TACAAGGAGA ACGTCAAGGA CGTCGTCACG GCCGGCGGCA CCACCGCCGC CGAACTGTGC
ACCGGCGCCT ACGCCGCCGC CTGCACCGAG CTGGGCATCA GCTGA
 
Protein sequence
MRLKSPVMVG MVAAGALLLA ACGSSSSSSG TTSAAASGSG TSAAASGSAA GGATGKVGVI 
LPDTKSSARW ETQDRPALEK AFQAAGVEFD IQNANGDKAA MATIADQMIA NGATVLAIVN
LDNESGAAIE KKAASQGVQT IDYDRLTLGG GADYYVSFDN TEVGKLQGTG LAKCLGSGDK
KIVYLNGSPS DSNATAFSAG AHSVLDPMTN YTVVAEQAVP DWDNQQAGVI YEQMYTAQGG
KIDGVLAAND GLGNAAIAIN KKNGLQIPVT GQDATVQGLQ NILAGDQCMT VFKDTNKEAA
ALAKVAIALA QGQTPQTTGT VKDTTGNRDV AAILETPEAI YKENVKDVVT AGGTTAAELC
TGAYAAACTE LGIS