Gene Namu_4685 details

Gene Information

Locus tag: Namu_4685
Symbol:
ID: 8450315
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nakamurella multipartita DSM 44233
Kingdom: Bacteria
Replicon accession: NC_013235
Strand:
Start bp: 5207622
End bp: 5209217
Gene Length: 1596 bp
Protein Length: 531 aa
Translation table: 11
GC content: 65%
IMG OID: 645043726
Product: binding-protein-dependent transport systems inner membrane component
Protein accession: YP_003203951
Protein GI: 258654795
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG1175] ABC-type sugar transport systems, permease components
TIGRFAM ID:
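
The coordinates and lengths in this record can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the start and end coordinates are 1-based and inclusive, and that the gene length counts the terminal stop codon. Neither assumption is stated in the record, and the variable names are hypothetical.

```python
# Values copied from the Gene Information record.
start_bp = 5207622
end_bp = 5209217
gene_length_bp = 1596
protein_length_aa = 531

# Assumption: coordinates are 1-based and inclusive on the replicon.
span_bp = end_bp - start_bp + 1

# Assumption: the gene length includes the stop codon, which does not
# contribute an amino acid to the protein.
codons = gene_length_bp // 3
expected_protein_aa = codons - 1

print("span matches gene length:", span_bp == gene_length_bp)
print("length is a multiple of 3:", gene_length_bp % 3 == 0)
print("protein length matches:", expected_protein_aa == protein_length_aa)
```

Under these assumptions all three checks agree with the listed values.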


Plasmid Coverage information

Num covering plasmid clones: 31
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 29
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACCGACA CCGCTCCGGC GGTCCAGGAA GAGCGGGAGG CTCCGGCCTC CCGCTCCCCG 
GGCGCCGGCA ACCTGATCAT CAAGATCATT CTGGTAGGGC TGATCGACGC CCTGCTGATC
TACTGTCTGG CCCAGGCCTG GACGGCCGGG TGGTGGCCGG CGGTCGTGTT CTTCGCGATC
GTGCTGATCG CCGTCAACGC GGTCTATTTC ACCAAGGGCA ACCTGCCCTT GAAGTACCTG
ATCCCCGGTC TGGTCTTCCT GATCGTCTAC CAGCTGTTCA TGATGCTGTT CACCGCCTAC
CTGTCGTTCA CCAACTACGG CACGGGCCAC CTGGACAGCA AGGACGCGGC GATCGTGGCC
ATCCAGGCCA GCAACGTGGT GCCGGTCGAG GGCGGCACCG AGTACGCCGT GGTGCCGATC
GAGCAGAACG GCACCGTGTC CATGCTGGTC ACCGATCCGG CGACCAACCA GGTCCGGATC
GGCACCAATG AGGGACTGAC CGAGGTCCCG GCCGGTGACG TGCAGCGCGA CGGCGACCGG
GTCACCGGGG TCAGCGGCTA CCAGAGCCTG AATCTGGCCT CGCTCTCGGG CAATCCCGAC
CTCAAGGCGC AGTGGGATGC CCTCAGTCCA CCGGTCAACG CGGACGAGGG CACCTACCTA
CGGGCCATCT CGATCACCCG GGCCCGCGAG GCCAGATCCG GCTTCACGTA CGACGAGGCG
CAGGACGCCA TGATCAACAC GGCGACCGGC GAGGTCTATC TGGCGAACAA CGACGACGGT
GCGTTCATCA ACGCCACCAC CGGTCAACGG CTCAACCCGG GCTGGACGGT CGGGGTCGGC
TTCAGCAACT ACGTCAAACT GCTGACCGAC CAGACGATCC GGGAATCGTT CCTGCCGATC
CTGTTGTGGA CGTTCGCCTT CGCGATCCTG ACGACATTCC TGAACTTCTC GCTCGGGCTC
GCCCTGGCGT TGATCCTGCA GGAACGCCGA ATGCGCGGGA AGGGCATCTA CCGGGTGCTG
TTGATCATCC CGTACGGGCT TCCGGTCATC CTGACCGCGC TGGTCTGGCA GGGCATGCTC
AACGCCGACT TCGGCATCGT CAACCAGATC CTGGGGGCCA ACATCCAGTG GCTCAACGAT
CCCTGGCTGG CCAAGTTCTC GGTGCTGATG GTCAACCTGT GGATGGGCTT CCCGTACTTC
TTCCTGGTCT GTTCGGGCGC GCTGACCTCG GTCCCGGCCG ACCTGAAGGA GGCGGCGTTC
GTGGACGGCG CGTCCAGCCG GCACGCCTTC CGCACGGTGG TGCTGCCGCT GTTGCTGGTG
GCCACCGCCC CGCTGCTGGT CACCACGTTC GCGTTCAACT TCAACAACTA CACCTTGATC
AACCTGTTGA CCGGCGGCGG CCCGTTCTCG GGCTCGGCCA TCAACGGCGG GTCCACCGAC
CTGCTGATCA ACTACACGCT GCGGGTGGCC TTCACCCCGG CCAACCAGCA GATGGGCCTG
GCCTCGGCCA TCGCGATGCT CATCTTCGTC ATCGTCGGAT CGGTGTCGGC CTACGGGTTC
CGGCTCACCC GCAAACTTGA GGAGATCGGA CGATGA
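
A GC content figure such as the one listed for this gene is typically the percentage of G and C bases in the nucleotide sequence. The sketch below shows one way to compute it from text laid out in space-separated blocks like the gene sequence here. The function name `gc_content` is hypothetical, and the database's exact rounding rule is not known, so this is an approximation of the idea and not the database's own method.

```python
def gc_content(sequence: str) -> float:
    """Return the percentage of G and C bases, ignoring all whitespace."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("sequence is empty")
    gc_count = sum(1 for base in bases if base in "GC")
    return 100.0 * gc_count / len(bases)


# Small demo on the first 10-base block of the gene sequence.
first_block = "ATGACCGACA"
print(gc_content(first_block))
```

Passing the full gene sequence, with its spaces and line breaks left in, works the same way because whitespace is stripped before counting.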
 
Protein sequence
MTDTAPAVQE EREAPASRSP GAGNLIIKII LVGLIDALLI YCLAQAWTAG WWPAVVFFAI 
VLIAVNAVYF TKGNLPLKYL IPGLVFLIVY QLFMMLFTAY LSFTNYGTGH LDSKDAAIVA
IQASNVVPVE GGTEYAVVPI EQNGTVSMLV TDPATNQVRI GTNEGLTEVP AGDVQRDGDR
VTGVSGYQSL NLASLSGNPD LKAQWDALSP PVNADEGTYL RAISITRARE ARSGFTYDEA
QDAMINTATG EVYLANNDDG AFINATTGQR LNPGWTVGVG FSNYVKLLTD QTIRESFLPI
LLWTFAFAIL TTFLNFSLGL ALALILQERR MRGKGIYRVL LIIPYGLPVI LTALVWQGML
NADFGIVNQI LGANIQWLND PWLAKFSVLM VNLWMGFPYF FLVCSGALTS VPADLKEAAF
VDGASSRHAF RTVVLPLLLV ATAPLLVTTF AFNFNNYTLI NLLTGGGPFS GSAINGGSTD
LLINYTLRVA FTPANQQMGL ASAIAMLIFV IVGSVSAYGF RLTRKLEEIG R
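
The protein sequence can be related back to the gene sequence by translating it codon by codon. The sketch below is a minimal illustration, not the database's pipeline. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code and differs only in its permitted start codons. It does not handle alternative start codons, which is sufficient here because this gene begins with ATG. The names `CODON_TABLE` and `translate` are hypothetical.

```python
from itertools import product

# Codon-to-amino-acid mapping in TCAG order; '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): amino_acid
    for codon, amino_acid in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate DNA to protein, stopping at the first stop codon."""
    bases = "".join(dna.split()).upper()
    protein = []
    # Step through complete codons only; a trailing partial codon is ignored.
    for i in range(0, len(bases) - 2, 3):
        amino_acid = CODON_TABLE[bases[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases of the gene sequence, in the same block layout.
print(translate("ATGACCGACA CCGCTCCGGC GGTCCAGGAA"))
```

The demo input covers the first ten codons of the gene, and its translation matches the first block of the protein sequence, MTDTAPAVQE.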