Gene Namu_4920 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNamu_4920 
Symbol 
ID8450551 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNakamurella multipartita DSM 44233 
KingdomBacteria 
Replicon accessionNC_013235 
Strand
Start bp5493263 
End bp5494399 
Gene Length1137 bp 
Protein Length378 aa 
Translation table11 
GC content69% 
IMG OID645043959 
ProductABC transporter related 
Protein accessionYP_003204183 
Protein GI258655027 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG3839] ABC-type sugar transport systems, ATPase components 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones48 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones37 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCCGACG TCAAGTACGA CGGAGCGTCC CGGGTCTACC CGGGTAGCCC GCCGGTGCGC 
GCGGTGGACA CCCTGAACCT GGAGATCCCG GACGGCGAGT TTCTCGTCCT GGTCGGTCCC
TCGGGCTCGG GCAAGTCCAC GGCGCTGCGC ATGCTCGCCG GCCTGGAGGA CGTCAACGAG
GGCCGCATCC TGATCGGCGG CCGCGACGTC ACCCACGTCC CGCCGAAGAG CCGGGACATC
GCCATGGTGT TCCAGTCCTA CGCGCTGTAC CCGCACATGA CCGTGGCCGA GAACATGGGC
TTCGCGCTCA AGCTGCGCGG CGTGAACAAG ACCGAGATCG CCGAGAAGGT CAAGGAGGCC
GCCGGCCTGC TGGATCTGGA GAAGTACCTG GACCGCAAGC CCAAGGCCCT CTCCGGTGGC
CAGCGCCAGC GCGTGGCCAT GGGCCGCGCC ATCGTCCGCG AGCCGTCCGT GTTCCTCATG
GACGAGCCGC TGTCCAACCT GGACGCCAAG CTCCGGGTGG AGACCCGCGC CAACATCGCC
GAGCTGCAGT CCCGGCTGGG CACCACGACC GTCTACGTCA CCCACGACCA GGTCGAGGCC
ATGACCATGG GCCACCGGGT GGCGGTGCTC AAGGACGGCA TCCTGAACCA GGTCGACACC
CCGCGGACCC TGTACGACAA GCCGGTCAAC GTCTTCGTCG CCGGGTTCAT GGGCTCTCCC
GCGATGAACC TGCTGACCGT GCCGCTGGTC GGCGACGGGG CCCAGCTGGG TGACGCGACC
TTCGCGCTGC CCCGCAGCGT GCTGTCCGAG GCGTCCGCCG CGGGCCTCAA GGAGGTCACC
TTCGGTGTCC GTCCGGAGAA CATCACGCTC GCCGACAAGG GCATCCCGGT CACCATCGAC
CTGGTCGAGG AACTCGGCGC CGACGCCTTC GTGCACGGCC ACACCCCGGA CGGCAACCGC
CTGGTGATCC GCGCCGACGC GCGGATCCAC CCGTCGCAGG GATCGACCGT CTACGCCCTG
CCGACCGACG CCGACCACTG CCACGTGTTC GACCCGAACA CCGGCATCCG GTTCAAGTCC
TCGGACGCGG TCCCGGCCGA CGCCGCCAAC AGCAGCATCA CCCTCGGCAA GAGCTGA
 
Protein sequence
MADVKYDGAS RVYPGSPPVR AVDTLNLEIP DGEFLVLVGP SGSGKSTALR MLAGLEDVNE 
GRILIGGRDV THVPPKSRDI AMVFQSYALY PHMTVAENMG FALKLRGVNK TEIAEKVKEA
AGLLDLEKYL DRKPKALSGG QRQRVAMGRA IVREPSVFLM DEPLSNLDAK LRVETRANIA
ELQSRLGTTT VYVTHDQVEA MTMGHRVAVL KDGILNQVDT PRTLYDKPVN VFVAGFMGSP
AMNLLTVPLV GDGAQLGDAT FALPRSVLSE ASAAGLKEVT FGVRPENITL ADKGIPVTID
LVEELGADAF VHGHTPDGNR LVIRADARIH PSQGSTVYAL PTDADHCHVF DPNTGIRFKS
SDAVPADAAN SSITLGKS