Gene Namu_3971 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNamu_3971 
Symbol 
ID8449590 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNakamurella multipartita DSM 44233 
KingdomBacteria 
Replicon accessionNC_013235 
Strand
Start bp4383266 
End bp4384318 
Gene Length1053 bp 
Protein Length350 aa 
Translation table11 
GC content68% 
IMG OID645043016 
Productperiplasmic binding protein/LacI transcriptional regulator 
Protein accessionYP_003203252 
Protein GI258654096 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1879] ABC-type sugar transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones26 
Plasmid unclonability p-value0.182523 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value0.0264597 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAGCGTT CCAAGGTCCT CACTGCCACC GCGGTCGGCG CGTTCGCCGT CATGCTCACC 
GTGGCGGGCT GCTCGTCCTC CAAGCCGGAA TCCAGCGCGG GCACCTCGGC CGGTTCGGGC
TCGGCCGCGG CGGCCACCGC CAGCGCCGCG ACCGGGTCGG TGGCCGCGCC GACCAAGGCC
GGCAAGGACT ACAACGTGGC GTTCATCCAG GGTGTCGCCG GCGACGAGTT CTACATCACC
ATGCAATGCG GCATCGAGGC CGAGGCGGCC AAGCTGGGCG TCACGGTGAA CACGCAAGGC
GGCCAGAAGT TCGACCCGAC GCTGCAGACC CCGATCCTGG ACTCGGTCGT GGCCAGCAAG
CCCGACGCGA TCCTGATCGC GCCGACCGAT GTCACCGCCA TGCAGAGGCC GCTGGAGAAC
GCGGCCGCCG CCGGCATCAA GGTCGTCTTG GTCGACACCA CCACCGAGGA CCCGTCGTTC
GCCGTCTCCC AAGTCTCCTC GGACAACGAA GGCGGCGGCG CCGCCGCGTT CAAGGCCATC
AAGGACAAGA ACCCCAACGG CGGCAAGGTG CTGGTCATCT CCACCGACCC CGGCATCTCT
ACCGTCGACG CCCGGGTGAA GGGCTTCGAG GACGCGGTCG GCAAGGATTC CACGTTCGAC
TACCTGGGCG TGCAGTACTC GCACAATGAC CCGGCCACGG CCGCCCAGCT GGTCACCGCG
GCCCTGCAGA AGGACCCCGA CATCGTCGGC ATCTTCGCCA CCAACATCTT CTCCGCGGAG
GGCTCGTCCA CCGGCGTCAA GCAGGCCGGC AAGAGCGACC AGATGACGAT CGTCGGCTTC
GACGCCGGCC CGAACCAGGT CAAGGCGCTC AAGGACGGCA CCGTCCAGGC GCTGGTCGCG
CAGCAGCCCG CCACCATCGG CACCGACGGG CTGGATCAGG CGATCGCCTC GCTCGACGGC
GGCACCATCA CCCCCAAGAT CCAGACCGGC TTCACCATCA TCACGGCCGA TAACGTCGAC
TCCTCGGATG CGGTCTACAA GTCCTCCTGC TGA
 
Protein sequence
MKRSKVLTAT AVGAFAVMLT VAGCSSSKPE SSAGTSAGSG SAAAATASAA TGSVAAPTKA 
GKDYNVAFIQ GVAGDEFYIT MQCGIEAEAA KLGVTVNTQG GQKFDPTLQT PILDSVVASK
PDAILIAPTD VTAMQRPLEN AAAAGIKVVL VDTTTEDPSF AVSQVSSDNE GGGAAAFKAI
KDKNPNGGKV LVISTDPGIS TVDARVKGFE DAVGKDSTFD YLGVQYSHND PATAAQLVTA
ALQKDPDIVG IFATNIFSAE GSSTGVKQAG KSDQMTIVGF DAGPNQVKAL KDGTVQALVA
QQPATIGTDG LDQAIASLDG GTITPKIQTG FTIITADNVD SSDAVYKSSC