Gene Namu_2109 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNamu_2109 
Symbol 
ID8447720 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNakamurella multipartita DSM 44233 
KingdomBacteria 
Replicon accessionNC_013235 
Strand
Start bp2326876 
End bp2327874 
Gene Length999 bp 
Protein Length332 aa 
Translation table11 
GC content67% 
IMG OID645041232 
Productperiplasmic binding protein/LacI transcriptional regulator 
Protein accessionYP_003201476 
Protein GI258652320 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1879] ABC-type sugar transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value0.0292423 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value0.0717969 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGCGCCA CCCGTGTCGC TGTCGCGACC CTGTCCCTGA TCGCCGCCGC CGCCCTGGTT 
GCCGGTTGCT CCTCCGGCTC GTCCAGCGCG TCGGGCAGCA GCAGTGGTGC CGCAGAGGGC
AAGAAGGTCT ACGCCCTGCT GCCGCAGGGC ACCGACCAGC CCTACGGCAC CGAGTACCTC
AAGGCGATGC AGGCCGAGGC CGACAAGGAC GGCATCGACC TGACCATCAC CAACTCGCAG
TACGACGCCG ACAAGCAGGC CAGCGACTGC CAGGTCGCGG TGGCGGCCAA ACCGAATCTG
ATCATCCTGT GGCCCGCGGT GGCCGATGCG GTCCGGCCCT GCCTGGAGCG GGCCAAGGCG
GCCGGAATCC CGGTGACGGT CACCAACTCC GACGTCGAGG CCGACGACAA GTCCCTGGTC
GTCGCCTATT CCGGCCCCGA CACGATCGGT CAGGGAGCCG CGTCGGCCGA GATCATGTGC
GATCTGGCCA AGGGGCAGGC CCTGAACATC CTGGAGATCG ACGGGCTCAC CGGCAACACC
ACCGCCATCA ACCGAGCCAA GGGCTTCGCC GACACCATCG CCAGCACGTG CCCGAACGTC
AAGGTGCTGG CCGCCCAACC CGGCGACTGG AACAAGGACG ATGCGCAGAC CGTGACCTCG
GAAATGCTGA CCTCGGTCGG CGCGGCCAAC GTCCAGGGCA TCTACGCCGC GGACGACACC
ATGGTGGCCG GCGCGATCGA CGCGCTCAAG GCGCAGAACA TCGACCCGAA GTCGTTGATC
ATCACCTCCA TCGGCAACAC CAAACTGGGT AATCCGCTGG TGATCTCGGG TGAGCTGGAC
GGCACCGTCT TCCAGTCCTC CTCGTGGGAC GGGCAGAACG CGATCGTGGT CGCCAACAAG
GTGCTCTCGG GGGAGCAGGT CTCCGGCGAT CTGTTCATGC CCTCGGTCAA GGTGACCTCG
GCCAACGCGA CGGACCCCTC CGTCACCCCG GAGTGGTAA
 
Protein sequence
MRATRVAVAT LSLIAAAALV AGCSSGSSSA SGSSSGAAEG KKVYALLPQG TDQPYGTEYL 
KAMQAEADKD GIDLTITNSQ YDADKQASDC QVAVAAKPNL IILWPAVADA VRPCLERAKA
AGIPVTVTNS DVEADDKSLV VAYSGPDTIG QGAASAEIMC DLAKGQALNI LEIDGLTGNT
TAINRAKGFA DTIASTCPNV KVLAAQPGDW NKDDAQTVTS EMLTSVGAAN VQGIYAADDT
MVAGAIDALK AQNIDPKSLI ITSIGNTKLG NPLVISGELD GTVFQSSSWD GQNAIVVANK
VLSGEQVSGD LFMPSVKVTS ANATDPSVTP EW