Gene Namu_3431 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNamu_3431 
Symbol 
ID8449046 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNakamurella multipartita DSM 44233 
KingdomBacteria 
Replicon accessionNC_013235 
Strand
Start bp3772025 
End bp3773062 
Gene Length1038 bp 
Protein Length345 aa 
Translation table11 
GC content69% 
IMG OID645042507 
Productinner-membrane translocator 
Protein accessionYP_003202747 
Protein GI258653591 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1172] Ribose/xylose/arabinose/galactoside ABC-type transport systems, permease components 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value0.00119664 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.000332917 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGTCCGATC GAACCGCGAC CGATCAATCC GCCCGCCCCG AGAACTCCGT CCGCAGCGTC 
GATGTGGCTC CGGCGCCGGC CGCCGACCGC CGACGCTGGC GGGTCATCGA CCTGTGGGCC
ACCGTCGGCC CGCTGACCGT GTTCGTCATC CTGTTCGCGC TGGTGGCGAT CCTGCGGCCG
GCCTTCCTCG GCGGCGGCGG CCTGTCCATC GTGGCCACCC AGTGCACCGC GATCCTGCTG
GTCGCCCTGG GCCAGTGCCT GGTGCTCAAC GTGGGCTCGA TCGACCTGTC CAACGCGGCG
ATCGCGCTGT TCTCGGCGAT CCTGCTGGCC AAGACGATCG GCCCGGCCGG GGCCGGCGGG
CTGGTCCTGG TGATCGTGCT CGGCGCGGCC ATCGGTGCGC TCAACGGGTT CCTGGTCTCG
TTCTTCCAGG TGCCCAGCTT CGCCCTGACC CTGGGCACGC TGGGCATCCT GCAGACCGCG
TCGCTGATCA TCAGCGACAA GACCACCGTC TACGCGGCCA AGAGCGCCCT GCTCACCCCG
ATGTTCGGCT CGGCGATCGG CGGGCTGGTC ACCGCCTTCT GGACCGCGGT GATCATCGCC
ATCGTGCTCT GGGCGATGCT GCGTTTCACC ACCCTGGGCC AGAGCATGAC GGCGGTCGGG
CTGAACGAGA CCGGCGCCCT GTTCTCGGGC ATCCGGACCC GGGCCACCAA GATCATCGCC
TTCATGTTCT CCGGACTGCT GGCCTCCATC GCCGGCGTCA TGATCATCGC CCAGGCGGGA
TCGGCGTCCA GCACCGGCCT GGGCAGCGAC CTGCTGTTGC CCGGGATCAC CGCGGCGATC
GTGGGCGGCA CGGCGATCAC CGGCGGCATC ACCAATCCCA TCAACGTCGT CTTCGGCGCC
CTGACGGTCA CCCTGATTCC CGTCGGCACC GCGGCGATCG GCATCCCGTC CGAGGCGCAG
AGCCTGGTCT ACGGCCTGGT GATCATCATC GCCGTGGCCC TGACCATCAG CCGCAAGCGC
GTCGGCGTCG TGAAGTAA
 
Protein sequence
MSDRTATDQS ARPENSVRSV DVAPAPAADR RRWRVIDLWA TVGPLTVFVI LFALVAILRP 
AFLGGGGLSI VATQCTAILL VALGQCLVLN VGSIDLSNAA IALFSAILLA KTIGPAGAGG
LVLVIVLGAA IGALNGFLVS FFQVPSFALT LGTLGILQTA SLIISDKTTV YAAKSALLTP
MFGSAIGGLV TAFWTAVIIA IVLWAMLRFT TLGQSMTAVG LNETGALFSG IRTRATKIIA
FMFSGLLASI AGVMIIAQAG SASSTGLGSD LLLPGITAAI VGGTAITGGI TNPINVVFGA
LTVTLIPVGT AAIGIPSEAQ SLVYGLVIII AVALTISRKR VGVVK