Gene Namu_2107 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNamu_2107 
Symbol 
ID8447718 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNakamurella multipartita DSM 44233 
KingdomBacteria 
Replicon accessionNC_013235 
Strand
Start bp2324785 
End bp2325795 
Gene Length1011 bp 
Protein Length336 aa 
Translation table11 
GC content68% 
IMG OID645041230 
Productinner-membrane translocator 
Protein accessionYP_003201474 
Protein GI258652318 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1172] Ribose/xylose/arabinose/galactoside ABC-type transport systems, permease components 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.0000157987 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value0.126352 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCGCGT CTGCCGTGAC CCCGCCGCCG GCGCCGGCGG TCGCCCCGGC CGAGCCCCGC 
GTACCCACCG CGACCAGGGT GATCGGCTGG ATCGCCACCA ACGGCATCTT CGTCTTCACC
GTCGTGCTCG TGGTGGGCGC ATCGCTGCTG GTCGACGGAT TCGCCTCGGC CACCAACATC
GGCGATGTGT TCCATCGGGC CGCGCCGATC GGCATCGTCG CCGTCGGCAT GACCTTCGTC
GTGATCAGCG GCAACTACCT GGACCTGTCC GTGGTTGCCC AGGTCGCCAC GGCGGCGGTC
ATCCTCATCG GGGTCAGCAA TGGCCACGGG ATCGGGCTGG CGATCCTGGC CGCGCTGGTC
GTGGCCGGGC TCTATGCCCT GGTCAACGGG GTGGCGGTGG GGTATTTCAA GGCCAACGCG
GTGATCGTCA CCCTGTCCAC CACCTATATC GGTCTGGGCG TGCTGCGCTG GCTTTCCGGT
GGGAGCATCT TCTTCGGCCC GCCCGACGGC CCGATCGCCA CCTTCGGCGA CATCAAGGTG
GGGCCGGTGC CGATCTCGGC CGTGGTACTG CTGCTGATGG CCGGGGTGCT CGGATTCGTG
TTGAGCCGCA CCACCTTCGG TTTCGTGATC CGCTCGTTCG GGTCGAACAA GGAGGCCACC
AGGCTGGCCG GAGTGGCCAC CGGTCGGGTG GTGCTGGGCG CGTTCCTGAT CACTTCGATC
TCGGCGATGG TGGCCGGCTT CGTGCTGGCC GCGTTCTCCA ACACGGCGGT GTCCTCGATG
TCGCAGGGCT ACGACTTCGG TGCCCTGGCC GCCATCATCA TCGGCGGCAC CAGCGTGTTC
GGCGGCCGGG GCAGCGTGTT GCGCACGCTG CTCGGGGTGA TCTTCGTCAG CGTGCTGACC
AACATCCTGG TGCTCGCGAA CCTGAGCTAC GGCTGGCAGC AGGTCGTGAT CGGGTCCCTG
ATCGTGCTGG CTGTTTCGGT GGACGCGCTG GCCCGGCGGG TGAGCGCATG A
 
Protein sequence
MTASAVTPPP APAVAPAEPR VPTATRVIGW IATNGIFVFT VVLVVGASLL VDGFASATNI 
GDVFHRAAPI GIVAVGMTFV VISGNYLDLS VVAQVATAAV ILIGVSNGHG IGLAILAALV
VAGLYALVNG VAVGYFKANA VIVTLSTTYI GLGVLRWLSG GSIFFGPPDG PIATFGDIKV
GPVPISAVVL LLMAGVLGFV LSRTTFGFVI RSFGSNKEAT RLAGVATGRV VLGAFLITSI
SAMVAGFVLA AFSNTAVSSM SQGYDFGALA AIIIGGTSVF GGRGSVLRTL LGVIFVSVLT
NILVLANLSY GWQQVVIGSL IVLAVSVDAL ARRVSA