Gene Ndas_2125 details


Gene Information

Locus tag: Ndas_2125
Symbol:
ID: 9245975
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111
Kingdom: Bacteria
Replicon accession: NC_014210
Strand:
Start bp: 2543681
End bp: 2544958
Gene Length: 1278 bp
Protein Length: 425 aa
Translation table: 11
GC content: 77%
IMG OID:
Product: major facilitator superfamily MFS_1
Protein accession: YP_003680055
Protein GI: 297561081
COG category:
COG ID:
TIGRFAM ID:
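
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive (the record does not state this, but the numbers agree with it) and that the final codon of the CDS is a stop codon. The function and variable names are hypothetical, not part of the database.

```python
# Values copied from the record above; names are illustrative.
start_bp = 2543681
end_bp = 2544958
gene_length_bp = 1278
protein_length_aa = 425


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Residues encoded by an unspliced CDS whose last codon is a stop."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Both checks print True for this record.
print(span_length(start_bp, end_bp) == gene_length_bp)
print(expected_protein_length(gene_length_bp) == protein_length_aa)
```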


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 19
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACCACTC TGTCCCCGCG TGCCCCCGGC ATGCGCGCGT CCCGTCTGAC CCGACCCCTG 
TACGCCTACG CGGCCCTGGG GGAGTTCGTC CTCCTCTACC CGCTGTACGC GCTGCTGTTC
GACGAGACCG GCCTGACCGT CGCCCAGATC AGCTCCCTGT TCGTGATCTG GTCACTGTCC
TCGGTCCTCG CGTCCGTCCC CGCCGGGGCC TGGGCCGACG TGGTGCCGAG GCGCTACCTG
CTGGCCGCCG CGCCGCTCCT GGCCGCGTGC GCGTTCGTGC TGTGGCTGCT GTGGCCCGGC
TACTGGACCT TCGCGCTCGG CTTCGTCCTG TGGGGCGCGG GCGGCGCGCT GTCCTCGGGC
TCCCTCGAAG CCCTCGTCTA CACCGAACTC GCCCGCCGCG ACGCGTCCGA CCGCTACGCC
CGGGTCATGG GCGTGACCCG CGCCCTGGAC GTGGCCGCCG TCGGCGCCGC CACCCTGCTG
GCCATCCCCG TCATGGCCCA CGGCGGCTAC GCGGCGGTCG GCGCGGCCAG CGTCGCCGTC
TGCGTGCTGT GCGCCCTGGC GGCCCTGGCC CTGCCCGAGC ACCGCGCCGC CGAGGGCGGC
CCCGCCTCCG CCGACGGCGC CGCGGGGGCC GGGGAAGCGG AGGAGGGGCC CTCCGGCTAC
CTGGACGCCC TGCGCGGCGG ACTGGCCGAG GCCCGCTCCA GCCGGCGGGT GCGCGGCGCG
ATCGTGCTGG TGGTCGCGGT GACCGCGTTC TGGGAGGCGC TGGACGAGTA CGTCCCCCTG
CTCGTCGCCG AGTCCGGCGT CCCCGCCGCC ACCGTTCCCG TGGCGGTCCT GGTGGTGTGG
GCCTTCGTGC TCCTGGGCGG GCTGCTGGCG GGGCCCGCCT CCCGGCTGCC CGTACGGGCC
CTGGCCGTCC TGGTGGGCCT GGCCGCGCCC GCCATCGCCG CCGGAGCCCT GCTGGACGGC
CCGGTGGGAT GGGTGCTGAT CGGCGTGGGC TTCGGCCTGT GCCAGACCGC GGGCGTGGTC
GCCGACGCCC GCCTGCAGGA CCGCATCACC GGGTCGAGCC GGGCCACCGT CACCTCCCTG
GCCGGTCTGG GCACCGACGC CCTCACCACG GCCTCCTACC TGCTGTACGC GGGGGTGTTC
GCGGTGACCG GGCACGCCCT GGCGTTCGCC CTGTTCGCAC TGCCCTACCT GGTGGCGGCG
CTCTTCCTGG GCCGCGGTAC CCGCGGGGCC CGTCCGGTCC GCCGCTCCCC TGACCGGAAG
GGGGAGGCGC GGCGGTGA
 
Protein sequence
MTTLSPRAPG MRASRLTRPL YAYAALGEFV LLYPLYALLF DETGLTVAQI SSLFVIWSLS 
SVLASVPAGA WADVVPRRYL LAAAPLLAAC AFVLWLLWPG YWTFALGFVL WGAGGALSSG
SLEALVYTEL ARRDASDRYA RVMGVTRALD VAAVGAATLL AIPVMAHGGY AAVGAASVAV
CVLCALAALA LPEHRAAEGG PASADGAAGA GEAEEGPSGY LDALRGGLAE ARSSRRVRGA
IVLVVAVTAF WEALDEYVPL LVAESGVPAA TVPVAVLVVW AFVLLGGLLA GPASRLPVRA
LAVLVGLAAP AIAAGALLDG PVGWVLIGVG FGLCQTAGVV ADARLQDRIT GSSRATVTSL
AGLGTDALTT ASYLLYAGVF AVTGHALAFA LFALPYLVAA LFLGRGTRGA RPVRRSPDRK
GEARR
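
The relationship between the gene sequence and the protein sequence can be sketched with a small translation routine. This is a minimal illustration, not the tool the database used. It relies on the fact that NCBI translation table 11 assigns the same amino acids to codons as the standard table (the two differ only in permitted start codons). The names `CODON_TABLE` and `translate` are hypothetical, and the input is assumed to contain only A, C, G, and T.

```python
from itertools import product

# Codon assignments in NCBI order (bases T, C, A, G; first base varies slowest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate in frame 1, ignoring whitespace and stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases and last 18 bases of the gene sequence above (both in frame).
print(translate("ATGACCACTC TGTCCCCGCG TGCCCCCGGC"))  # MTTLSPRAPG
print(translate("GGGGAGGCGC GGCGGTGA"))               # GEARR
```

The two outputs match the first ten and last five residues of the protein sequence listed above.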