Gene Ndas_3679 details


Gene Information

Locus tag: Ndas_3679
Symbol:
ID: 9247548
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111
Kingdom: Bacteria
Replicon accession: NC_014210
Strand:
Start bp: 4415753
End bp: 4416985
Gene Length: 1233 bp
Protein Length: 410 aa
Translation table: 11
GC content: 76%
IMG OID:
Product: major facilitator superfamily MFS_1
Protein accession: YP_003681583
Protein GI: 297562609
COG category:
COG ID:
TIGRFAM ID:
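
The Start bp, End bp, Gene Length and Protein Length values above are mutually consistent, which can be checked with a little arithmetic. The sketch below is illustrative only: it assumes the coordinates are 1-based and inclusive (an assumption that matches the reported 1233 bp) and that the final codon of the CDS is a stop codon. The function names are hypothetical and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS whose last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Total codons minus the terminal stop codon.
    return gene_len_bp // 3 - 1


length = gene_length(4415753, 4416985)
print(length)                  # 1233
print(protein_length(length))  # 410
```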


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value: 0.709991
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 19
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCGCGCA CTGGCGGCGC GGGTGCGGGA CTGACCGCCC CGCTGGGGCA CCCGGCCTTT 
CGCGGGCTGG TCGCGGGCCG GACCCTGATG GCCCTGGGCA ACGGCATGGC GTCGGTGGCG
CTGGCCTTCG CCGTGCTGGA CGTGACCGGC TCGCTGACGG GCGTGGGGCT GGTGGTGGGC
GCGCGCTCGG TGGCGAACGT GGCGCTGCTC CTGCTCGGCG GCGTCATCGC CGACCGGCTG
CCGCGCGCGC TGGTGCTCCA GGGCGGGTGC GCCCTGGCCG CGCTCTCCCA GGCGGTCCTG
GGCGCGCTCC TGTTGACGGG CACCGCCTCG CTGCCGCTGA TGATCGCGCT GAGCCTGGTC
AACGGCGCGG CCGCGGCGGT GAACCTGCCC GCCTCCGCGG CCCTGACGCC GCAGACCGTG
CCCCGGGAGC TGCTGCGCCA GGCCAACGCG GCGATGGGCG TGGGCGTCCA GGGCGGGCTG
TTCCTGGGGA TGTCGGCGGG CGGCGTCGTG GTGGGCCTGC TCGGCGCGGG CTGGGCGATC
ACGGCGGACG CGGCGCTGTT CGCCTGCGCC GGGACGGCCT TCCTGTCCGT ACGCGCGGCA
CGGGCGGACG GGCCGGTGGA CACCGGGGAC GCGGACGTGC TGCGCGACCT GCGGGAGGGG
TGGAGCGAGT TCGTGTCGCG GCCCTGGGTC TGGATCATCG TGCTCCAGTT CATGGTCGTC
AACGCGGGCT GGTCGGCGGC CACGGCCGTG CTGGGTCCGG GCATCGCCGA CGAGACCTTC
GGCCGGACCG CCTGGGGCCT GCTCATGGCG GCCAACAGCG TCGGGCTGCT GGCGGGCGGC
GTGCTGGCGG CCCGCTGGCA GCCGCGGCGG GCGCTGGTCT ACGGCACGGC CCTGATGACG
GCGCACGCCG TGCCCTTCCT GGCCCTGACC GGGCCCTCGC CGCTGCCGGT GCTGTTCGCC
GCGATGTTCC TGGCGGGCGT GGCGGTACAG CAGTTCGACG TGGCCTGGGA GGTCGCCGTC
CAGGAGAACG TGCCGAAGGA GAAGCTGTCG CGGGTGTACT CCTACGACGC GCTGGGCTCG
TTCGTGGCGA TGCCCCTCGG GCAGGTCGCG ATCGGCCCGT TCGCCGAGCG CTTCGGCCCG
GCCCCGGCCC TGGTGCTGGT GGCCTCGCTG ACGCTGCTGG CCACCCTGGC CGCGGTCTCC
TCGCGCAGCG TGCGCACCCT GACCCGCTCC TAG
 
Protein sequence
MARTGGAGAG LTAPLGHPAF RGLVAGRTLM ALGNGMASVA LAFAVLDVTG SLTGVGLVVG 
ARSVANVALL LLGGVIADRL PRALVLQGGC ALAALSQAVL GALLLTGTAS LPLMIALSLV
NGAAAAVNLP ASAALTPQTV PRELLRQANA AMGVGVQGGL FLGMSAGGVV VGLLGAGWAI
TADAALFACA GTAFLSVRAA RADGPVDTGD ADVLRDLREG WSEFVSRPWV WIIVLQFMVV
NAGWSAATAV LGPGIADETF GRTAWGLLMA ANSVGLLAGG VLAARWQPRR ALVYGTALMT
AHAVPFLALT GPSPLPVLFA AMFLAGVAVQ QFDVAWEVAV QENVPKEKLS RVYSYDALGS
FVAMPLGQVA IGPFAERFGP APALVLVASL TLLATLAAVS SRSVRTLTRS
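
The gene and protein sequences above can be cross-checked by translating the nucleotide sequence. The following is a minimal sketch, not the method used to produce this record. It builds the codon-to-amino-acid mapping shared by NCBI translation tables 1 and 11 and ignores the alternative start codons that table 11 permits, which does not matter here because the gene begins with ATG. The helper names (`clean`, `translate`, `gc_content`) are hypothetical.

```python
from itertools import product

# Codon order follows the usual T, C, A, G layout; the third base varies fastest.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group bases in blocks of ten."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate in frame 1; stop codons are shown as '*'."""
    seq = clean(seq)
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, len(seq) - 2, 3))


def gc_content(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    seq = clean(seq)
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


# First line of the gene sequence as listed in the record.
first_line = "ATGGCGCGCA CTGGCGGCGC GGGTGCGGGA CTGACCGCCC CGCTGGGGCA CCCGGCCTTT"
print(translate(first_line))  # MARTGGAGAGLTAPLGHPAF
```

Passing the full gene sequence to `translate` and dropping the trailing `*` should give a string that can be compared with the protein sequence, and `gc_content` on the same input can be compared with the reported GC content.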