Gene Ndas_4460 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNdas_4460 
Symbol 
ID9248339 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardiopsis dassonvillei subsp. dassonvillei DSM 43111 
KingdomBacteria 
Replicon accessionNC_014210 
Strand
Start bp5299850 
End bp5301154 
Gene Length1305 bp 
Protein Length434 aa 
Translation table11 
GC content72% 
IMG OID 
Productmajor facilitator superfamily MFS_1 
Protein accessionYP_003682355 
Protein GI297563381 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones28 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGGCCAATC CGCGCCTCGA CAACACACCT GACAACGGAG ACCACGAGTC TTCGAGCGCC 
CCCAACAACG ACGACTGGCG GAGCAGCTAC CGCGACGTCA TGGCGATCGC CGAGTTCCGG
TCCGTCTGGC TCGCCCACGC GCTCTCGGTG ATCGGCACGA ACCTGCTGAA CATCGCCACC
AGCGTGCTCG TCTTCCAGAT CACCGGCTCC GCGCTGGCCG CGGGCGTCAC CCTTGCCCTG
ACCTTCCTGC CCCCGGTCAT CAGCCCCCTC ATCTCCGGAC TCGCCGACCT CCTGCCGCGC
CGCCGCGTCA TGATCGCCTG CGACCTCGCG CGCGCGGCCG TGATCGTGCT CATCGGCATC
CCCGGCATGC CGATCTGGGC CGTCTGGATA CTGCTCTTCT GCTCGGTGGT GCCGTCGGTC
CCCTTCGCCG CGGCCCGCGC CGCGATGATG GCCGAGATCG TCCAGGGCGA GCGCTACGTC
GCCAGCACGG CCATCATCCA GCTCACCTCG CAGCTGGGCA TGCTCGCCGG TCTGGTCGCC
GGCGGCCTGA TCGTCGCCGC CATCGGCCCC AACATCGCCG TGATGTCGAC CGGCGTCCTC
TTCGTGCTGT CCGCGCTCGT CGTGCTGCTG GGTGTGGCCG CGCGTCCGGC TCCCCGCGGG
GAGCGGTCGG AACGCCCGGG GTTCGTCGCC ATGACCCGCG ACGGGGCCAA GCTGGTCTTC
GCCGACCGCA GGCTGCGCAC CCTGGCGCTG CTCGCCTGGC TCGCCGGCCT GTACGGCATC
CCCTACGGGC TCGCCAACCC GATGGCCGAG GAGATCGGCG CCGGAGCGGC GGCGGCCGGG
TTCATCATGG CGGGCTCGGC GATCGGCGGG TTCGTCGGCG GGTTCGTCCT GACCCGGTTC
GTCCCGCCGC CCGTCCGCAT GCGCCTGCTC GGCCCGCTGG CCGTCCTGGC CTCGGCGCCG
CTGCTGCTGT GGCTGACCGA ACCGCCGCTG TGGCTGATGG TGTCCGCCCT CGCGCTCTGC
GGCGTCGCGG GCTCCTACCA GTTCGTGGCC AACGCGGCGT TCGTGCTGTG CGTGCCCAAG
GAGGGCCGCA GTCTGGCCTT CGGGCTGGTC GCGGCCGGGC TCCAGGCCGC CCAGGGCGTG
GGCATCCTGG TCGCCGGGTT CTTCGCCGAG AAGTTCGGCA CCGGTGTCGT CATCGCCGCG
GCCGGCGCCC TCGGCGTGGT CTGCGCGCTG CTCCTGGCCC TGCCGTGGTC GCGGATGGCC
GGCGAGACCA TCGACCGGAT GAACGCGACC GGGGGCGGCG CCTAG
 
Protein sequence
MANPRLDNTP DNGDHESSSA PNNDDWRSSY RDVMAIAEFR SVWLAHALSV IGTNLLNIAT 
SVLVFQITGS ALAAGVTLAL TFLPPVISPL ISGLADLLPR RRVMIACDLA RAAVIVLIGI
PGMPIWAVWI LLFCSVVPSV PFAAARAAMM AEIVQGERYV ASTAIIQLTS QLGMLAGLVA
GGLIVAAIGP NIAVMSTGVL FVLSALVVLL GVAARPAPRG ERSERPGFVA MTRDGAKLVF
ADRRLRTLAL LAWLAGLYGI PYGLANPMAE EIGAGAAAAG FIMAGSAIGG FVGGFVLTRF
VPPPVRMRLL GPLAVLASAP LLLWLTEPPL WLMVSALALC GVAGSYQFVA NAAFVLCVPK
EGRSLAFGLV AAGLQAAQGV GILVAGFFAE KFGTGVVIAA AGALGVVCAL LLALPWSRMA
GETIDRMNAT GGGA