Gene Ndas_1066 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNdas_1066 
Symbol 
ID9244912 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardiopsis dassonvillei subsp. dassonvillei DSM 43111 
KingdomBacteria 
Replicon accessionNC_014210 
Strand
Start bp1312342 
End bp1313832 
Gene Length1491 bp 
Protein Length496 aa 
Translation table11 
GC content73% 
IMG OID 
Productmajor facilitator superfamily MFS_1 
Protein accessionYP_003679014 
Protein GI297560040 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value0.402473 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACGGCGA CACGCCGGTG GCTGCTGCTG GTGACCGTGG CGGCCGGACT GCTGCTCGTC 
ACCGTGGACA ACACGATCCT GTACACTGCC CTGCCGACCC TGACCGCTCA ACTGGGGGCG
ACGGGCGCGC AGAGCCTGTG GATCATCAAC GCCTACCCGG TGGTCATGGC CGGTCTGCTG
CTGGGCAGCG GCACGCTGGG CGACCGGATC GGCCACGTGC GCATGTTCGT GGTCGGGCTG
GTGGTCTTCG GCCTGGCCTC GCTGGTCGCC GCGTTCTCCC CCACGGCCTG GGTCCTCATC
GCCTCGCGCG CGCTGCTGGC GGTGGGCGCG GCGGCGATGA TGCCCGCCAC CCTGGCGCTG
ATCAGGATCG CCTTCCCCAT CGAGCGCGAA CGCAACATCG CCATCGCGGT CTGGGGCAGC
GTCTCGGTGG TCGGCGGCGC GCTGGGGCCG ATCGTGGGCG GGGTCCTGCT GGAGTTCTTC
TGGTGGGGGT CGGTCTTCCT CCTCAACGTG CCCGTGGTCA TCGCGGCGCT GGTGGCGACC
GCGCTGATCG CGCCGCCCAA CGTCCCCGAC CCCGGCAAGC ACTGGGACCT GGTCTCCTCG
CTCCAGGCCA TGGCCGGGCT GGTCGCCTCG GTGCTGGCGA TCAAGGAGCT GGCGCACACG
CCGCCGCGCT GGCCGCTCTT CGCCGCCGCC GTGGTCGTGG CCGTGGTGGC CTTCGTCCTG
TTCACGCGCC GCCAGCGCCG TCTGGAGGAC CCGCTGCTGG ACTTCGGGGT CTTCCGCAAC
CCCGCCTTCA CCTCCGGTGT GCTGGCCGCG GCGTTCTCGA TGTTCGCCAT CGGCGGCATC
CAGCTCGTCA CCACCCAGCG CTTCCAGCTG GTCGTGGGCT TCACGCCGCT GGAGGCCGGG
CTGCTGGTGG CCGCCGTGGC GGCGGGTTCG CTGCCGACCG CGCTGCTGGG CGGGGCGTTC
CTGCACCGGA CCGGCCTGCT GCCCCTGATC GCGGGAGGTC TCGCCGCAGG CGTGGCGGGT
GTGGTCGTCT CGCTCCTGGG CTTCCAGACG GGCATCGGCT GGCTCGTCGC GGGGCTGCTG
CTGACCGGCG CCGGACTGGG TGCGGCGATG TCGGTGGCCT CCACGGCGAT CATCGGCAAC
GCGCCCGCCA GCCGCGCCGG GATGGCCTCC TCGGTGGAGG AGGTCTCCTA CGAGTTCGGC
AACCTGACCG CGGTGGCCCT GATGGGCAGC CTGGTCACCT TCGTCTACGC GGCCGCCGTC
CAGCTGCCGC AGGGCGCCCC GGAAGCCGCC GGGAGGAGCC TGGCCGACGC CCTGGCCTCG
GCCGGGGGCG ACGACGCGGT GGTCGCCGCG GCCCACGCCG CCTTCGACAC CGGCTACCTG
GTGGTCATGG TCGTGGTCGC GGCGGTCCTG GCGTGCGGCG CGGCCCTCAC CTGGCGGCTG
CTGCGCCACC ATGGTCCCGG CACGTCCTCG TCGGCGTACG CGGACCACTG A
 
Protein sequence
MTATRRWLLL VTVAAGLLLV TVDNTILYTA LPTLTAQLGA TGAQSLWIIN AYPVVMAGLL 
LGSGTLGDRI GHVRMFVVGL VVFGLASLVA AFSPTAWVLI ASRALLAVGA AAMMPATLAL
IRIAFPIERE RNIAIAVWGS VSVVGGALGP IVGGVLLEFF WWGSVFLLNV PVVIAALVAT
ALIAPPNVPD PGKHWDLVSS LQAMAGLVAS VLAIKELAHT PPRWPLFAAA VVVAVVAFVL
FTRRQRRLED PLLDFGVFRN PAFTSGVLAA AFSMFAIGGI QLVTTQRFQL VVGFTPLEAG
LLVAAVAAGS LPTALLGGAF LHRTGLLPLI AGGLAAGVAG VVVSLLGFQT GIGWLVAGLL
LTGAGLGAAM SVASTAIIGN APASRAGMAS SVEEVSYEFG NLTAVALMGS LVTFVYAAAV
QLPQGAPEAA GRSLADALAS AGGDDAVVAA AHAAFDTGYL VVMVVVAAVL ACGAALTWRL
LRHHGPGTSS SAYADH