Gene Ndas_4568 details


Gene Information

Locus tag: Ndas_4568
Symbol:
ID: 9248449
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111
Kingdom: Bacteria
Replicon accession: NC_014210
Strand:
Start bp: 5412897
End bp: 5414282
Gene Length: 1386 bp
Protein Length: 461 aa
Translation table: 11
GC content: 76%
IMG OID:
Product: major facilitator superfamily MFS_1
Protein accession: YP_003682461
Protein GI: 297563487
COG category:
COG ID:
TIGRFAM ID:
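
The coordinate and length fields above are mutually consistent, which can be checked with a short sketch. It assumes the Start bp and End bp values are 1-based and inclusive, and that the final codon is a stop codon not counted in the protein length; neither convention is stated in the record itself, and the variable names are illustrative only.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 5412897
end_bp = 5414282

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is excluded from the protein.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)  # prints: 1386 461
```

Under these assumptions the result matches the listed Gene Length (1386 bp) and Protein Length (461 aa).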


Plasmid Coverage information

Num covering plasmid clones: 28
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 29
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
TTGTCCACAC GCGACCGCGG CGGGATACTC CTCCTCGCCG TCCTGCTCTC CACGCTCACC 
TTCCCGCTGG CCATCACCGG TGCCTCGGTG GCCCTGCCCG CCGTCCAGGC CGAACTCGGC
GCCACGCTGA CCGCCGCCCA GTGGGTCGTC AACTCCTACA ACGCCTGCTT CGCCGCCTTC
CTCGTGTGCG CGGGGTCGGT CGCCGACGCG GTCGGCCGCC GCCGGGTCTA CGCCTTCGGC
CTCGCCCTGT TCTGCGCCAG CGGTCTGCTG TGCCTGCTCG TCCGGGACGT CACCGCCCTG
AACCTGCTGC GCGCGGCGGG CGGCGTCGGC GCCGCGGCGG CGGTCGCCGG CGGCAGTTCG
ATGATCGCCG CGGCCTTCGA GGGCCCCGCG CGGGCCCGGG CCTTCGGCCT GCTGGGCACG
GTGCTGGGCG CCGGTACCGC GTTCGGCCCG GCCGTGGCCG GACTGCTCGT CGAGAACCTC
GGCTGGCGGG CCGCGTTCGC CTCCCCGGCC GCCGTGGCCG GGCTCGTCCT CCTCCTCGTG
CCCCTCCTGC CCGCCGCGCG CGGCACGGGC AGGCCCGTGG ACCGGCTCGG AGCCGTCCTG
TTCACCTCGG CACTGCTCGC CTCGATCGCC GTCCTGGTGG AGGGACCCGC GCGCGGCCTG
CCGACGGTGC TCGCGGGGCT GGCCCTGGTC GCGGCTCTGG CGATCGCCTT CGTGCTCGTC
GAACGCCGGG CCAGCGACCC CCTGGTGGAC CTGGTCCTGC TGGCCAACCG CCGGTTCGTC
GCCCACGCCC TGGCCGCCGC CGCGTTCATG GCGGTGCTCG TGCCCCTGCT CGTGTACCTG
CCGTCGTACC TGATCGCCGT GGTGGGGCTG GGCGCCGGTC AGGCCGGGCT GTGGCTGCTG
ATGCTCACGC TGCCCACCCT CCTGCTGCCC GCCCCGGGCG CGGAACTGGC CGCGCGCCTG
CCCCACACCG CGGTCGTGGT CGGCGCCCTG CTGCTGTGCG CCGCCGGAGC CACGGGGCTG
CTCGCGCTGG GGCCCGACGC CACACCGTGG CGGCTGCTCC TGCCGTTCCT GCTGGTCGGA
GCCGGGGTGG GCCTCACCAA CGGCGTCGTG GACGGAATGG CCATGGGTGC GGTGCCCGCC
GAGCGGACGG GCGTCGCGGC CGGGGTGTTC AACGCCTCCC GGATCACCGT GGAGACGGTC
GCCCTCGCCG CGGTCGGGGC ACTCCTGGCC GCGCTCACCG GGGGACGCCT GGAGGGCGAG
CGGTTCACGG ACGCGTTCCA CGTCGTGGGC CCGGTCCTGG GCGGACTCGC CGTCCTGGCG
GCGGCCGCGG CCTGGTCCCT CGGAAGAAGG AGGACAACGG AACCGCACGG CCCGCCCCTT
CGCTGA
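
The gene sequence is printed in blocks of 10 bases, so the whitespace has to be removed before any analysis. The following is a minimal sketch of that step and of a GC fraction calculation. The helper names `join_blocks` and `gc_fraction` are hypothetical and not part of any database tooling, and the sample is only the first two blocks of the sequence, so its value is not the whole-gene GC content listed in the record.

```python
def join_blocks(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split())


def gc_fraction(seq: str) -> float:
    """Return the fraction of G and C bases in a nucleotide string."""
    seq = seq.upper()
    if not seq:
        return 0.0
    return (seq.count("G") + seq.count("C")) / len(seq)


# First two blocks of the gene sequence, used here only as a short sample.
sample = "TTGTCCACAC GCGACCGCGG"
seq = join_blocks(sample)
print(len(seq), gc_fraction(seq))
```

Pasting the full Gene sequence block into `sample` would give the length and GC fraction for the whole gene.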
 
Protein sequence
MSTRDRGGIL LLAVLLSTLT FPLAITGASV ALPAVQAELG ATLTAAQWVV NSYNACFAAF 
LVCAGSVADA VGRRRVYAFG LALFCASGLL CLLVRDVTAL NLLRAAGGVG AAAAVAGGSS
MIAAAFEGPA RARAFGLLGT VLGAGTAFGP AVAGLLVENL GWRAAFASPA AVAGLVLLLV
PLLPAARGTG RPVDRLGAVL FTSALLASIA VLVEGPARGL PTVLAGLALV AALAIAFVLV
ERRASDPLVD LVLLANRRFV AHALAAAAFM AVLVPLLVYL PSYLIAVVGL GAGQAGLWLL
MLTLPTLLLP APGAELAARL PHTAVVVGAL LLCAAGATGL LALGPDATPW RLLLPFLLVG
AGVGLTNGVV DGMAMGAVPA ERTGVAAGVF NASRITVETV ALAAVGALLA ALTGGRLEGE
RFTDAFHVVG PVLGGLAVLA AAAAWSLGRR RTTEPHGPPL R