Gene Ndas_2808 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNdas_2808 
Symbol 
ID9246659 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardiopsis dassonvillei subsp. dassonvillei DSM 43111 
KingdomBacteria 
Replicon accessionNC_014210 
Strand
Start bp3353697 
End bp3354893 
Gene Length1197 bp 
Protein Length398 aa 
Translation table11 
GC content72% 
IMG OID 
Productmajor facilitator superfamily MFS_1 
Protein accessionYP_003680726 
Protein GI297561752 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.794185 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value0.146653 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGAACGTCC AGACCAGAAC CGAGTCCCCT CCGCCGGAGG CCCGCAGGGC CAGAGTCGCG 
GTCTCCACCC TCTTCTTCGT CAACGGCTTC ACCTACACCA ACGCCGTCCC GTGGCTGCCG
GTGCTCAAGG CCCAGCTGGG GCTGAGCAAC ACGGAACTGG GCCTGGCGAT CGCGGCGATG
CCGACCGGCG CGATCCTGAC CGGCATGCTG GCGGGCCCGC TGATCCACTG GTTCGGCAGC
GGCCGGACGG CGGTGGGGAC CAGCCTCATC TCTCTGGGAG CGCTCCCATT CATCGCGCTG
GCGCAGAACT GGTGGATGCT GGCGGCGGCG CTGTTCGTGC TGGGCAGCGC GGACGCCTGG
ACCGACTCGG CGCAGAACTC ACACGGCCTG CGCGTCCAGC GGCGCTACAG ACGCAGCATC
ATCAACACCT TCCACGCGCT GTGGAGCATG GCCGCGGTGG CCGGAGGCCT CCTGGGCGCG
GCCATGGCCG GTACCGGCGT GCCCATCCTG TGGCACCTGG GCGGCGTGGC CGCGGTGCTG
GTGTGCGTGA ACCTGGCGGT GAGCCGGATG CTGCTGCCCG GCCCGGAGAG CAGCGAGCGC
GAGGACGGCA CGGACGCCGG GAGCGGGCGA CGCCTGCGCG TGCCCGGACG GGCCGTGCTG
CTCCTGCTGG TGCTGAGCGT GCTGCTGATG TTCGCCGGAG GCATCGAGGA CTCCGCCGCC
ACCTGGGGCG CGGTGTACAT GACCTCGGAA CTGGAGGCGT CCCTGTTCCT GGCGGCGATG
CCGTTCGTGG CCTGCCAGGC GATGATGACG CTGGGTCGCC TGGCCGGCGA CCGGGTGACC
GACCGGTTCG GCGCTGCCGC CGTGGGACGC GCGTGCGGTC TGCTGGCGGG CGGCGGAATC
GCGTTCGCCC TGCTGGTACC GAACCCGGTG GCCACGGTCA TCGGCTTCGG GGTGATGGGA
CTGGGCGTGT CCACGCTCTT CCCGCTGACC CTGGCGGCGG CGGGGAACGT GCCGGGCGTG
CGCACCGGGG ACGGGATCAC CGTCGTCGGG TGGCTGGGCC GCGCGGGCTT CCTGGCCTTC
CCGCCGCTGG TGGGCTTCCT CGCCGACTCC TCCAGCCTGG GGAACGCGCT GTGGGTGATC
GCGGGTGCCG GTGTGGGCGC CTTCCTGCTC GCCTTCGCCC TGCGTCCGCG CGTGTGA
 
Protein sequence
MNVQTRTESP PPEARRARVA VSTLFFVNGF TYTNAVPWLP VLKAQLGLSN TELGLAIAAM 
PTGAILTGML AGPLIHWFGS GRTAVGTSLI SLGALPFIAL AQNWWMLAAA LFVLGSADAW
TDSAQNSHGL RVQRRYRRSI INTFHALWSM AAVAGGLLGA AMAGTGVPIL WHLGGVAAVL
VCVNLAVSRM LLPGPESSER EDGTDAGSGR RLRVPGRAVL LLLVLSVLLM FAGGIEDSAA
TWGAVYMTSE LEASLFLAAM PFVACQAMMT LGRLAGDRVT DRFGAAAVGR ACGLLAGGGI
AFALLVPNPV ATVIGFGVMG LGVSTLFPLT LAAAGNVPGV RTGDGITVVG WLGRAGFLAF
PPLVGFLADS SSLGNALWVI AGAGVGAFLL AFALRPRV