Gene Ndas_1093 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNdas_1093 
Symbol 
ID9244939 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardiopsis dassonvillei subsp. dassonvillei DSM 43111 
KingdomBacteria 
Replicon accessionNC_014210 
Strand
Start bp1342475 
End bp1343824 
Gene Length1350 bp 
Protein Length449 aa 
Translation table11 
GC content72% 
IMG OID 
Productmajor facilitator superfamily MFS_1 
Protein accessionYP_003679041 
Protein GI297560067 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value0.672028 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGCCGTCGT ACCGCAGACT GCTCCACGAG TACCCCACCG GAGCCCGGCG CCGGCTCCTG 
CTCGCCGTCG TCGTCCTGGC GTTGTTCATC TCGGCCTTCG AAGGACAGCT CGCACCGGTC
CTGCCCCTGC TGCTGGCCGA CCTCGGGCTG TCCCTGGAGG TCTACGGGCT GATCACCGCC
GTGTCGCTGC TGTTCGGCGC CGTGTCCGGT TACCTGGGCG GCGAGTTGGT CGACCGGATC
GGCCGCGTCC GCGTACTGGT GCCGTTCATG TTCCTGTCCG CGGCGGCGTG CCTGTTCATG
GCGCTGTCGC AGACGGTCGT CCACTTCACC GCCGCGCGTA TCCTGCTCGC CTTCGTCGAG
GGTGTGGCGA TGGCCGGCAC CCAGCCCCTG ATCCGGGACT TCACGCCGCG GATGGGACGG
GCCCAGGCCT TCGCGTTCTG GAGCTGGGGC CCCGTCGGAG CCAACTTCTT CGCGGCGGCC
GTCGCCGCGC TGACGCTCGA CCTGTTCGAC AACTCCTGGC GCGCGCAGAT CTTCGTGATG
TCCGGCCTGG CCTTCGTGGG CGCGACGGTC GTCGCCCTCA CCCTGCGCGA CCTGGCTCCG
AACCTGCGGC GCACCATCCG CCTCACCGAG CATGCGACCC GCGGGAGCGG GGCGGCCCCC
GCCGGGCAGC GGCTGCGGCT GCTCCTGCGC CGCCGTGTCG TCTGGGCGCA CGTGGCCGCG
ATGTCGCTGC TCTACGTCCT GCTCGCCACC ATGAACGCCT ACGGCCAGAC CATGCTGGTG
GACCACTTCG CCGTCGCGGT GCGCACGGCG TCCGCGATCG TGATGTCGTT CTGGGTCAGC
AACCTGGCGG CCAGCCTGGT CTTCGCCCGG CTCTCCGACA GGGCGCAGAA CCGCAAGCCG
TTCCTGGTCC TCGGGGCGCT GGCGGCGACG CTGCTGCTCG GGGCGCTCGT CGTCACGATG
GGCGCGGGGG CGTCCGCCTC ACTGCCGCTG GTCATCGTGC TGCTGACCGG CGTCGGCCTC
TCGCTCGGCG CCGTCCTCGG CCCGTGGATG GCCAGCTTCT CGGAGTACAC CGAGGAGGTC
CACCCGGACG CCCAGGGGGT GGCCTTCGGC CTCAACCACT TCGTGAGCCG GTTGTTCATC
CTCGCCGCGG TGCTGTTCGC TCCGCAGGTG GTCGCGGTGG GCGACTGGCG GGTGTGGATG
ACGGTCACCC TGGTGACGAC AGCGGCCTTC GCGGTCGTCA CCACACAGGT CCAGGGCCGC
CTGCGGCGCG CGACCGGTTC CGCGTCGGCG GAGGAGGCCG GTGCGGCCGT CGCCGCGACC
GGGGCGCCCG AGGACTCCCG GAGTTCCTGA
 
Protein sequence
MPSYRRLLHE YPTGARRRLL LAVVVLALFI SAFEGQLAPV LPLLLADLGL SLEVYGLITA 
VSLLFGAVSG YLGGELVDRI GRVRVLVPFM FLSAAACLFM ALSQTVVHFT AARILLAFVE
GVAMAGTQPL IRDFTPRMGR AQAFAFWSWG PVGANFFAAA VAALTLDLFD NSWRAQIFVM
SGLAFVGATV VALTLRDLAP NLRRTIRLTE HATRGSGAAP AGQRLRLLLR RRVVWAHVAA
MSLLYVLLAT MNAYGQTMLV DHFAVAVRTA SAIVMSFWVS NLAASLVFAR LSDRAQNRKP
FLVLGALAAT LLLGALVVTM GAGASASLPL VIVLLTGVGL SLGAVLGPWM ASFSEYTEEV
HPDAQGVAFG LNHFVSRLFI LAAVLFAPQV VAVGDWRVWM TVTLVTTAAF AVVTTQVQGR
LRRATGSASA EEAGAAVAAT GAPEDSRSS