Gene Ndas_0983 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNdas_0983 
Symbol 
ID9244828 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardiopsis dassonvillei subsp. dassonvillei DSM 43111 
KingdomBacteria 
Replicon accessionNC_014210 
Strand
Start bp1203098 
End bp1204543 
Gene Length1446 bp 
Protein Length481 aa 
Translation table11 
GC content72% 
IMG OID 
ProductGeneral substrate transporter 
Protein accessionYP_003678933 
Protein GI297559959 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.359093 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGAACGAGC CGCAGCCGCA GGGGCGGACG GCCCTGACCA CGGACCGCAG GGCGACGCGC 
AAGGCCGTGG TGGCCGCGGC GATCGGCAAC GCGACCGAGT GGTACGACTT CGGCGTCTAC
AGCTACCTGG CCGTCACGAT CGGCCTGGTG TTCTACCCGG CGCAGACCCA GGGCACCCAG
CTCATCGCCA CCTTCACCAC CTTCGCCGCC GCCTTCCTGG TGCGGCCCCT GGGCGGGCTG
TTCTTCGGCC CCCTCGGCGA CAGGATCGGC CGCAAGCGCG TCCTGGCCTT CACGATGCTG
CTCATGGCGG TGAGCACGTT CTCGATCGGG CTCATCCCCT CGGCCGCGAG CATCGGCTTC
GCCGCGCCCG TGCTGCTGCT GGTCGCGCGG ATGCTCCAGG GCTTCTCCAC CGGCGGCGAG
TACGGCGGCG CCACGACCTT CATCGCCGAG TACGCGCCCG ACCGGCGGCG CGGCTTCCTG
GCCTCCTGGC TGGAGTTCGG CACGGTCAGC GGCTACGTCG GCGGCGCCAC CGTCGTCACG
GTGATGACCC TCCTGCTCGG TTCGGACACC ATGCAGGACT GGGGGTGGCG CGTCCCCTTC
CTGGTCGCGC TGCCGCTGGG CGCCGTCGGC CTGTACCTGC GGGTGAAGCT GGAGGACACC
CCCGTCTTCG AGCAGAACAC CGGGGGCTAC GCCAAGGACT CCCACGGCGG GCACCGCGAG
GGACAGCTGC GGGCGACGGT CGTGGACCAG TGGCGCCACA TCCTGCTGTG CGTGGGCCTG
GTGATGGTCT TCAACGTCAA CAACTACGTC CTGACCGCGT ACATGCCCAC CTACCTGGAG
GCGGAGCTGG GGTACGGCCC CACCACGGCT CTGGTGCTGA CGCTGGCGGC GATGGTGCTG
ATGCTGTTCG CGGTGACCGG GTTCGGACAC CTGAGCGACC GCGTGGGGCG CAGGCCCGTG
CTGCTCTCGG GCAGCCTGTT CTCGATCGTG CTGGCCCTGC CCGCCTTCTG GCTGCTGCAA
CAAGGGGGCC CGTGGACGGT GGCCCTGGGC ATGGTGGTGC TGGCGGTGAC CCTGGTGCAC
TTCTCCGGCG GCGCGCCCGC GGCGCTGCCG GCGTTCTTCC CCACCAGCGT GCGCTACGGC
GCGCTGGCCA TCAGCTTCAA CGTGTCGGTG GCGCTGTTCG GCGGCACCAC CCCGCTGGTC
GCCGAGGCGC TGGTGCAGGC CACCGGAAAC CTCTACGCAC CGGCGTGGCT GGTGATGGTC
GCGGGAGTGG TGGGGCTGGT GGTGGTGTGG CGGATGAAGG AGAGCGCGAA CCGCCCGCTG
CCCGGCGCCC CCGCGATCCC CGTCCCCGGG GAGGAGGGGG GCCGACCGCC CCGCTCCCGC
AAGGGGGGAA CGACGGGGAG CCACCCGCCG CAGGGCAACG TCCGCCGCCT GAGCGGGGAG
GCCTGA
 
Protein sequence
MNEPQPQGRT ALTTDRRATR KAVVAAAIGN ATEWYDFGVY SYLAVTIGLV FYPAQTQGTQ 
LIATFTTFAA AFLVRPLGGL FFGPLGDRIG RKRVLAFTML LMAVSTFSIG LIPSAASIGF
AAPVLLLVAR MLQGFSTGGE YGGATTFIAE YAPDRRRGFL ASWLEFGTVS GYVGGATVVT
VMTLLLGSDT MQDWGWRVPF LVALPLGAVG LYLRVKLEDT PVFEQNTGGY AKDSHGGHRE
GQLRATVVDQ WRHILLCVGL VMVFNVNNYV LTAYMPTYLE AELGYGPTTA LVLTLAAMVL
MLFAVTGFGH LSDRVGRRPV LLSGSLFSIV LALPAFWLLQ QGGPWTVALG MVVLAVTLVH
FSGGAPAALP AFFPTSVRYG ALAISFNVSV ALFGGTTPLV AEALVQATGN LYAPAWLVMV
AGVVGLVVVW RMKESANRPL PGAPAIPVPG EEGGRPPRSR KGGTTGSHPP QGNVRRLSGE
A