Gene Ndas_3402 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNdas_3402 
Symbol 
ID9247269 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardiopsis dassonvillei subsp. dassonvillei DSM 43111 
KingdomBacteria 
Replicon accessionNC_014210 
Strand
Start bp4068167 
End bp4070035 
Gene Length1869 bp 
Protein Length622 aa 
Translation table11 
GC content71% 
IMG OID 
ProductABC transporter related protein 
Protein accessionYP_003681313 
Protein GI297562339 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACAATGA TGTCCGGCAG GCCCTCCTAC AGGGCCCTGG TCAACGACGG GAGCGCGAAG 
GGGCAGTCAC TGCCCCCCGG AATCACCCGA CGCATCATCT CCTACGCCCG CCCGCACTGG
CGGGTGATCC TGTGTTTCCT GTTGGTGACG ACGGTCGGCG CCGGGATCGT GGTCGCCAAC
CCGCTCCTGC TCAAGGCCAT CATCGACCGC GGCATCCTGA CCGGGAACAC CGCGCTGGTG
GTGTGGCTGG CGCTCGCCGC GGCGGGGCTG GCGGTGCTGG AGAGCGGACT GACCCTGCTC
GGCAGGTGGC TGTCCTCCCG GATCGGCGAG GGGGTCATCT ACCAGCTGCG CACGCAGGTG
TTCACCCACG TCCAGCGGAT GCCGGTGGCC TTCTTCACCC GTACGCAGAC GGGGTCGCTG
ATCAGCCGTC TGAACACGGA CGTGGTGGGC GCGCAGCGGG CGATCACCTC CGTCCTGCAG
TCGGTGGTGT CCAACGTGGT GAGCGCGACC GCGGTGATCG TGACGATGAT CGCCCTGTCG
TGGCAGGTGA CGCTGATCGC GCTGGCCCTG GTTCCGCTGT TCGTGGTCCC CGCCAAGGTG
ATCGGGCGAC GGCTGGCGCA CATCTCCCGC GACGCGATGG ACACCAACGC GGACATGAGC
TCGCTGATGA CCGAGCGGTT CAACGTCGGC GGGGCGATGC TGGTCAAGCT GTACGGACGC
CCCGAGGAGG AGTCGGCGGG GTTCGCCCGG CGCGCGAGCC GGGTCCGGGA CCTGGGCGTG
CGCCAGGCGG TGTTCGGCTC GCTGCTGTTC AGCATGCTGG GCCTGATCAC GGCGCTGGCC
ATGGCGATGG TCTACGGGGT CGGCGGCGTG CTGGCGATCG GCGGGGCCTT CGAGGTGGGC
ACGCTGGTGG CGCTGACCAC CCTGCTCGCC CGCCTGTACG GGCCGGTGAC GACGCTGTCC
AACGTGCACG TGGAGATCAT GACCGCGCTG GTCTCCTTCG ACCGGGTCTT CGAGGTGCTG
GACCTGGAAC CGGGCATCAG GGAGAGCCCC GACGCACGGA ACCTGCCCGG GGAGCGTCTG
GGCGTGGAGT TCGACAACGT GTCGTTCCGC TACCCGGCGG CGAAGGAGTC GTCGGTGGCG
TCGCTGGAGC TGACGCCGCA GGCGTCGGTG GACGAGGACA CCCAGGTGCT GAGCGGGGTG
TCGTTCCGGG CGGAGCCGGG GACGATGGTG GCCCTGGTGG GGCCGTCCGG AGCGGGCAAG
ACGACGCTGA CGCACCTGGT GTCGCGCCTG TACGACCCGA CCGAGGGCCG GGTGCTCGTC
GGCGGGCTGG ACCTGCGCGA GGTGACCGGC GACTCGCTGC GCGAGGCGGT CGGCGTGGTC
ACCCAGGACG CGCAGCTGTT CCACGACACC GTGGGGGCGA ACCTGCGGTA CGCCCGGCCG
GAGGCGACCG ACGCCGAGCT GGAGGAGGTG CTGCGGATGG CGCGGCTGGG CACCCTCCTG
GACCAGCTGC CCAACGGGCT GGACACGATG GTCGGCGACC GCGGGTACCG GCTGTCGGGC
GGTGAGAAGC AGCGGCTGGC GATCGCGCGT CTGCTGCTGA AGGCGCCGTC GGTGGTGGTG
CTGGACGAGG CGACGGCCCA CCTGGACTCC GGGTCCGAGG CGGCAGTGCA GGAGGCGCTG
TCCGTGGCGC TGGAGGGCCG GACCTCGCTG GTGATCGCGC ACCGGCTGGC GACGGTGCGC
GAGGCGGACC AGATCCTGGT GCTGGAGGAC GGCCGGATCC TGGAGCGCGG CACGCACGAC
GAGCTGCTGG TCCAGGGCGG GCTGTACACG GCGCTGTACC GGACCCAGTT CGCTCCGCAG
AGCCGGTAG
 
Protein sequence
MTMMSGRPSY RALVNDGSAK GQSLPPGITR RIISYARPHW RVILCFLLVT TVGAGIVVAN 
PLLLKAIIDR GILTGNTALV VWLALAAAGL AVLESGLTLL GRWLSSRIGE GVIYQLRTQV
FTHVQRMPVA FFTRTQTGSL ISRLNTDVVG AQRAITSVLQ SVVSNVVSAT AVIVTMIALS
WQVTLIALAL VPLFVVPAKV IGRRLAHISR DAMDTNADMS SLMTERFNVG GAMLVKLYGR
PEEESAGFAR RASRVRDLGV RQAVFGSLLF SMLGLITALA MAMVYGVGGV LAIGGAFEVG
TLVALTTLLA RLYGPVTTLS NVHVEIMTAL VSFDRVFEVL DLEPGIRESP DARNLPGERL
GVEFDNVSFR YPAAKESSVA SLELTPQASV DEDTQVLSGV SFRAEPGTMV ALVGPSGAGK
TTLTHLVSRL YDPTEGRVLV GGLDLREVTG DSLREAVGVV TQDAQLFHDT VGANLRYARP
EATDAELEEV LRMARLGTLL DQLPNGLDTM VGDRGYRLSG GEKQRLAIAR LLLKAPSVVV
LDEATAHLDS GSEAAVQEAL SVALEGRTSL VIAHRLATVR EADQILVLED GRILERGTHD
ELLVQGGLYT ALYRTQFAPQ SR