Gene Ndas_1374 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNdas_1374 
Symbol 
ID9245224 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardiopsis dassonvillei subsp. dassonvillei DSM 43111 
KingdomBacteria 
Replicon accessionNC_014210 
Strand
Start bp1688985 
End bp1690502 
Gene Length1518 bp 
Protein Length505 aa 
Translation table11 
GC content73% 
IMG OID 
ProductABC transporter related protein 
Protein accessionYP_003679312 
Protein GI297560338 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value0.770006 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCGACA CCTCCGCCGC GCCCCCGGCC GACACCGACG TCGTGGCCCG TGTCCGCGGC 
GCGACCAAGA GCTACCCCGG CGTCCGGGCG CTGGACCGCG CCGACTTCGA GATCAGTGCC
GGAGAGGTCC GCGCGCTGCT CGGACGCAAC GGCGCCGGAA AGTCCACCCT GATCCGCCTG
CTGTCCGGCG TGGAGACACC CGACGAGGGC GAGATCGAGA TCGGCGGCCA GCCGCTCGGC
CAGGGCGGCA TCCGCCGGGC GGCCAAGCTC GGCGTGGGCA CCGTCTACCA GGAGCTCAGC
CTGGTCCCGG AGCTGTCGGC GGCCGAGAAC CTCTACCTCG GCACGTGGCC CAAGGCCGCC
GGGCGCATCG ACTACGGCCG CATCAGGGCC GGGGCCGAGG AGGTCTTCGC CGAACTCGGC
GTCGACATCG CCCCCGACAC CCGGGTCGGC GAACTGCCCC TGGCCCAGCA GCAGCTCGTG
GAGATCGCGC GGGCCTTCCG CGCCCGGCCG CGCCTGCTCA TCCTCGACGA GCCCACCAGC
GCGCTCGCCG CCGGTGAGGC CGAGACCGTC CTCAAGGCCG TCGAGCGCGT CGCCTCCCGC
GGCGTCGGCG TCATCTACGT CAGCCACCGC CTGGACGAGA TCCGCCGGGT CGCCGACACG
GTCACCGTCA TGCGCGACGG CGCCGTCGTG GAGACCACGC CGGTCCGCGG CGCCACCACC
CGGCACATCG TCTCCCTCAT GCTCGGAGGC GAGACCAAGG AGGAGAACAG GCCGGTGCGC
CGCGCCGCCG CCTCCGGAAC ACCGCTCCTG TCGGTCCGCG ACCTGGCGGT ACCGCCCAAG
GTCGACGGCG TCTCCTTCGA CCTGCACCCC GGCGAGGTCC TGGGCCTGGG CGGCCTCATG
GGCTCGGGGC GGACCGAGAT CCTGCGCGCG CTCGCGGGCT TCACCCCCTC ACGGGGGACC
GTGGAGGTGG ACGGCTCGCC GGTGGCCCGC CCCACCCCCC GCGCGATGAA GCGCCTGGGC
GTGGGCATCA CCCCCGAGGA CCGCAAGGGA GAGGGCGTCG TCCCCCTCCT GGGCGTCTCC
GAGAACATGG TCATGACCTG GTTCGGCGGA GCCTCCAAGG CGGGGACCGT GCTCCCCTCC
CGCGTCTCGG GGATCGGCCG GGGCCTCATC GACCGGCTCT CGATCAAGGC CGCCGCGACC
GACACCCCCA TCGTCAACCT CAGCGGCGGC AACCAGCAGA AGGCCGTCAT CGGCCGCTGG
CTGCACGCCG GGAGCCGCAT CCTGCTGCTG GACGAGCCCA CCCGCGGCGT GGACGTCGAG
GCCAAGGCCC AGATCTACGC CATCGTGCGC GAGCTGGCCG GACAGGGGGC CGCGGTCCTC
TTCGTCTCCA GCGAACTGGA GGAGCTCCCG CTCGTGTGCG ACCGCGTGCT CGCCCTGCGC
GGAGGACGGC TGCAAGGCGA GTTCACCGGC GACGACATCA CCCTGGACAA CATCATGGCC
GCCGCGATGG CGGCGTGA
 
Protein sequence
MSDTSAAPPA DTDVVARVRG ATKSYPGVRA LDRADFEISA GEVRALLGRN GAGKSTLIRL 
LSGVETPDEG EIEIGGQPLG QGGIRRAAKL GVGTVYQELS LVPELSAAEN LYLGTWPKAA
GRIDYGRIRA GAEEVFAELG VDIAPDTRVG ELPLAQQQLV EIARAFRARP RLLILDEPTS
ALAAGEAETV LKAVERVASR GVGVIYVSHR LDEIRRVADT VTVMRDGAVV ETTPVRGATT
RHIVSLMLGG ETKEENRPVR RAAASGTPLL SVRDLAVPPK VDGVSFDLHP GEVLGLGGLM
GSGRTEILRA LAGFTPSRGT VEVDGSPVAR PTPRAMKRLG VGITPEDRKG EGVVPLLGVS
ENMVMTWFGG ASKAGTVLPS RVSGIGRGLI DRLSIKAAAT DTPIVNLSGG NQQKAVIGRW
LHAGSRILLL DEPTRGVDVE AKAQIYAIVR ELAGQGAAVL FVSSELEELP LVCDRVLALR
GGRLQGEFTG DDITLDNIMA AAMAA