Gene Ndas_2304 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNdas_2304 
Symbol 
ID9246154 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardiopsis dassonvillei subsp. dassonvillei DSM 43111 
KingdomBacteria 
Replicon accessionNC_014210 
Strand
Start bp2752541 
End bp2754058 
Gene Length1518 bp 
Protein Length505 aa 
Translation table11 
GC content74% 
IMG OID 
ProductABC transporter related protein 
Protein accessionYP_003680232 
Protein GI297561258 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGGACCCAG CAGAGGTCCC CCCACCGCTC GCCCTGGTCG ACGTCACCAA GTCCTTCGGA 
AGCGTCCGGG CCCTGCGGGG GCTGTCCCTG GAACTGCGCT CCGGAGAGAT CCACGCACTG
GTGGGCGAGA ACGGGGCGGG CAAGTCCACC CTGGTCAAGA CCATCGCGGG GGTGCACCGA
CCGGACGGCG GGCAGGTGCT CGTGGACGGC CGCCCCGCGG AGTTCGCCGC GCCCGTGGAC
GCCCAGCGCG CCGGGGTCGC GGTCATCTAC CAGGAACCCA CGCTGTTCCC CGACCTGTCG
GTGGCCGAGA ACATCTTCGT CGGCCGCCAG CCCCGCACTC GCCTGCGCAC CATCGACCGC
GGCCGCATGC GCCGCGACAC CCGGGAGGTC TTCGCGCGCC TGGGCGTGGA CATGGACCCC
GACCGCCCCG CACGCGGGCT GTCCATCGCC GACCAGCAGC TCGTGGAGAT CGCCAAGGCC
CTCACCCGCC AGGCGCGGGT CCTGGTCATG GACGAGCCCA CCGCCGCCCT GTCCGGTGTG
GAGGCCGAAC GCCTGTTCAC CGTCGCCCGC ACCCTGCGCG ACTCCGGTGC GGCGCTGCTG
TTCATCTCCC ACCGCTTCGA CGAGGTCTTC GCCCTGTGCG ACCGCGTCAC CGTGGTCCGC
GACGGCGCGT TCGTCTCCTG CGACCCCACC GGCGACCTGG ACGTGGACAC CGTCGTGCGC
CGCATGGTCG GCCGCGAGGT CAGCAGCCTC TACCCCAAGG AGGAGGCCGA GCGCGGCGAG
GTCCTGCTGG AGGTCGACGG CCTGACCCGC CACGGCGTGT TCGCCGACGT GTCCTTCTCG
GTGCGCGCCG GGGAGATCGT CGCCCTGGCA GGGCTGGTCG GCGCCGGACG CAGCGAGGTC
GTCCGCGCCG TGTTCGGCGT GGACCGCTAC GACGCGGGCA CCGTGCGGGT GGAGGGCAGG
CCGCTGGCGC CGGGGCGGCC CCGGGCCGCC ATGGCCGCCG GGCTGGCCCT GGTCCCCGAG
GACCGCCGCC AGCAGGGCCT GGTCATGGAG TCCTCCATCG AGCGCAACGC CACCGCCACC
CGCCGCCGAG CGCTCAGCCG CCTGGGGCTG CTGCGCCCCC GAGCCGAACG CGACTCCGCC
CGCGAGTGGG GCGAGCGCCT GAACCTGAGG TTCGGCCGCC TCACCGACCC CGTCTCCACC
CTCTCCGGCG GCAACCAGCA AAAAGTCGTC CTGGCCAAGT GGCTGTCCAC CGATCCCCGC
GTGCTGTTCG TGGACGAGCC CACACGCGGC ATCGACGTGG GCACCAAGGC CGAGGTCCAC
CGCCTGCTGT CCGCCCTCGC GGGCCGGGGG CTGGCCATCG TCATGGTCTC CAGCGAACTC
CCCGAGGTCC TGGGCATGGC CGACCGGGTC CTGGTCATGC ACGAGGGCCG GATCACCGCC
CACCTGGACC GCGCGGAGGC CGACGAGGAG TCGGTCATGT ACGCGGCCAC CGGACAGCGG
CCGAGCGAGG CCGCATGA
 
Protein sequence
MDPAEVPPPL ALVDVTKSFG SVRALRGLSL ELRSGEIHAL VGENGAGKST LVKTIAGVHR 
PDGGQVLVDG RPAEFAAPVD AQRAGVAVIY QEPTLFPDLS VAENIFVGRQ PRTRLRTIDR
GRMRRDTREV FARLGVDMDP DRPARGLSIA DQQLVEIAKA LTRQARVLVM DEPTAALSGV
EAERLFTVAR TLRDSGAALL FISHRFDEVF ALCDRVTVVR DGAFVSCDPT GDLDVDTVVR
RMVGREVSSL YPKEEAERGE VLLEVDGLTR HGVFADVSFS VRAGEIVALA GLVGAGRSEV
VRAVFGVDRY DAGTVRVEGR PLAPGRPRAA MAAGLALVPE DRRQQGLVME SSIERNATAT
RRRALSRLGL LRPRAERDSA REWGERLNLR FGRLTDPVST LSGGNQQKVV LAKWLSTDPR
VLFVDEPTRG IDVGTKAEVH RLLSALAGRG LAIVMVSSEL PEVLGMADRV LVMHEGRITA
HLDRAEADEE SVMYAATGQR PSEAA