Gene Ndas_4055 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNdas_4055 
Symbol 
ID9247927 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardiopsis dassonvillei subsp. dassonvillei DSM 43111 
KingdomBacteria 
Replicon accessionNC_014210 
Strand
Start bp4850381 
End bp4852024 
Gene Length1644 bp 
Protein Length547 aa 
Translation table11 
GC content75% 
IMG OID 
ProductABC transporter related protein 
Protein accessionYP_003681957 
Protein GI297562983 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0422007 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones23 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCGGCC CGCCCCTGCT CCGGATGAGC GGCATCACCA AGTCCTTCCT GGGCGTGCGC 
GTCCTGCACG GGATCGACCT GGAACTCCAC CCCGGCGAGC TGCACGCCCT GGTCGGCGAG
AACGGGGCGG GCAAGTCCAC CCTGATGAAG GTGCTGGCCG GGGTGCACCG CGCGGACGGG
GGCACGGTCG AGCTGGAGGG CGGCACCGTC TCCTTCGAGC ACCCCGTCCA GGCCCAGCGC
GCGGGCGTGA CCACGGTCTT CCAGGAGTTC AACCTCCTGC CCGACCGCAC CGTCGCCGAG
AACGTCTTCC TCGGCCGCGA GATCCGCCGC CGCGGCCTGG TGGACGCCCG GGCCATGGAG
CGGGCCACCG CCGAACTGCT CGCCGAACTC GGCCTGGAGG GCATCGACCC CCGGGCCCGG
GTGCGGTCGC TGTCGGTGGC CGAACAGCAG ATCGTGGAGA TCGTCAAGGC GCTCTCGCAC
GACGCGCGCA TCATCTCCAT GGACGAGCCG ACCGCCGCGC TGGCCGACCA CGAGGTGGAG
GTGCTCTACC GGATCATCGG CCGCCTGCGC GAACGCGGCG TGGCGGTGCT GTACGTGTCG
CACCGCATGC GGGAGATCTT CGACCTGGCC GACACCATCA CGGTGCTCAA GGACGGCCAT
CTCGTGGACA CCGTCCCCGC CGGTGAGATC GGTCCGGCCG AGCTGGTCCG CAAGATGGTC
GGGCGTCCGG TCTCGGCGGT CTTCCCCGAG CCCCTGGAGC CGCACGGCGA GCACGTGGGA
CGGGTGCGGC TGTCGGTCAC CGGAGGCGGC AACACCCAGC TGGACGGGAT CGGCTTCGAG
GTGCGCGGCG GCGAGATCCT GGGCCTGGGC GGGCTCCAGG GCAGCGGGCG CACCGAGGTC
GCCCACGCCC TCTTCGGCGT CGAGCGCTTC ACCCGGGGCG AGGTCCGCGT GGACGGGCGG
CGGGTGGACC CGCGCTCGCC GCGCACGGCG GTGCGGGCGG GCCTGGTGCT GGTCACCGAG
GACCGCAAGG CGCAGGGGCT GGCGCTGAAC CAGTCGGTGG CGGCCAACGG CCGCCTGGTC
CTGGACGCGG TCTGGCCCCT GGGCTCGGCG CGCGGGGCCC GGCGGCTGCC CGGCATCCTC
TCCTCCCTGG AGCTGGTGGC GCGCGGCGGC CAGGACCAGG AGGTCCGGTA CCTGTCCGGC
GGCAACCAGC AGAAGGTCGT GCTGGCCAAG TGGCTGGCCG CCGAACCCGG CGTGATGGTG
CTCGACGAGC CCACGCGCGG CATCGACGTG GGCGCCAAGC AGGCCGTCTA CCGGCTCATG
CGCGAGCTGG CCGCGGCCGG TGTGGCGATC GTGCTCATCT CCTCCGAGCT GCCCGAGCTG
ATCGGCATGT CCGACCGGCT GGTCGTCCTG CGGGACGGCC GGGTGGCGGG CGAGCTGCCC
GGCGGGGCCG CCGAGGAGGC GGTCATGGCG GTGGCCACCG GATCGCCGCA CCCGGGCGGG
TCCGCCGCAC CGGTTCCCGG GCAGGACCCG GCCGCCGCTC CACCGCGCCC GGTCGCCCCC
GCCCCACCGA CCGCGGGGGG CGACCCGGCC GCCGGAACCG ACAACGGCGG CTCCCCCGCA
CGGCACGAGG AGGCCGCCCC GTGA
 
Protein sequence
MSGPPLLRMS GITKSFLGVR VLHGIDLELH PGELHALVGE NGAGKSTLMK VLAGVHRADG 
GTVELEGGTV SFEHPVQAQR AGVTTVFQEF NLLPDRTVAE NVFLGREIRR RGLVDARAME
RATAELLAEL GLEGIDPRAR VRSLSVAEQQ IVEIVKALSH DARIISMDEP TAALADHEVE
VLYRIIGRLR ERGVAVLYVS HRMREIFDLA DTITVLKDGH LVDTVPAGEI GPAELVRKMV
GRPVSAVFPE PLEPHGEHVG RVRLSVTGGG NTQLDGIGFE VRGGEILGLG GLQGSGRTEV
AHALFGVERF TRGEVRVDGR RVDPRSPRTA VRAGLVLVTE DRKAQGLALN QSVAANGRLV
LDAVWPLGSA RGARRLPGIL SSLELVARGG QDQEVRYLSG GNQQKVVLAK WLAAEPGVMV
LDEPTRGIDV GAKQAVYRLM RELAAAGVAI VLISSELPEL IGMSDRLVVL RDGRVAGELP
GGAAEEAVMA VATGSPHPGG SAAPVPGQDP AAAPPRPVAP APPTAGGDPA AGTDNGGSPA
RHEEAAP