Gene Ndas_1838 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNdas_1838 
Symbol 
ID9245688 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardiopsis dassonvillei subsp. dassonvillei DSM 43111 
KingdomBacteria 
Replicon accessionNC_014210 
Strand
Start bp2245449 
End bp2247284 
Gene Length1836 bp 
Protein Length611 aa 
Translation table11 
GC content74% 
IMG OID 
ProductABC transporter related protein 
Protein accessionYP_003679772 
Protein GI297560798 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.355315 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value0.263806 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGACCGCGG GCACGGACCG TCTCCAGCCC CCGGTGACGC GGTGGGTGCT TCCCGCCCTG 
GTCCGGTACT CCTTCTCCGT GGCACCGGGT GCGGCCGCCC TGACCATGAT GCTGGCCGTC
CTCGCCGGTG CGGCCCCGGT CGGCATCACC ATCGGGGTGG GCGCACTGGT GGACGGGCTC
ACCTCGGCCG CCGGCCAGGG GCTGGACGGT CCGGCGGCGC GGGAGTGCTA CCGGTGGGTC
GCCCTCATCG CCGTGCTCTT CCTCCTCACC CACGTCACCG AGTCGGCGCG CACGGCTCTG
GGACGGGCCC TGGGCCGCCG GGTCACCGGC CGACTGAGGG AGCGCGTGAT GGCGGCCGCC
TCGGAACCGG CGACCGTCGG CCACCTGGAG GACCCGGCCT ACCAGGGCAG ACTCGGGCGT
GCGCGGGGCG AGGGGACCAT CGACATGCCG CCGGGGGAAG CGGTCTTCGG CCTGTCGGCC
AAGGCCTCGT TGTGGGTGAC CGCCGTCGGC TCCGCGGCCC TGCTCACCCA GGTCGCCTGG
GGACTGGGGC TCATCGTCTT CGCGGTCTTC GCGGTGCTCC ACCTCCGGCT GGTCCGCAAC
TACCGGACCG CCGTCGTCGA GACCGTGAAC CAGACGCGGA AGCTGCGCCG TACGTCCTAC
CTGCGGGACG TGCCGACCAC GCCGGACGCG GCCCGGGAAG TGCGCCTCTT CGGGCTCACG
GCCTTCTTCC GCGACGCCTA CCACGCCGAG TGGCGCGTGA ACATGACCGA GGTCTGGCGG
CGACGGCGCG AGCACCGCGT GTTCGTCGTC GTGATCGTCG TGGTCACCGG CCTCACGGTC
GCCTCGGTCT TCTACTATCT GGCACACCGG GCCGCCGCCG GAGGCACGAC GGTGGGCGAC
CTGGCGCTGG GCGGCCTGGC CCTGCGGGCC CTCCTCCAGG TGCTCCGGAC GGACGAGGAC
GACCTGCGCA CCAGCTTCGG CTCCAAGGCG GCCGCCGAGG CGTTCGCCCT TCCGGAGGCC
GACCCGCCCG GCGGGTCCGA CCCCGAGCCC TGGACGGAGC CGGTCGCGAC CGTCTCCTGC
GAGGGGCTGC GCTTCCGCTA CGGCGGAGCC GCGGGCGACG TGCTGCACGG CATCGACCTC
GACGTCCCCG AGGGGCAGTC GCTCGCCATC GTGGGCCTCA ACGGAGCCGG CAAGACGACG
CTGGCCCGGC TCCTCGCCGG TCTGGACGCT CCGAGCGGGG GCCGCCTGCG CGTCGGCGGA
ACCACGGTCG AGGACGCCAA CCGCCGGAGC TGGCAGCGCC GGGTGGTCGC CGTCTTCCAG
GACTTCGGCC GCTACGAGCT GACCGTGCGG GACAACATCG CCTTCGGCTC CCTGGCCCAC
GCCGACGACG AGGAGGGCCT GCGGGAAGCG GCGCGCCAGG CCGGACTGCT GGAGTTCGTC
GAGGGACTCC CGAAGGGCTG GGACACGGTG ATGTCCAGCG GGTACGAGGG CGGGGTCGAC
GCGTCGGGCG GCGAGTGGCA GCGCGTCGCC ATCGCCCGGG CGCTGTTCGG CCTGCGGCAC
GGCGCACGGC TCCTGATCAT GGACGAGCCG GCCGCGAGCC TGGACGCCCG CGCCGAGGCC
CGGCTCTACG ACACGTTCCA CGAGCTCACC GCGGGGGCCA CGACCGTCGC GATCTCCCAC
CGGTTCGCGA CCGTGCGCAG GGCGGAGCGG GTGGTCGTCC TCGACCGGGG CCGCATCGTC
GAGGACGGCA CGCACGAGGA CCTGATGAGC GCCGACGGCC GGTACGCCGA GCTGTTCCGT
CTGCAGGCGA AGCGTTTCGA GGAGGCGTCA TCGTGA
 
Protein sequence
MTAGTDRLQP PVTRWVLPAL VRYSFSVAPG AAALTMMLAV LAGAAPVGIT IGVGALVDGL 
TSAAGQGLDG PAARECYRWV ALIAVLFLLT HVTESARTAL GRALGRRVTG RLRERVMAAA
SEPATVGHLE DPAYQGRLGR ARGEGTIDMP PGEAVFGLSA KASLWVTAVG SAALLTQVAW
GLGLIVFAVF AVLHLRLVRN YRTAVVETVN QTRKLRRTSY LRDVPTTPDA AREVRLFGLT
AFFRDAYHAE WRVNMTEVWR RRREHRVFVV VIVVVTGLTV ASVFYYLAHR AAAGGTTVGD
LALGGLALRA LLQVLRTDED DLRTSFGSKA AAEAFALPEA DPPGGSDPEP WTEPVATVSC
EGLRFRYGGA AGDVLHGIDL DVPEGQSLAI VGLNGAGKTT LARLLAGLDA PSGGRLRVGG
TTVEDANRRS WQRRVVAVFQ DFGRYELTVR DNIAFGSLAH ADDEEGLREA ARQAGLLEFV
EGLPKGWDTV MSSGYEGGVD ASGGEWQRVA IARALFGLRH GARLLIMDEP AASLDARAEA
RLYDTFHELT AGATTVAISH RFATVRRAER VVVLDRGRIV EDGTHEDLMS ADGRYAELFR
LQAKRFEEAS S