Gene Ndas_3426 details


Gene Information

Locus tag: Ndas_3426
Symbol:
ID: 9247293
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111
Kingdom: Bacteria
Replicon accession: NC_014210
Strand:
Start bp: 4095555
End bp: 4097219
Gene Length: 1665 bp
Protein Length: 554 aa
Translation table: 11
GC content: 67%
IMG OID:
Product: ABC transporter related protein
Protein accession: YP_003681337
Protein GI: 297562363
COG category:
COG ID:
TIGRFAM ID:
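
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming 1-based inclusive coordinates and a CDS that ends in a single stop codon; the function names are hypothetical and are not part of the source database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive start and end coordinates."""
    return end_bp - start_bp + 1


def protein_length_from_cds(cds_length_bp: int) -> int:
    """Number of amino acids encoded by a CDS, assuming one terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1  # subtract the stop codon


# Values taken from the record above.
length_bp = gene_length(4095555, 4097219)
print(length_bp)                           # 1665
print(protein_length_from_cds(length_bp))  # 554
```

Both results agree with the Gene Length and Protein Length fields listed in the record.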


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 16
Fosmid unclonability p-value: 0.543642
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCCGGAGT ACATCTACAC CATGAACAAC GTGCGCAAGG CGCACGGCGA CAAAGTCGTC 
CTGGACAACG TTTCGGGGTC GTTCCTGCCC GGAGCCAAGA TCGGCGTCGT GGGCCCCAAC
GGTGCGGGTA AGTCGACGCT TCTGAAGATC ATGGCCGGCA TCGAGCAGCC GTCCAACGGC
GAGGCGAGGC TCATGCCCGG TTTCACCGTC GGACTGCTCG CGCAGGAACC CCACCTCGAC
CCGGACAAGA CCGTCCTGGA GAACGTCGAG GACGGCGTCG CCGAGACCAA GGCGATGCTG
ACCCGCTTCA ACGAGATCGC CGAGCAGATG GCGACGGACT ACTCCGACGA CCTGCTCGAG
GAGATGGGCA AGCTCCAGGA GCAGCTCGAC CACCGGGGCG CCTGGGACCT GGACAGCCAG
CTGGAGCAGG CGATGGACGC GCTGCGCTGC CCGCCCGGCG ACGCCTCCGT CACCCAGCTC
TCCGGTGGCG AGAAGCGCCG CGTGGCCCTG TGCAAGCTCC TGCTGGAGCA GCCCGACCTG
CTGCTGCTCG ACGAGCCCAC CAACCACCTC GACGCCGAGA GTGTGAACTG GCTGGAGCAG
CACCTGGCCA AGTACCCGGG CACCATCATC GCGATCACAC ACGACAGGTA CTTCCTGGAC
CACGTCGCCA CGTGGATCCT GGAGCTGGAC CGGGGCCAGT TCTACCCCTA CGAGGGCAAC
TACAGCGTCT ACCTGGAGAC TAAGCAGGCC CGCCTGAAGG TCGAGGGGCA AAAGGACGCC
AAGAAGGCCA AGCGGCTCAA GGACGAGCTG GAGTGGGTCC GCTCCAACGC CAAGGCCCGC
CAGACCAAGA GCAAGGCCCG CCTCCAGCGC TACGAGGAGA TGGCCGCCGA GGCCGACAAG
ACCCGCAAGC TGGACTTCGA GGAGATCCAG ATCCCGCCGG GCCCGCGCCT GGGCAACACC
GTGGTCGAGG TCAAGAACCT CACCAAGGGC TTCGGCGACC GCGTGCTCAT CGAGGACCTG
AGCTTCTCGC TGCCGCCCAA CGGCATCGTC GGCGTCATCG GCCCCAACGG TGTCGGCAAG
ACGACGCTGT TCAAGATGAT CGTCGGCGAG GAGACCCCCG ACCAGGGCAG GATCAACGTC
GGCGAGACCG TCGAGATCTC CTACGTCGAC CAGTACCGCG GCCGCATCGA CGACACCAAG
AACGTCTGGG AGACCGTCTC CGACGGCGAG ACCTTCATCC AGGTCGGCAA GGTCGAGATC
CCCAGCCGCG CCTACGTCGC CGCGTTCGGC TTCAAGGGCT CCGACCAGCA GAAGCCGTCC
GGTGTGCTCT CCGGCGGTGA GCGCAACCGC GTGAACCTGG CGCTCACCCT CAAGCAGGGC
GGCAACCTGC TGCTCCTGGA CGAGCCCACC AACGACCTGG ACGTGGAGAC CCTCGGCTCG
CTGGAGAACG CGCTGCTGGA CTTCCCCGGC TGCGCCGTGA TCACCTCCCA CGACCGCTGG
TTCCTCGACC GCGTCGCCAC GCACATCCTC GCGTGGGAGG GCGACGCCAA CTGGTACTGG
TTCGAGGGCA ACTTCGAGTC CTACGAGAAG AACAAGGTCG AGCGCCTGGG CCCGGACGCC
GCGCGCCCGC ACGCGGTCAC CCACCGCAAG CTCACCCGCG ACTGA
 
Protein sequence
MPEYIYTMNN VRKAHGDKVV LDNVSGSFLP GAKIGVVGPN GAGKSTLLKI MAGIEQPSNG 
EARLMPGFTV GLLAQEPHLD PDKTVLENVE DGVAETKAML TRFNEIAEQM ATDYSDDLLE
EMGKLQEQLD HRGAWDLDSQ LEQAMDALRC PPGDASVTQL SGGEKRRVAL CKLLLEQPDL
LLLDEPTNHL DAESVNWLEQ HLAKYPGTII AITHDRYFLD HVATWILELD RGQFYPYEGN
YSVYLETKQA RLKVEGQKDA KKAKRLKDEL EWVRSNAKAR QTKSKARLQR YEEMAAEADK
TRKLDFEEIQ IPPGPRLGNT VVEVKNLTKG FGDRVLIEDL SFSLPPNGIV GVIGPNGVGK
TTLFKMIVGE ETPDQGRINV GETVEISYVD QYRGRIDDTK NVWETVSDGE TFIQVGKVEI
PSRAYVAAFG FKGSDQQKPS GVLSGGERNR VNLALTLKQG GNLLLLDEPT NDLDVETLGS
LENALLDFPG CAVITSHDRW FLDRVATHIL AWEGDANWYW FEGNFESYEK NKVERLGPDA
ARPHAVTHRK LTRD
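
The relationship between the gene sequence and the protein sequence can be reproduced with a short translation routine. This is a sketch, not the database's own method: it assumes the codon assignments of translation table 11, which match the standard genetic code for every codon (table 11 differs in which codons may act as start codons, and this gene starts with ATG, so that difference does not matter here). The names `CODON_TABLE` and `translate` are illustrative.

```python
from itertools import product

# Codon table in the conventional T, C, A, G ordering; '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str, to_stop: bool = True) -> str:
    """Translate a DNA string codon by codon.

    Whitespace (such as the 10-nucleotide blocks used above) is ignored.
    Unrecognized codons are rendered as 'X'.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE.get(seq[i:i + 3], "X")
        if aa == "*" and to_stop:
            break
        protein.append(aa)
    return "".join(protein)


# First 30 nucleotides of the gene sequence above.
print(translate("ATGCCGGAGT ACATCTACAC CATGAACAAC"))  # MPEYIYTMNN
```

The output matches the first ten residues of the protein sequence listed above; passing the full gene sequence should yield the full protein.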