Gene Ndas_4684 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNdas_4684 
Symbol 
ID9248566 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardiopsis dassonvillei subsp. dassonvillei DSM 43111 
KingdomBacteria 
Replicon accessionNC_014210 
Strand
Start bp5556877 
End bp5558487 
Gene Length1611 bp 
Protein Length536 aa 
Translation table11 
GC content76% 
IMG OID 
ProductABC transporter related protein 
Protein accessionYP_003682576 
Protein GI297563602 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones23 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCTTCCG CCTTCTCCGT CGTCTGCACG GGTCTGTCCT TCGCCTGGCC CGACGGCACG 
CCCGTCCTGA CCGGTCTGGA CGCCGCCTTC GGCACCGGGC GGACCGGGCT GGTCGGCCGC
AACGGCAGCG GCAAGTCCAC CCTGCTCCGC CTCGTCGCGG GGCGGCTGAC CCCGGCGTCG
GGCACGGTGG CCGTGGACGG CGACGTCGGC TACCTGGACC AGGGACTGAC CCTGGACACC
GGCCGCACCG TCGCCGAACT GCTGGGCGTC GACCGGGCCC GCACCGCGCT GCACGCCATC
GAGGCGGGCG AGGCCACCGA GGCGAACTTC GCCGCCGTCG GCGAGGACTG GGACGTGGAG
GAACGCGTCC TGGCCCAGCT CGAACGGTTC GGCGTCGCCC TGACGGGGGA CGCCCCGCTG
GACCGCCCCG TCGGCACCCT CTCCGGAGGG GAGGCCGTGC TGGTCGCGCT CGCGGGGCTG
GCGCTGCGCC GCCCCGCCGT CACCCTGCTG GACGAGCCCA CCAACAACCT GGACCGCCGC
GCCCGGGAGC GGCTCCACGA CGCGGTCGCG TCCTGGCAGG GGGTGCTCGT CGTGGTCAGC
CACGACCGCG AACTCCTGGA ACGCGTCGAC CAGATCGCCG AACTCCGGGA AGGGAGGATG
CGCGTGTGGG GCGGCAACTA CTCCGCCTAC ACCGAGCAGC TCGCCGCCGA GCAGGAGGCG
GCCCGGCGCA TGGTGCGCGT GGCCGAGGCC GACGTGCGCC GCGAGAAGCG CCAGCTCGCC
GACGCCCAGG TCAAGCTGGC CAGGCGGGTG CGTTACGGCA AGAAGATGTA CGACACCAAG
CGCGAGCCCC GCGTGGTCAT GAAGCAGCGC AAGCGCGACG CCCAGGTGGC GGCGGGCAAG
CACCGCGTCA TGCAGGAGGC CAAGCTGGTC GGGGCCAGGG AGGAGCTGGG CCGGGCGGAG
GACGCCGTCC GCGACGACGA CGCCATCCGC GTCGCCCTGC CCGGCACCGG GGTCCCGGCC
GGACACGGCG TCCTGGAGCT GGGCACTCCG GTCGGCGGGC GGCTGTACCT GCGCGGGCCC
GAACGGGTGG CGCTGGTCGG GCCCAACGGG TCCGGCAAGA CCACCCTGCT GCGGGCCGTG
GTCGGCCGGG GGCGCCACCC GCAGGCCGAG GTGGTGCACG CCGCCGACCG GATCGGCTAC
CTGCCGCAGC GGCTGGACGT GCTGGACGAC GGCCTGAGCG TGCTCGACAA CGTGCGCGCG
GCGGCGCCCT CCGCCACGCC GCACCGGGTG CGGGCCGGTC TGGCCCGCTT CCTCATCCGC
GGGGACCGGG TGGAGCAGAG GGCGGGCGAC CTCTCCGGCG GCGAGCGGTT CCGGGTGGCG
CTGGCCCGGC TGCTGCTGGC GGACGAGCCT CCCCGGCTGC TGGTGCTGGA CGAGCCCACC
AACAGCCTCG ACCTGGACAG CCAGCGGCAG CTGGCCGAGG CCCTCGCGGA CTACCGGGGC
GCCCTGCTGG TGGCCAGCCA CGACCACGCG TTCCTGCGCG AGGTCGGAGT GGGCCGCTGG
TGGCGGACCG AGCGCGGGCA CGCCCCGGTG GAGGTGTCCG GGGCCGACTG A
 
Protein sequence
MSSAFSVVCT GLSFAWPDGT PVLTGLDAAF GTGRTGLVGR NGSGKSTLLR LVAGRLTPAS 
GTVAVDGDVG YLDQGLTLDT GRTVAELLGV DRARTALHAI EAGEATEANF AAVGEDWDVE
ERVLAQLERF GVALTGDAPL DRPVGTLSGG EAVLVALAGL ALRRPAVTLL DEPTNNLDRR
ARERLHDAVA SWQGVLVVVS HDRELLERVD QIAELREGRM RVWGGNYSAY TEQLAAEQEA
ARRMVRVAEA DVRREKRQLA DAQVKLARRV RYGKKMYDTK REPRVVMKQR KRDAQVAAGK
HRVMQEAKLV GAREELGRAE DAVRDDDAIR VALPGTGVPA GHGVLELGTP VGGRLYLRGP
ERVALVGPNG SGKTTLLRAV VGRGRHPQAE VVHAADRIGY LPQRLDVLDD GLSVLDNVRA
AAPSATPHRV RAGLARFLIR GDRVEQRAGD LSGGERFRVA LARLLLADEP PRLLVLDEPT
NSLDLDSQRQ LAEALADYRG ALLVASHDHA FLREVGVGRW WRTERGHAPV EVSGAD