Gene Ndas_5513 details


Gene Information

Locus tag: Ndas_5513
Symbol:
ID: 9249416
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111
Kingdom: Bacteria
Replicon accession: NC_014211
Strand:
Start bp: 704439
End bp: 705905
Gene Length: 1467 bp
Protein Length: 488 aa
Translation table: 11
GC content: 69%
IMG OID:
Product: amino acid carrier protein
Protein accession: YP_003683398
Protein GI: 297564425
COG category:
COG ID:
TIGRFAM ID:
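
The coordinate and length fields above are consistent with one another, which a short sketch can confirm. It assumes that the start and end coordinates are 1-based and inclusive and that the gene length includes the stop codon; the function names are illustrative and do not belong to any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming its length includes the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


length = gene_length(704439, 705905)
print(length)                  # 1467
print(protein_length(length))  # 488
```

Both values match the Gene Length and Protein Length fields of the record.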


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0538617
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 17
Fosmid unclonability p-value: 0.636918
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGGACGCCA TCCAAGCCGG GATCGTCGCC CTCAACGACA TGTTCTGGGC CTACCTGCTG 
ATCCCCCTCC TCCTCATCCT GAGCCTGTAC TTCACCGTGC GCTCCGGCGC CGTGCAGTTC
CGCCTGATCC CCGACATGTT CCGCTCCATG CGGGGCGAAC CCGGGCTGGC TCCCGACGGC
AAGAAACCGA TCTCCGGCCT CCAGGCGTTC GCCGTCTCCG CCGCCGCGCG CATCGGTACC
GGCAACATCG CCGGTGTGGC CACCGCGATC GCCCTCGGCG GTCCCGGCGC GGTCTTCTGG
ATGTGGGCCA TGGCGCTGCT CGTCGGCGCC GCCAGCTTCG TGGAGTCGAC GCTGGCCCAG
CTCTACAAGA TCCGTGACAA GGCCGGCTAC CGGGGCGGAC CCGCCTACTA CATGCAGTAC
GGGCTCAAGT CCCGTTGGAT GGGCGTGCTC TTCGCCGTCG TCATCACCTT CACGTTCGGC
TTCGTGTTCA CGAGCGTGCA GAGCAACACG ATCTCCGCCG CGATCGCCAA CTCGGTCTCC
ACCGCCTCGG GCACCGGGAC CGCGCCGGGC TGGCTCGCCT ACGCCGTGGG CGCGGTGCTG
GCCGTGTGCA CCGCCGTGCT CATCTTCGGC GGCGCCCGGC GCATCGCCCA GGCCGCCACG
GCGCTGGTCC CCGTCATGGC GGGCGTCTAC ATCCTCATGG GCCTGGTCGT CATCGTGATG
AACATCGGCC AGATCCCCGC GATGGTCGTC GACATCGTCA CGCACGCCTT CGGTCTGCGC
GAGATCGCCG CGGGCGGCAT CGGCACCGCG ATCGTGCAGG GGATGCGGCG CGGGATGTTC
TCCAACGAGG CCGGTCTGGG CTCGGCGCCC AACGCCGCCG CCAGCGCCGC GGTGAGCCAC
CCCGTCAAGC AGGGCCTCGT GCAGACCCTG GGCGTGTACT TCGACACCCT CGTCGTCTGC
TCGACCACGG CGTTCGTCAT CCTGCTGTCC GACCCCAGCT ACACCGCCGA GGCCGGTCCG
ACGCTGACCC AGAACGCCCT GGAAACCAAC CTCGGCCCCT GGGCGCTGCA CCTGCTCACG
CTGATCATCC TGCTGGTGGC CTTCACCTCG GTGCTCGGCA ACACCTTCTA CGGCGAGGCC
AACATCGGCT ACCTGACCAA GAGCCCCCAG GCCACGACCG CCTTCCGGGT GCTGGTCATC
GTGGTGACCT TCCTCGGAGC CATCGGCTCC GGCGGCCTGG TCTGGAGCCT GGCCGACGTC
ACGATGGGCG TCATGGCCGT GGTCAACCTC GCCGCGCTCG CCCTCCTCGC GCCGACCGCC
TGCCGACTGC TGCGCGACTT CGTCGGCCAG CGCAAGCAGG GCAGGGACCC GCTGTTCACC
AAGGACCTCA TGCCCGACCT CACGGGCGTG GAGTGCTGGG AGCAGGAGGA CATGGACCGC
CTCAGGGGGA ACCCCGCCGG GGTCTGA
 
Protein sequence
MDAIQAGIVA LNDMFWAYLL IPLLLILSLY FTVRSGAVQF RLIPDMFRSM RGEPGLAPDG 
KKPISGLQAF AVSAAARIGT GNIAGVATAI ALGGPGAVFW MWAMALLVGA ASFVESTLAQ
LYKIRDKAGY RGGPAYYMQY GLKSRWMGVL FAVVITFTFG FVFTSVQSNT ISAAIANSVS
TASGTGTAPG WLAYAVGAVL AVCTAVLIFG GARRIAQAAT ALVPVMAGVY ILMGLVVIVM
NIGQIPAMVV DIVTHAFGLR EIAAGGIGTA IVQGMRRGMF SNEAGLGSAP NAAASAAVSH
PVKQGLVQTL GVYFDTLVVC STTAFVILLS DPSYTAEAGP TLTQNALETN LGPWALHLLT
LIILLVAFTS VLGNTFYGEA NIGYLTKSPQ ATTAFRVLVI VVTFLGAIGS GGLVWSLADV
TMGVMAVVNL AALALLAPTA CRLLRDFVGQ RKQGRDPLFT KDLMPDLTGV ECWEQEDMDR
LRGNPAGV
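
The gene sequence above is printed in space-separated blocks of ten bases, so it has to be cleaned before any computation. The following is a minimal sketch of that cleaning step together with a GC-fraction calculation; the helper names `clean_sequence` and `gc_fraction` are hypothetical, and the sample is only the first two blocks of the gene sequence.

```python
def clean_sequence(raw: str) -> str:
    """Remove spaces and line breaks and upper-case the result."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# First two ten-base blocks of the gene sequence, copied from the record.
sample = clean_sequence("GTGGACGCCA TCCAAGCCGG")
print(len(sample))          # 20
print(gc_fraction(sample))  # 0.7
```

Passing the full gene sequence through the same two functions gives a value that can be compared with the GC content reported under Gene Information.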