Gene Ndas_2598 details


Gene Information

Locus tag: Ndas_2598
Symbol:
ID: 9246449
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111
Kingdom: Bacteria
Replicon accession: NC_014210
Strand:
Start bp: 3095120
End bp: 3096442
Gene Length: 1323 bp
Protein Length: 440 aa
Translation table: 11
GC content: 79%
IMG OID:
Product: hypothetical protein
Protein accession: YP_003680522
Protein GI: 297561548
COG category:
COG ID:
TIGRFAM ID:
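
The start, end, and length fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive and that the coding sequence ends in a single stop codon; the function names are hypothetical and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int, has_stop_codon: bool = True) -> int:
    """Number of amino acids encoded by an unspliced CDS of the given length."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_len_bp // 3
    # The terminal stop codon does not encode an amino acid.
    return codons - 1 if has_stop_codon else codons


length = gene_length_bp(3095120, 3096442)
print(length, protein_length_aa(length))  # prints "1323 440"
```

Under these assumptions the result agrees with the Gene Length and Protein Length fields in the record.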


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.00331203
Plasmid hitchhiking: No
Plasmid clonability: decreased coverage
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.0245461
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGGGCAAGA AGGGCCGCGG GCGACAGCGC GGCGGGGACC GCCGCACGGC CGGGCAGGGC 
GCCCAGCCCT GGGCCGGTCC CGCGCCCGGG GACTCCCCCG AGCAGGTCGT CTCGGAGGCG
GTCGACGCGC TGGTGCTCGG CCGGGAGGGC GGCGGCGTGG ACCTGGCCGC CGCGCGCCTG
GCCGACGCCG AGGACCCGGC GCGGGCCGGG GCGGCGGACC GCGCCGTCCG CGACGCCCTC
CTGGCGGGGG TGGCCTCGGC GTGGGCGCGC GGGTGGCAGC CCGCCGAGAC GGTCCGCCAG
GCGGGCCGCG TACTGGACCC GGTGGCGGCG GCGGTGTGCG CCGACGCGGT GACGGCCGAC
CTCGACCGCC ACTCCCCCGA CACGGTGGAC CCGCGCTGGA CCGCGCAGGT GCGCGAACTG
GGCGCCGCCC CGGCCTGGGG CGGCACCGGG GACTACCTGG CGCACGTGGG CGCCGGGCAC
GGGCTGCTGC GCTTCGAGGC GGTGGAGACG GCGCTGCGGC TGCTGGCCGC GCTGCGCGTG
CTGCCGCCGC TGCACCGGCT GTGCCCGCCG CCGGGGGCGT TCCGTCCGCG GCCCGAGGGC
TCCGCGCGGA CCCCGGAGGC GGTGGACCAG GGCAAGCTGG CCCGCGTGCG CGCGCTGCTG
GCCAAGGCCG AGTCCACGGA GTTCCCCAGC GAGGCGGAGG CGCTGAGCGC CCGCGCGCAG
GAGCTGATGG CCCGGCACAG CATCGACCGC GCCCTGCTGG CCGAGGAACC CGGGCAGGCG
TCGGAGGCCG CCTCGGGGCG GCGGCTGCCG GTGGACGCGC CCTACGACGA GCACAAGGCG
GTGCTGCTGC ACGAGGTCGC CGAGGCCAAC CACTGCCGGG CCGTGTGGGA CCGGGAACTG
GGGCTGTGCA CGGTGATGGG CTTCCCGGGC GACGTGGAGG CGGTGGACCT GATGTTCACG
TCGCTGCTGG TGCAGGCCGA GACGGCGATG CGGGCCAGCG GTTCGGTGCG CGACCGGTCC
GGCAGGGCGG CGGGCAGGGC CTTCCGCGCC TCGTTCATGT CCGCGTTCTC GGTGCGGGTG
GGTGAGCGGC TGGTGGAGTC GGCCCGCTCC GCCGAGGAGG CCGCCGCGGC CGAGACCGGC
ACGGACCTGG TTCCGGTGTT CGCCGCCCGC GAGCGCGAGG TGGAGGCCGC CGTGGCCGAG
GCCTTCGGCG AGCTGACCTA CTCGCGGGTG CGCGGGCCCG CCAGCGAGGA CGGCTGGTAC
GAGGGCCTGG CGGCGGCCGA CGCCGCGTCC CTGGGCGCGC ACCGCCGCGT CGCGGACCGC
TGA
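
The GC content field in the gene information is the percentage of G and C bases in a sequence such as the one above. A minimal sketch of that calculation follows, assuming whitespace between the 10-base blocks is ignored and only unambiguous A/C/G/T bases are counted; `gc_content` is a hypothetical helper name, and the rounding used by the source database is not known.

```python
def gc_content(dna: str) -> float:
    """Percentage of G and C among the A/C/G/T bases of a sequence."""
    seq = "".join(dna.split()).upper()  # drop spaces and line breaks
    counted = [base for base in seq if base in "ACGT"]
    if not counted:
        raise ValueError("sequence contains no A/C/G/T bases")
    gc = sum(1 for base in counted if base in "GC")
    return 100.0 * gc / len(counted)


# Usage on the first block of the gene sequence (G, T, G, G, G, C, A, A, G, A):
print(gc_content("GTGGGCAAGA"))  # prints 60.0
```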
 
Protein sequence
MGKKGRGRQR GGDRRTAGQG AQPWAGPAPG DSPEQVVSEA VDALVLGREG GGVDLAAARL 
ADAEDPARAG AADRAVRDAL LAGVASAWAR GWQPAETVRQ AGRVLDPVAA AVCADAVTAD
LDRHSPDTVD PRWTAQVREL GAAPAWGGTG DYLAHVGAGH GLLRFEAVET ALRLLAALRV
LPPLHRLCPP PGAFRPRPEG SARTPEAVDQ GKLARVRALL AKAESTEFPS EAEALSARAQ
ELMARHSIDR ALLAEEPGQA SEAASGRRLP VDAPYDEHKA VLLHEVAEAN HCRAVWDREL
GLCTVMGFPG DVEAVDLMFT SLLVQAETAM RASGSVRDRS GRAAGRAFRA SFMSAFSVRV
GERLVESARS AEEAAAAETG TDLVPVFAAR EREVEAAVAE AFGELTYSRV RGPASEDGWY
EGLAAADAAS LGAHRRVADR
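
The gene sequence begins with GTG while the protein begins with M. Under translation table 11 (the bacterial, archaeal and plant plastid code), GTG is one of the permitted initiation codons and is read as methionine at the start position. The sketch below illustrates this, assuming the standard codon assignments (which table 11 shares) and listing only a subset of the table's initiation codons; the names `CODON_TABLE`, `START_CODONS`, and `translate_cds` are hypothetical.

```python
from itertools import product

# Standard codon assignments in NCBI order (bases T, C, A, G); table 11 uses the same ones.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Subset of the initiation codons allowed by table 11 (assumption: enough for illustration).
START_CODONS = {"ATG", "GTG", "TTG"}


def translate_cds(dna: str) -> str:
    """Translate a CDS, reading an alternative start codon as M and stopping at the first stop."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        codon = seq[i:i + 3]
        aa = CODON_TABLE[codon]
        if i == 0 and codon in START_CODONS:
            aa = "M"  # initiator codon is translated as methionine
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence in this record:
print(translate_cds("GTGGGCAAGA AGGGCCGCGG GCGACAGCGC"))  # prints MGKKGRGRQR
```

The output matches the first ten residues of the protein sequence listed above.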