Gene Ndas_0130 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNdas_0130 
Symbol 
ID9243961 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardiopsis dassonvillei subsp. dassonvillei DSM 43111 
KingdomBacteria 
Replicon accessionNC_014210 
Strand
Start bp158839 
End bp160263 
Gene Length1425 bp 
Protein Length474 aa 
Translation table11 
GC content74% 
IMG OID 
Productprotein of unknown function DUF195 
Protein accessionYP_003678086 
Protein GI297559112 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones23 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGACGGCC TCGCACTCGT ACTCATGCTC CTGATCGGTC TGGCGGTCGG TACCGCCGTG 
GGCTGGATCC TGGCCCGCGG CCGGGAGGCC GAGACGCGGG CCGACGCCCG CGCGGCCCAG
GAGCGGGCGG CCTACGTGGA GGAACAGCTG GCCGACCGGT TCCGCGCCCT GTCCGCGCAG
GCCCTGGACC AGACCAACCA GCGCTTCATG GAACTGGCGG AGGGGCGGCT GCGCGCGGTC
AGCGCCGAGG CCGGGGGTGA CCTGGACGAG CGGCGCCGCG CGGTGGAGCG CATGGTCGAG
CCGCTCACGC GGACCCTGGA CCGGGTGGAG CGCCAGCTGA GGGAGGCCGA CGCGGGGCGC
GCGGCGGCCC ACGCGGAGCT GGCCAAGCAG GTCGAGTACG TGCGCGAGGG CTCGGAGCGG
CTGCGCGACC AGACGCAGTC GCTGGTGACG GCGCTGCGAC GGCCCGAGGC GCGCGGCCGG
TGGGGCGAGC TCCAGCTGCG CAGGGTGGCG GAACTGGCGG GGATGGGCTC GCACTGCGAC
TTCGAGGAGC AGGCCGCCAC GCGCGACGGG TCGCGGCGCC CGGACATGGT GGTGCGCCTG
GCGGGCGGCA AGAACATCGT GGTGGACTCC AAGGTCCCGC TGGCGGCCTA CCTGGACGCC
GTGGAGGCGG GCGCGGAGGA GGCCGCGCGG CACCTGCGCG TGCACGCCAG GCACCTGCGC
ACGCACGTGG ACCAGCTGGC GGCCAAGGCG TACTGGTCGG CGTTCACCCC GGCGCCGGAG
TTCGTGGTGC TGTTCATCCC CGGGGAGGCG TTCCTGGCCC CGGCCCTGGA GTACGACCCG
GAGCTGCTGG AGTACGCGAT GGGGCGGCGG GTGCACATCG CGACGCCGAC GACGCTGATC
TCGCTGCTGC GGACGGCGCA GTACGCCTGG CAGCAGGAGG CGCTGAGCGA GAACGCGCGG
GCCGTGTTCG ACCTGGGCAA GCAGCTGCAC GAGCGGCTGG GCACCCTCGG CGGGCACGTG
GAGGGACTGG GCCGGGCCCT GACCCGGACG GTGAGCGCCT ACAACCAGAC CGTGGGGTCG
CTGGAGAGCC GGGTGCTGGT GACGGCCCGC CGGTTCGGGG AGCTGGGGCT GGTGGACGGC
GAGCTGGACC GGCCGGTGGG CGTGGAGGAG CGTCCGCGCG CGGTGGCGGC GCCCGAGCTG
TCGGACCCGG GCGGCGCCGG ATCGGCCGCG CGGGAGCGGA GTGGGCCGGA CTCGGGCGGG
CAGGGGAACG GAACGGACGT TGTTCGCGAA ACCTCGGAAC CGCAGTGGCC CAAGCCGGTG
TCCAACGGGG AGGAAGGCGT GTTCGTCGCG GATGGGACAA GTAGCCGGAA GGGCGACACG
ATCCGTGGAA AAGACACCGA ACATTCACGA GAAGTGACAC ACTAA
 
Protein sequence
MDGLALVLML LIGLAVGTAV GWILARGREA ETRADARAAQ ERAAYVEEQL ADRFRALSAQ 
ALDQTNQRFM ELAEGRLRAV SAEAGGDLDE RRRAVERMVE PLTRTLDRVE RQLREADAGR
AAAHAELAKQ VEYVREGSER LRDQTQSLVT ALRRPEARGR WGELQLRRVA ELAGMGSHCD
FEEQAATRDG SRRPDMVVRL AGGKNIVVDS KVPLAAYLDA VEAGAEEAAR HLRVHARHLR
THVDQLAAKA YWSAFTPAPE FVVLFIPGEA FLAPALEYDP ELLEYAMGRR VHIATPTTLI
SLLRTAQYAW QQEALSENAR AVFDLGKQLH ERLGTLGGHV EGLGRALTRT VSAYNQTVGS
LESRVLVTAR RFGELGLVDG ELDRPVGVEE RPRAVAAPEL SDPGGAGSAA RERSGPDSGG
QGNGTDVVRE TSEPQWPKPV SNGEEGVFVA DGTSSRKGDT IRGKDTEHSR EVTH