Gene Ndas_2128 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNdas_2128 
Symbol 
ID9245978 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardiopsis dassonvillei subsp. dassonvillei DSM 43111 
KingdomBacteria 
Replicon accessionNC_014210 
Strand
Start bp2547250 
End bp2548398 
Gene Length1149 bp 
Protein Length382 aa 
Translation table11 
GC content71% 
IMG OID 
Producthypothetical protein 
Protein accessionYP_003680058 
Protein GI297561084 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones19 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGACCTGG GTTTTCTGCG ACCTCTGTAC GAGAGCGACG CTCCGGTGGC CTCGGTGCAC 
CTGGACACCA GCCGCGACAG GACCGACGCC GGCAAGGAAC TCGAACTGCG CTGGCGCCAC
CTGCGGGACG AACTCTCCTC GCTGGGCACC GACAAGGCGA CCCTGGACGT GCTGGAACAG
GCGGTCAGGG ACGGGTCCTC CCGGGCCTTC GGCAGTCACG GCCACTCCCT GTTCGCCTCC
GAGGGGCGTC TGCTCGGCGC GTACACGCTC TCCGAACCGC CCGCGCAGAG CCGCGCCCTG
CGCATGCCGG TTCCCGATCC CCTTCCCGTG GTCGTGGACC GGGGGCGCTA CCTGCCCTAC
GTCCTGGTGG CGCTGGACCG GGTCAACGCC AAGGTGTTCT CCTACACCGG CCACCCCTCC
AGCGAACCGG CCTCCGAGAA GGACTTCTCC GGTGCCGACC TGCGCAACAT CGACCCGATG
GGCCGCGGCG GGCCCGGTGT CCTGAGCGGG TACAACGGCC GCTTCGACGG CAAGCACTAC
CCCATGGAGA CCTGGCGGGA GAACACCGCC CGGATCGCCC AGCAGGTGCG CGAGGCCGTC
GCCGAGGTGG ACGCCCGGAT CATCTTCGTC GGCGGTGACG AGGAGGCCAT CGCCTACCTG
CGCGACAACC TGGGCGAGCG CAAGCTGAGC ATCCCGATCA GGCTGGTGGC CGGCGGACGG
GGCGGCCCCG ACGCCGAGGA GCGCCTGCAC GCGGCCGCGG CGGAGGCGCT GCGCGACTTC
GTCATCGACG GCCATGACGA CATCATCGCC GACTACCACC AGAAACTCGC CAACGACCAG
GCGGTGCGCG GCACCGAGCC CACCCTGCCG ATGCTGTCGG AGGCGCGAGT GCGCACCCTG
CTGCTGGGCG CCGACCGCGA CGGCGAACCC GAGCTGTGGG GCTCACCGGG CGAGCCGGTG
CTGGTGTCGA AGAACCCGGC GGACCTGGAC GACCCCGACG CGGCGTTCCG GGCACCGGCC
AGCGCGCTGA TGCTGCGTTC GGCGATGGCC GCCGACGCCG GTTTCGGCGA GGTCCTCGAC
CACGGCCACA CCAGCAACGA GAACGGCGCC ATCCTGCGCT TTCCCACGTC GCCGAACGAG
AGGGTCTGA
 
Protein sequence
MDLGFLRPLY ESDAPVASVH LDTSRDRTDA GKELELRWRH LRDELSSLGT DKATLDVLEQ 
AVRDGSSRAF GSHGHSLFAS EGRLLGAYTL SEPPAQSRAL RMPVPDPLPV VVDRGRYLPY
VLVALDRVNA KVFSYTGHPS SEPASEKDFS GADLRNIDPM GRGGPGVLSG YNGRFDGKHY
PMETWRENTA RIAQQVREAV AEVDARIIFV GGDEEAIAYL RDNLGERKLS IPIRLVAGGR
GGPDAEERLH AAAAEALRDF VIDGHDDIIA DYHQKLANDQ AVRGTEPTLP MLSEARVRTL
LLGADRDGEP ELWGSPGEPV LVSKNPADLD DPDAAFRAPA SALMLRSAMA ADAGFGEVLD
HGHTSNENGA ILRFPTSPNE RV