Gene Ndas_3072 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNdas_3072 
Symbol 
ID9246928 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardiopsis dassonvillei subsp. dassonvillei DSM 43111 
KingdomBacteria 
Replicon accessionNC_014210 
Strand
Start bp3670646 
End bp3671482 
Gene Length837 bp 
Protein Length278 aa 
Translation table11 
GC content77% 
IMG OID 
ProductShikimate dehydrogenase substrate binding domain protein 
Protein accessionYP_003680987 
Protein GI297562013 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.347553 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value0.491386 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGGAACGG TGCGCGCCGC CGTCCTGGGC TCGCCGGTGG CCCACTCGCT CTCACCGGTC 
CTGCACACCG CCGCCTACGC GGCGATGGGC CTGGACGAGT GGTCCTACGG CCTGCACGAG
TGCGGTGAGG AGGAACTCGC GCCCTTCCTC GCCGGACTCG GCGGGGAGTG GGCCGGGCTT
TCCCTGACCA TGCCGCTCAA GCGCCGCGCC CTGGAGCTGG CCGAGACGGT CTCCGACCTG
GCCCTCCAGG CGGGCGGCGC CAACACCCTC GTCCACCGCG GGCGTGAGTG GCACGCCCAC
AACACCGACG TGGCGGGGAT CACCGCGGCC CTGGCCGAGG CCGGGGCCGA CGCCCCCCGC
AGCGCGGTCG TCCTGGGCGC CGGGGCCACG GCGGCCTCCG CCCTCGTCGC GCTGCGCCTG
CTCGGCCTGA CCGCGCCGGT CACCGTGCTG GCCCGCGACC CGGCCCGGGC CGGGCAGGTG
GCCGCCGCGG CCCGCCGCAC GGGCCACCCG CTGGAGGTGG CGCCGCTCGC CGAGGTGGAC
AAGCACCTGG ACGTGGACCT GGTCGTGTCC TCCCTGCCCT CGGGCGCGGC CGACCTCCAC
GCCGACCTGC TCGCCGCCTC GCGCGCAGAC CTGTTCGACG TCGTCTACTC GCCCTGGCCC
ACCCGCGCCG CCGCGGCCGT CGCCGCGCGC GGCGGCCGCG TGGTCGGCGG CTTCCCCATG
CTCCTGCACC AGGCCGTCGA GCAGGTCCGC CTGATGACCG GTGTGGACGA CGTGCCCGTG
GAGGCCATGC GCGCGGCCGG TGAGGCTGAA CTGGCCCGCC GCTCCGTCCC CGCCTGA
 
Protein sequence
MGTVRAAVLG SPVAHSLSPV LHTAAYAAMG LDEWSYGLHE CGEEELAPFL AGLGGEWAGL 
SLTMPLKRRA LELAETVSDL ALQAGGANTL VHRGREWHAH NTDVAGITAA LAEAGADAPR
SAVVLGAGAT AASALVALRL LGLTAPVTVL ARDPARAGQV AAAARRTGHP LEVAPLAEVD
KHLDVDLVVS SLPSGAADLH ADLLAASRAD LFDVVYSPWP TRAAAAVAAR GGRVVGGFPM
LLHQAVEQVR LMTGVDDVPV EAMRAAGEAE LARRSVPA