Gene Ndas_0091 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNdas_0091 
Symbol 
ID9243922 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardiopsis dassonvillei subsp. dassonvillei DSM 43111 
KingdomBacteria 
Replicon accessionNC_014210 
Strand
Start bp115642 
End bp116793 
Gene Length1152 bp 
Protein Length383 aa 
Translation table11 
GC content72% 
IMG OID 
Productdomain of unknown function DUF1745 
Protein accessionYP_003678048 
Protein GI297559074 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.189874 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones19 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGGCACGGT TCGGCGATGC ACTGACGACG GGGGCCGACC TCGTGAACGC GGCCGAGCGC 
GCCGTACTGA GTGCCCTGGA ACAGGTGGAC GGCCCTACCG ACCTGGTGTG CTTCTTCGTC
TGCGGCGCCG ACCCCGAGGA GGTCACCCTC GCGGGCAAGC GCGTCATGGA GCTGGCGGGC
GACGCGGCCA CCCTCGGATG CAGTTCCACC GGGGTCATCG GCGGCGGCCG CAGCGTCGAG
GGCCAGGGCT CGGTCAGCGT GTGGTGCGCC GGTCTGCCCG GCGTGGAGAT CACACCGTTC
CGACTGGACA CCGTGGTCGA GGACGACCAC CTGGCCGTCA TCGGCATGCA GGAGCCCGGC
CCCCGCGACA GCGTGGCCAT CCTGCTCACC AACCCCTACG AGTTCCCCAC CCAGGCCTTC
GTCCGCGAGT CCACCGAGGC CCTCGGCGGC CTGCCCCTCG TCGGCGGCAT GGCCGACGGC
ATGCGCGGTG AGGAGTCGGT GCGGCTCTTC TGCGACGGCG AGGTGGCCGA GCACGGCGCC
ATCGGCGTCC TCGTCGGCGG CGAGAACGTC CTCGGCACCG TCGTGAGCCA GGGCTGCCGC
CCCATCGGCT CGCCCATGAC CGTCACCAAG GCCGAGGGCA ACCTCCTGCT CGAACTCGCG
GGCACCAACG CCTACGAGAA GCTGGAGGAG CTGGTCGAGT CCCTCTCCGA GGAGGACCGC
GAACTCGCCG AGCACGGCCT GCACATCGGC ATCGCCATGG ACGAGTACGT CGACCGCCAC
GAGCAGGGCG ACTTCCTCAT CCGCACCCTG GCCGGAGCCG ACCCCGAACT CGGCGCCCTC
ACCATCGACG ACATGGTCGA GGTCGGCCAG ACCGTCCGCT TCCAGGTCCG CGACGCCGGT
ACCGCGGACG AGGACCTGGC CCGCCGCCTC AGCGACTTCG GCGCCGAACA CCCCGTCGGC
GCCGGTCTGC TCTTCTCCTG CAACGGCCGC GGGTCCTCCC TCTTCCCGCA GTCCGACCAC
GACGTCCTGG CCGTCCACCG CGTCCTCGGC GTCGACGCCG TCGCCGGGTT CTTCGCCGCT
GGCGAGATCG GCCCGGTCGG CGGGGTCAAC CACGTGCACG GGTTCACCGC CTGCCTGCTG
GCCTTCGCCT AG
 
Protein sequence
MARFGDALTT GADLVNAAER AVLSALEQVD GPTDLVCFFV CGADPEEVTL AGKRVMELAG 
DAATLGCSST GVIGGGRSVE GQGSVSVWCA GLPGVEITPF RLDTVVEDDH LAVIGMQEPG
PRDSVAILLT NPYEFPTQAF VRESTEALGG LPLVGGMADG MRGEESVRLF CDGEVAEHGA
IGVLVGGENV LGTVVSQGCR PIGSPMTVTK AEGNLLLELA GTNAYEKLEE LVESLSEEDR
ELAEHGLHIG IAMDEYVDRH EQGDFLIRTL AGADPELGAL TIDDMVEVGQ TVRFQVRDAG
TADEDLARRL SDFGAEHPVG AGLLFSCNGR GSSLFPQSDH DVLAVHRVLG VDAVAGFFAA
GEIGPVGGVN HVHGFTACLL AFA