Gene Ndas_1011 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNdas_1011 
Symbol 
ID9244857 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardiopsis dassonvillei subsp. dassonvillei DSM 43111 
KingdomBacteria 
Replicon accessionNC_014210 
Strand
Start bp1235995 
End bp1237152 
Gene Length1158 bp 
Protein Length385 aa 
Translation table11 
GC content74% 
IMG OID 
Productoxidoreductase domain protein 
Protein accessionYP_003678960 
Protein GI297559986 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00895095 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value0.937142 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGCCGAGTC CCAGACCACC GATTCGCGTA GCCGTCGTCG GAAGCGGCGG TATCGCCCGG 
GGCAGGCACC TGCCCGCGCT CGCCGCCCTG GGGGACAGGG TCGAGGTGGT CGCGCTGGCC
GACCCCGACG CCTCCCGCGT GGCCGCGACC GCCGACGAGT GGGGCGTTCC CGGACGCCAC
ACCGGCCTCG ACGCCCTGCT GCGCGCCGAG TCCCCCGACC TGGTGATCGT GTGCACGCCG
CCCGTCGCGC ACAAGGACGC GGTGATCACG GCCCTGGACG CCGGGTGCTG GGTGTGGTGC
GAGAAGCCGC CGGCGCTGTC CCTGGCAGAG TACGACGAGG TCAGCACCCA CGAGGGAGGC
GAATCCGGTC CCGGCGGCGG AGGCGGTCCC TTCGTCAGCT ACGTGTTCCA GCACCGGTTC
GGCTCCGGCG CCGAGCGCCT GCGCCGCCAC CTGGCCGAGG GCACGCTCGG CCGTCCGCTC
GTCGGCGTGT GCAACACCCT GTGGTTCCGC GCCCCGGACT ACTTCGAGGT CCCCTGGCGC
GGACGCTGGG CGACCGAGGG CGGCGGCCCG AGCATGGGCC ACGGCATCCA CCAGATGGAC
CTCATGCTCT CCCTGCTCGG CGACTGGTCC GAGGTCACGG CCGTGATGTC CACGACCGCC
CGGTCCACCG AGACCGAGGA CGTGTCGATG GCGATCGTGC GCCTGGAGTC GGGCGCGACC
GTCTCCGTGG CCAACAGCCT GCTCTCCCCC CGCGAGACCA GTTACCTGCG CTTCGACTTC
GAGCACGCCA CGGTCGAGCT GGAGCACCTC TACGGCTACG ACAACGCGCA CTGGCGCTGG
ACCCCCGCGC CGCACGTGCG CGACGCCGAC GCGGTCGCGT CCTGGCCGCC GGTGGAGGAC
GAGCCGAGTT CGCACCGGGC CCAGCTCGCC GCGCTGCTCG ACGCGATGGA ACGCGGGGAG
CGGCCCCGCG CCAGCGGCCC CGACGGGCGG CGCGCCCTCG AACTCGTCAC GGGCATGTAC
CGGTCGGCCC TGACCGGCAC GACGGTGCGG CGCCGGGACC TGACCCCCGA CGACGGCTTC
TACCACGCGA TGCACGGGGG CGACGCGGAC ACCGCCGCGG CCGTCCTCAC CAGGACGGAG
GAGACCACAG GTGTCTGA
 
Protein sequence
MPSPRPPIRV AVVGSGGIAR GRHLPALAAL GDRVEVVALA DPDASRVAAT ADEWGVPGRH 
TGLDALLRAE SPDLVIVCTP PVAHKDAVIT ALDAGCWVWC EKPPALSLAE YDEVSTHEGG
ESGPGGGGGP FVSYVFQHRF GSGAERLRRH LAEGTLGRPL VGVCNTLWFR APDYFEVPWR
GRWATEGGGP SMGHGIHQMD LMLSLLGDWS EVTAVMSTTA RSTETEDVSM AIVRLESGAT
VSVANSLLSP RETSYLRFDF EHATVELEHL YGYDNAHWRW TPAPHVRDAD AVASWPPVED
EPSSHRAQLA ALLDAMERGE RPRASGPDGR RALELVTGMY RSALTGTTVR RRDLTPDDGF
YHAMHGGDAD TAAAVLTRTE ETTGV