Gene Ndas_2017 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNdas_2017 
Symbol 
ID9245867 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardiopsis dassonvillei subsp. dassonvillei DSM 43111 
KingdomBacteria 
Replicon accessionNC_014210 
Strand
Start bp2437398 
End bp2438552 
Gene Length1155 bp 
Protein Length384 aa 
Translation table11 
GC content71% 
IMG OID 
Productoxidoreductase domain protein 
Protein accessionYP_003679949 
Protein GI297560975 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.183817 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones22 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGGTGATC ACGTGCTGGG TGTCGCCATG AACGGCGTCA CCGGACGCAT GGGGTACAGG 
CAGCACCTGA CCCGGTCTGT CCTGGCCATC CGGGAGGCGG GCGGCGTACG CCTGCCCGAC
GGCTCCCGGA TCATTCCCGA GCCCGTCCTG GTCGGACGCT CCGAGCACAA GCTGCGCGAG
ATCGCCGAAC GCCACGGAAT CGAGCGCTGG TCCACCGACC TGGACGGTGT GCTCTCCGAC
GACGACATCA CGGTCTACTT CGACTCGCAG ATCACACACG CCCGCGAGGC CGCCGTGCGC
GCGGCCATCG CCGCGGGCAA GCACGTCTAC GTCGAGAAGC CCACCGCCAG CACGCTCAGC
GCCGCCCTGG AGCTGGCCAA ACTGGCCCGC GACGCGGGCG TGCGCAACGG CGTGGTCCAG
GACAAGCTCT TCCTGCCCGG CCTGCTCAAA CTGCGCAGGC TGGTCGAGAG CGGCTTCTTC
GGCCGGATCC TGTCGGTGCG CGGCGAGTTC GGCTACTGGG TCTTCGAGGG CGACTGGCAG
CCCGCCCAGC GCCCCAGCTG GAACTACCGC GCCGAGGAGG GCGGCGGCAT GGTGCTGGAC
ATGTTCCCGC ACTGGCACTA CATCCTGGAG CACCTGTTCG GCCCGGTGCG CGCGGTCACC
GCCAAGGTGG CCACCCACAT CCCGCGCCGC TGGGACGAGG AGGGACGGCC CTACGAGGCC
ACGGCCGACG ATTCCGCCTA CGGCATCTTC GAGCTGGACG GCGGCGTCAT CGCCCAGATC
AACTCCTCCT GGAACGTGCG CGTGGCCCGC GACGAACTCG TGGAGTTCCA GGTCGACGGC
ACCCACGGCA GCGCCGTGGC GGGACTGCGC TCCTGCCGCG CCCAGCACCG CTCGGCCACG
CCCAAGGCGG TCTGGAACCC CGACCTGGAG GACCTGGGGC GCTACCGGGA GCAGTGGGAG
CCGGTGCCCG ACAACACCGA GTTCCCCAAC GGGTTCCGCG CCCAGTGGGA GGACTTCCTG
CGCCACGTGG TCCTGGACAC CCCCTTCCCG CACGACCTGC TCTCGGGCGC GCGCGGCCTC
CAGATGGCCG AGGCCGGACT CCAGTCGGCG CGCACCGGCC GCACGATCGA ACTGGACGAG
GTCACCCTCG CATGA
 
Protein sequence
MGDHVLGVAM NGVTGRMGYR QHLTRSVLAI REAGGVRLPD GSRIIPEPVL VGRSEHKLRE 
IAERHGIERW STDLDGVLSD DDITVYFDSQ ITHAREAAVR AAIAAGKHVY VEKPTASTLS
AALELAKLAR DAGVRNGVVQ DKLFLPGLLK LRRLVESGFF GRILSVRGEF GYWVFEGDWQ
PAQRPSWNYR AEEGGGMVLD MFPHWHYILE HLFGPVRAVT AKVATHIPRR WDEEGRPYEA
TADDSAYGIF ELDGGVIAQI NSSWNVRVAR DELVEFQVDG THGSAVAGLR SCRAQHRSAT
PKAVWNPDLE DLGRYREQWE PVPDNTEFPN GFRAQWEDFL RHVVLDTPFP HDLLSGARGL
QMAEAGLQSA RTGRTIELDE VTLA