Gene Ndas_5421 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNdas_5421 
Symbol 
ID9249324 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardiopsis dassonvillei subsp. dassonvillei DSM 43111 
KingdomBacteria 
Replicon accessionNC_014211 
Strand
Start bp601159 
End bp603774 
Gene Length2616 bp 
Protein Length871 aa 
Translation table11 
GC content67% 
IMG OID 
Producthypothetical protein 
Protein accessionYP_003683306 
Protein GI297564333 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.376611 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGGCAACCT TCTCTCTCGT CTTCGTCCCC CGCGACTTCC ACTACGACTC CAAACCCGTC 
TACGACTGCG CGGCGGACCT GGAGGTCCTC AACGCGCGCC TCCTCGGCCG CGCCGACGAC
CTCAGCGGAA GCTTCGACTC GGCCGCGGGC GAGTTCACCG ACGTCATCGC CTGGGACATC
CGGAGCCTGT CCGAGGAGGA CCTCCAGAAC TGGCGGGACG CCGCCGTCTC GGTCTCCTAC
GCGGCCTCGG TGGCCGAGAG GTGGGGGGAC ATCGTGAAGG CCTTCCACGA GGAGCGCGAC
TCCCAGGTCA CCGCCTGGGA GAGCAGCAGG ACGGAGAAGG AGAACGCCGT CCCCGCCAAG
TACCAGGACG ACCACATCAC CTCGTCCCAC CCCCAGGCCG ACGGCTTCCT GTGGACGGAC
TGGGGCGCGG GGGACGAGGC GCGGTGCCGG GCCCTCTACG ACGAGCTCGC CACCGTGCAG
GAGGGGCTCA TCAGCCAGGA GCAGACCAAC TGGCAGAAGC TCCAGGACGG CGCCGAGGAC
GTCCGGACGA TGCTCGAACA GGGCCCGACC CCGGAGAACG TCCGCAAGCT CATGGACACG
GGCAACGCCA ACTGGGCGTT CCTCAACCTG GACCCCTCCC GCTACTCCTC GCTCATCGAC
GACAGCGAGC TCACGCCCGA GAACGCCGAG CAGTACGCCG ACGAGCTCGC CGCCTACTGG
TCCGGTGACA AGCCGCTGGA CGACCGCTAC CACGAGATGA TGCTCGTGCT CACGATGGTG
GGGACCGGCG CCCGGCAGAA CCAGCAGGAC GGCACGGAGC TCAGTCCCGA GGAGATCGCG
TTCCTGGAGG AGTTCTACGC CCAGCTGGAG GAGCCCTACC GCAGGGACGG GGTCGGGGCG
GGCATCATGG TCTACCCCGA CCTCATGAAC GAGTCCGGCA TGAGCGACGA GGAGCGGGAG
GACGCCCTCG GCGTCCTCGG CAACGGGCTG CTCGCGCTCT CGGACCCCGC GCTCGGCGGA
GGGTACGACA ACCTGCCCGA GAGCTTCCGG TACGCCGTCG AGGGCTCCTG GATCAACCCC
GACGCCGAGA ACAAGCTCCC CGGCACGCCC GTGAGCATGG GGATGGACAT GAAGGCGCTC
TCCGCCTTCA TGGAGCACAC GGACGAGGGC CTCCAGGGCG GCTACGGGCT GTCGACCAAC
CTGCACCTGA CCACCGGCGC GTTCCTCGAC GCCTGGGGCG ACGACCCCGA CCCCGACGGG
GTGCTGCCCG ACTCCGAGCA GGTCTCCCAC ATCATCGACG TGGCCAGCCG CAACACCGAC
GCCAACTACT ACATGCTCAC CGGCGAGCAC ATCAACGCGG AGGAGGGCGT CGACCACGGC
GACGAGGACC TGCGCACGCG GGCGCTGGAG GGCACGCTCA CCCACGAGTG GCACGACGAC
GGCCGCACCG CGCGCCAGCT CACCGACTGG CTGGCCGAGG ACATCCACAG CGAGGACCCC
GACGTGCGCC AGCGGGCCGG GGACGGTTTC GCCGGGTTCA TGGAGACCAT CACCGACAAG
GACATGCACG AGGCCCTCGT CAACACCGGC GTGGACGTGA CCGAGGGGGA CAACGAGTAC
TCCAACGCCT CGTTCACCCA GTTCAACGGG GAACTCGCCG ACAGCCTCGC CGACATCTTC
GACGCCCACA TCTACAGCTT CGCCGACAGT GACGTGCTGC ACGACAACGA GCCGGTGACG
GGCATCGAGG ACTTCGACCC GGACAAGAGT TTCGTCTCCA TGGGTCCGGA GGAGCGCGCC
ATGTACATGC AGCTCCTGAT GGGCAACGAC GAGACCGCCG GACGTGTGGT CAACTCCGTC
GACGTCTACC AGCAGATCGA GGCCGCCGCC TTCTTCGGAA ACGGACAGGC CGAGGAGACG
GCACGCGGTG GAGGCCAGCT CCAGGCCCTT CTGGAGGAGG CGCTGAGAAG GGACTCGGCC
GACCGGACGG CCGACCTCGA CGAGCAGATC GACCGGAAGA CGCAGATCAC CGAATTCGTG
GTGGGCGAGG CCGGAGGCAT GTCCGAGAAG ATCCCGGTCA TCGGCGCCGC TGTCGCGAAG
GGGCTGGAGC TGGAAGAGGA GAGCATCGTC GAGGCGATCG TCAACGGCGA GTACGAGGTC
TCTCCACGCC ACCCGACCTT CTCCGGGGCC GAGTACATCG AGCGCAACTT CCGCATCGAG
GCTCTCGACT ACCTGTCGCA GAAGGATCCC GAAGCGCTCA ACGGCGTCAT CCGTCCCGAC
GAGTTCCGCA CCCTGATCAA CGGCGGCGCC ATCACCATCA CGCAGGACGG TGTGCGGCTC
GACCCGGAGG ACATCGGAGA CGGCTTCGTC TTCGACGACT CCGTCACGGT CGACGTGGAG
AAGAACCCGA ACGAGTGGTC GGACAACAGG AACCAGGACC TCGACCGCAT GGACGGCGCC
ATCAGCAACA TCCTGGTCGA CGTGGAGATC ACCACCAGCG ACGGCCGCAC CCAACCGGGC
GACAGGCGCG TGTCCGACTT CGTCACCCAG TACAACAACT CCTACGACGA CACGAACAGC
GTCTTCGCGG GCCAGGAGGA GGAAGAGGAG GGTTGA
 
Protein sequence
MATFSLVFVP RDFHYDSKPV YDCAADLEVL NARLLGRADD LSGSFDSAAG EFTDVIAWDI 
RSLSEEDLQN WRDAAVSVSY AASVAERWGD IVKAFHEERD SQVTAWESSR TEKENAVPAK
YQDDHITSSH PQADGFLWTD WGAGDEARCR ALYDELATVQ EGLISQEQTN WQKLQDGAED
VRTMLEQGPT PENVRKLMDT GNANWAFLNL DPSRYSSLID DSELTPENAE QYADELAAYW
SGDKPLDDRY HEMMLVLTMV GTGARQNQQD GTELSPEEIA FLEEFYAQLE EPYRRDGVGA
GIMVYPDLMN ESGMSDEERE DALGVLGNGL LALSDPALGG GYDNLPESFR YAVEGSWINP
DAENKLPGTP VSMGMDMKAL SAFMEHTDEG LQGGYGLSTN LHLTTGAFLD AWGDDPDPDG
VLPDSEQVSH IIDVASRNTD ANYYMLTGEH INAEEGVDHG DEDLRTRALE GTLTHEWHDD
GRTARQLTDW LAEDIHSEDP DVRQRAGDGF AGFMETITDK DMHEALVNTG VDVTEGDNEY
SNASFTQFNG ELADSLADIF DAHIYSFADS DVLHDNEPVT GIEDFDPDKS FVSMGPEERA
MYMQLLMGND ETAGRVVNSV DVYQQIEAAA FFGNGQAEET ARGGGQLQAL LEEALRRDSA
DRTADLDEQI DRKTQITEFV VGEAGGMSEK IPVIGAAVAK GLELEEESIV EAIVNGEYEV
SPRHPTFSGA EYIERNFRIE ALDYLSQKDP EALNGVIRPD EFRTLINGGA ITITQDGVRL
DPEDIGDGFV FDDSVTVDVE KNPNEWSDNR NQDLDRMDGA ISNILVDVEI TTSDGRTQPG
DRRVSDFVTQ YNNSYDDTNS VFAGQEEEEE G