Gene Ndas_5569 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNdas_5569 
Symbol 
ID9249472 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardiopsis dassonvillei subsp. dassonvillei DSM 43111 
KingdomBacteria 
Replicon accessionNC_014211 
Strand
Start bp772491 
End bp773837 
Gene Length1347 bp 
Protein Length448 aa 
Translation table11 
GC content77% 
IMG OID 
ProductROK family protein 
Protein accessionYP_003683454 
Protein GI297564481 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.575796 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value0.481694 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGTCGGTCG GTTCGGGAAC GGTGAGCTTC CAGACGGTCC GGGAGACCAA CCTGGGCGTC 
GTGCTGCGCA CGGTCCGCGA GCTGGCCCCC TGCTCGCGCG CGGCGGTGGC CGCGGCCACC
GGGCTGAACA AGACCACGGT GTCCAGCCTG GTCGCCGACC TCATGGCGCG CAGGCTGGTC
CGCGAGACCG GGCGCTCCTC CCGGCAGCGG GTGGGGCGCC CGGGGGTGCT GCTGGACCTG
GACGACTCCT CGATCGCGGC GATCGGCCTG GAGGTCAACG TGGACTACCT GTCGGTGGTC
GCCGTGGACC TGCTCCAGCG CGAACTGGTC AGCCGCCACG TCCCCTTCGA CGCCCGGTCG
GCCGGGGCGG AGGCGTGCGC GCGGCACATC GCGCGCACCC TGGGCGCGAC GGTGGCCGAC
CCCGCGCTGC GCGGGCGCAC CGTGGTGGGG GTGAGCGTGG CGGTCCCGGC GCTGATCGAC
GCGCCCTCGG GGACGGTCAC GCACGCGCCC AACCTGGGCT GGCGCGACGT GCCGCTGCGG
GACCGGCTGT CGGAGCTGCT GCGCGAGGCC GGGGTGGAGG GCGTCCCGGT GCGGGTGGAC
AACGACGCGA ACCTGGGCGC GGTGGCCGAG TACCGGGTGG GGTCGTTCGC GGGGACGGCC
GACCTCGCGT ACCTGACCGG TGAGGTCGGG ATCGGCGCGG GCATCCTGAC CGGGGGCGGA
CTGCTGCGCG GCGCCAGCGG GTTCGCGGGC GAGGTGGGCC ACCTGTCGCT GGCCCCGGAC
GGCCCGGAGT GCGCGTGCGG GCGCCGCGGC TGCCTGGAGG CGCTGGCGGG GATCGGGGCC
ATCCTGCGCG GGGCCGTCCC CGACCGGTTC CCGGACCACC CGCTGTCGGG CAGCGACGTC
GCCGAGCTGG TGGGGACGGC CGTGGCACGC GCCGAGGCGG GCGAGGACAC CGCCGTCGGC
GCGCTGGAGC GGGCGGGCAC GTGGCTGGGC CGGGGCCTGG CCGTGCTGAT CAACGTCACC
AACCCGAGCC TGGTGGTGCT GGGCGGCTAC TTCGTGCCCC TGGGCCCGTG GCTGCTGCCG
AACTGCCGGG CGGAGGCGGC CGCGAGCGCG TTCGCGCCGG AGGCGGGCGG CTGCCGGGTG
GAGCTGTCGT CGCTGGGGCT GAGCGCGGCG GCCCGGGGCG GGGCCACCGC GATGATCCAC
TCGCTCGACG CGGGACTGCT GCCCCTGCCC GAGCCCGTGA CCCGGGCTCC CGATCCCGCG
TCCGGCGAGG GGCCCGCCGC GGAGCCCGTC GAGCACCCGG CGGCGGACAC GGCGCAGCCG
GACGGGAACA CCGCGGACAC CGCCTAG
 
Protein sequence
MSVGSGTVSF QTVRETNLGV VLRTVRELAP CSRAAVAAAT GLNKTTVSSL VADLMARRLV 
RETGRSSRQR VGRPGVLLDL DDSSIAAIGL EVNVDYLSVV AVDLLQRELV SRHVPFDARS
AGAEACARHI ARTLGATVAD PALRGRTVVG VSVAVPALID APSGTVTHAP NLGWRDVPLR
DRLSELLREA GVEGVPVRVD NDANLGAVAE YRVGSFAGTA DLAYLTGEVG IGAGILTGGG
LLRGASGFAG EVGHLSLAPD GPECACGRRG CLEALAGIGA ILRGAVPDRF PDHPLSGSDV
AELVGTAVAR AEAGEDTAVG ALERAGTWLG RGLAVLINVT NPSLVVLGGY FVPLGPWLLP
NCRAEAAASA FAPEAGGCRV ELSSLGLSAA ARGGATAMIH SLDAGLLPLP EPVTRAPDPA
SGEGPAAEPV EHPAADTAQP DGNTADTA