Gene Ndas_0802 details

Gene Information

Locus tag: Ndas_0802
Symbol:
ID: 9244647
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111
Kingdom: Bacteria
Replicon accession: NC_014210
Strand:
Start bp: 988245
End bp: 990545
Gene Length: 2301 bp
Protein Length: 766 aa
Translation table: 11
GC content: 74%
IMG OID:
Product: 5-methyltetrahydropteroyltriglutamate/homocysteine S-methyltransferase
Protein accession: YP_003678752
Protein GI: 297559778
COG category:
COG ID:
TIGRFAM ID:
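
The start and end coordinates, gene length, and protein length listed above are mutually consistent. The following minimal sketch shows the arithmetic, assuming the coordinates are 1-based and inclusive and that the CDS ends with a single stop codon that is not counted in the protein length. The variable names are illustrative only.

```python
# Values copied from the Gene Information fields above.
start_bp = 988245
end_bp = 990545

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length_bp = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not part of the protein.
codon_count = gene_length_bp // 3
protein_length_aa = codon_count - 1

print(gene_length_bp)      # 2301
print(protein_length_aa)   # 766
```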


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 16
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACCACGA CCCCCGCGCC GTCCGCGGCC CCGGGCGGAC CACGCCCCTC CGTGCGCTCG 
ACCGTGCACG GCTACCCGCG CATCGGCCCC GCCCGCGAGC TCAAGCGCGC CTGCGAGTCC
CACTGGAGGG GCCGCACCAC CGCCCGCGAA CTGGACGAGG TCGCCGCCGG GCTGCGCCTG
GGCGTCTACG CCCAGCTGCG CGAGGCCGGT GTCGACGACA TCCCCTCCAA CACGTTCTCC
TACTACGACC ACGTGCTGGA CACCGCGGTC CTGTTCGACC TGGTCCCCTC CCGCTTCGCC
CCTCCCGCCG CGGCGGCGGA CCGCGAGGAG GAACTGCGCC GCTACTTCGC GCTCGCCCGC
GGCGAACAGG GCGCCCCGCC GCTGGAGATG ACCAAGTGGT TCGACACGAA CTACCACTAC
CTGGTCCCCG AGCTCTCGCC CGGCAGCCGT CCCCGCCTGG TGGGCGACAA GCCCGTCGCC
GAGTTCCGAG AGGCGGCCGA GGCGGGTTTC CACACCCGCC CGGTCCTGGT CGGTCCTCTC
ACCTTCCTGC TGCTGGCCAA GCCCGCCGAC GGCGCCCCCG AGGGCTGGGA ACCCGTCCAG
CTGCTCGACC AGCTCCTGGA GGCCTACGCC CAGCTGCTCG GCGACCTGCG GGCCGCGGGC
GCCACCGAGG TCCAGCTGGA CGAGCCGATC CTGGCCACCG ACGAGGGCCG CGCCGCCGTC
GGCCACCTGG AGCGCGCCTA CCAGTACCTG GGCGCGGTCA CCGACCGCCC CTCCCTGCTG
GTGTCCACCT ACTTCGGCTC CATCGGCTCC GCCGCCCTGC GCGTCCTCAA GGACTCCGCG
GTCGAGGCGG TCGGCCTGGA CCTGGTCACC GACGACGAGG GCGTGGACGA CCTGGTGCGG
GTCTCCGGCC TGGGCGCCAC CCGCCTGGTC GCGGGCGTGG TGGAGGGGCG CAACGTCTGG
CGCACCGACA TCCCGGCCGC CGTCGCCGCG CTCGGTACCC TGCTCGCCCT CACCGACGAG
CTGACCGTGA GCACCTCCTG CTCCCTGCTG CACGTGCCCC TGGACCTGGA CGCCGAGCCC
TCCCTGGCCC CCGAGCTGCG CGGGGCCCTG GCCTTCGCCA AGCAGAAGGC CGAGGAGACG
GCCCTGCTCG GCCGCGTGCT GTCCGAGGGC GCCGAGTCCG AGCACACCGG ACGGGGCGGC
GTCCGTCCCC CCGCCTTCAC CGACGCGCGC GTGCGCGCCC GCCTCGACGC CCTGGGCCCG
GACGCCTACG AGCGCCCGCG GGAACGCGGC AGGCCCACCG ACACCCCGCT CACCACGACC
ACCATCGGGT CCTTCCCGCA GACCGCCGAG CTGCGCCGCG CCCGCGCCGC GCACCGCAGG
GGGGAACTGG CGGAGGACGC CTACAAGGCC GTCCTGCGCG AGGAGATCGA CCGGGTCGTC
GCGCTCCAGG AGGAGATCGG ACTGGATGTG CTCGTGCACG GCGAGCCCGA GCGCAACGAC
ATGGTCCAGT ACTTCGCCGA GCAGCTGGAG GGCTACGCCA CCACCGAGAA CGGCTGGGTG
CAGTCCTACG GTTCGCGGTG CGTGCGCCCG CCGATCCTGT TCGGCGACGT CTCGCGCCCC
CAGCCGATGA CGGTCGAGTG GACCACCTAC GCCCAGTCGC GCACCGACAA GCCGGTCAAG
GGCATGCTGA CCGGTCCGGT CACCATGCTC GCCTGGTCGT TCGTGCGCAC CGACCAGCCG
CTGGGCGAGA CCGCACGCCA GGTGGCCTTG AGTTTGCGCG ACGAGGTCGC CGACCTGGAA
CGCGCCGGGA TCCGCCATAT CCAGGTGGAC GAGGCGGCCC TGCGCGAACT GCTGCCGCTG
CGCTCGGAGC ACCGGGCGCG GTACCTGGAC TGGGCGGTGG GCTCCTTCCG CCTGGCCACG
TCGGGGGTGT CGCCGTCCAC GACCATCCAC ACGCACATGT GCTACTCGGA GTTCGGGCTG
ATCGTCGGCG GCATCGAGGC GCTGGACGCC GACGTCACCA GCGTGGAGGC CGCCCGCTCG
CGCATGGAGC TGGTGCGGGA CCTGGGCGAG CGCGGCTACG GGCGCGGGAT CGGCCCGGGC
GTGTACGACA TCCACTCCCC GCGGGTGCCC TCGGTGGAGG AGATCGAGGG GGCGCTGCGG
CTGGCGGTCG CGCACATCGA CGCCCGGAAC CTGTGGGTGA ACCCCGACTG CGGCCTCAAG
ACGCGCGGTT ACGCCGAGGC CGAACAGGCC CTGCGCAACA TGGTCGAGGC GGCCCGCCGG
GTGCGCGCCG ACCTGGCGTG A
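
The gene sequence is displayed in blocks of ten bases separated by spaces, so whitespace has to be removed before any calculation. As a rough sketch of how a GC content figure like the one reported for this gene could be derived, the hypothetical helper `gc_content` below counts G and C bases over the total length. It is shown on the first displayed line only; the full sequence can be passed in the same way.

```python
def gc_content(sequence: str) -> float:
    """Return the fraction of G and C bases in a DNA sequence.

    Whitespace (spaces, newlines) from the block layout is ignored.
    """
    seq = "".join(sequence.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# First displayed line of the gene sequence, copied from the record.
first_line = "ATGACCACGA CCCCCGCGCC GTCCGCGGCC CCGGGCGGAC CACGCCCCTC CGTGCGCTCG"
print(f"{gc_content(first_line):.0%}")
```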
 
Protein sequence
MTTTPAPSAA PGGPRPSVRS TVHGYPRIGP ARELKRACES HWRGRTTARE LDEVAAGLRL 
GVYAQLREAG VDDIPSNTFS YYDHVLDTAV LFDLVPSRFA PPAAAADREE ELRRYFALAR
GEQGAPPLEM TKWFDTNYHY LVPELSPGSR PRLVGDKPVA EFREAAEAGF HTRPVLVGPL
TFLLLAKPAD GAPEGWEPVQ LLDQLLEAYA QLLGDLRAAG ATEVQLDEPI LATDEGRAAV
GHLERAYQYL GAVTDRPSLL VSTYFGSIGS AALRVLKDSA VEAVGLDLVT DDEGVDDLVR
VSGLGATRLV AGVVEGRNVW RTDIPAAVAA LGTLLALTDE LTVSTSCSLL HVPLDLDAEP
SLAPELRGAL AFAKQKAEET ALLGRVLSEG AESEHTGRGG VRPPAFTDAR VRARLDALGP
DAYERPRERG RPTDTPLTTT TIGSFPQTAE LRRARAAHRR GELAEDAYKA VLREEIDRVV
ALQEEIGLDV LVHGEPERND MVQYFAEQLE GYATTENGWV QSYGSRCVRP PILFGDVSRP
QPMTVEWTTY AQSRTDKPVK GMLTGPVTML AWSFVRTDQP LGETARQVAL SLRDEVADLE
RAGIRHIQVD EAALRELLPL RSEHRARYLD WAVGSFRLAT SGVSPSTTIH THMCYSEFGL
IVGGIEALDA DVTSVEAARS RMELVRDLGE RGYGRGIGPG VYDIHSPRVP SVEEIEGALR
LAVAHIDARN LWVNPDCGLK TRGYAEAEQA LRNMVEAARR VRADLA
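
The protein sequence corresponds to the gene sequence translated codon by codon. The sketch below illustrates this on the first three and the last blocks of the gene sequence, using a hand-built codon table. It assumes that translation table 11 assigns the same amino acids to codons as the standard code (the tables differ in permitted start codons, which does not matter here because the gene begins with ATG). The names `CODON_TABLE` and `translate` are illustrative, not part of any library.

```python
from itertools import product

# Compact encoding of the codon table: codons are enumerated with bases in
# the order T, C, A, G, the third position varying fastest.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate DNA to protein, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the block-layout whitespace
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence from the record.
print(translate("ATGACCACGA CCCCCGCGCC GTCCGCGGCC"))  # MTTTPAPSAA
# Final twelve bases of the gene sequence, ending in the TGA stop codon.
print(translate("GACCTGGCGTGA"))  # DLA
```

The two outputs match the first ten residues (MTTTPAPSAA) and the last three residues (DLA) of the protein sequence listed above.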