Gene Ndas_3964 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNdas_3964 
Symbol 
ID9247835 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardiopsis dassonvillei subsp. dassonvillei DSM 43111 
KingdomBacteria 
Replicon accessionNC_014210 
Strand
Start bp4740154 
End bp4741722 
Gene Length1569 bp 
Protein Length522 aa 
Translation table11 
GC content73% 
IMG OID 
Productphosphoribosylaminoimidazolecarboxamide formyltransferase/IMP cyclohydrolase 
Protein accessionYP_003681867 
Protein GI297562893 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGACCCAGC AGGCCATCAG GCGTGCGTTG ATCAGCGTCT ACGACAAGAC CGGTCTGGAG 
GAGCTGGGCA TCGGCCTGGC CGAGGCCGGG GTGGAGATCG TCTCCACCGG CTCCACCGCC
GCGCGGCTGC GCGCCGCCGA CATCCCCGTC ACCCCCGTGG AGGACGTCAC CGGCTTCCCC
GAGATCATGG AGGGTCGCGT CAAGACGCTG CACCCCTCCG TGCACGCCGG GCTCCTGGCC
GACCAGAACA ACCCCGAGCA CGTCGCCAGG ATCAAGGAGC TGGGCATCGC CCCCTTCGAC
CTGGTCGTGG TCAACCTCTA CCCCTTCCAG GACACCGTCG CCTCCGGCGC CTCCGAGGCC
GACTGCATCG AGAAGATCGA CATCGGCGGC CCCGCCATGG TGCGCGCCTC GGCCAAGAAC
CACGGCAGCG TCGCGATCGT GGTCGACCCG GGCAGCTACG ACGCCGCCCT GGAGGCCGTC
CGCGACGGCG GCTTCACCCT GGAGCAGCGC AAGCGCCTGG CCGGGCTCGC CTTCCAGCAC
ACCGCCGCCT ACGACGCGGC CGTCGCCGAC TGGTTCGCGG CCGACTACGC CCCCGACACC
GAGGCCGCCG AGTCGGGCTG GCCCGGTTTC CTGGCCGCCG TCCACCACCG CAGGGACGTC
CTGCGCTACG GCGAGAACCC CCACCAGAAG GCGGCGCTGT ACACCGTCGC GGGCGCCCCG
CGGACCGGCC TGGCCGGAGC CGAGCAGCTC CACGGCAAGG CGATGTCCTA CAACAACTAC
GTGGACGCCG ACGCCGCGCT GCGCGCCGCC CACGACTTCG ACCAGCCGTG CGTGGCCATC
ATCAAGCACG CCAACCCGTG CGGGATCGCC GTCGGCGCCG ACAACGCCGA GGCCCACCGC
AGGGCGCACG CCTGCGACCC GGTGTCGGCC TTCGGCGGCG TCATCGCCAC CAACCGCCCC
GTCGGCGAGG AGCTGGCCGG GCAGATCGCG GAGATCTTCA CCGAGGTCGT CGTCGCCCCC
GCCTTCGAGC CCGCGGCCGT GGAGATCCTC AGCCGCAAGA AGAACATCCG CCTGCTCGTG
GCGCAGGGCT CCGGCCCCGG CGCGGGCGTG GAGCACCGCC AGATCAGCGG CGGCCTGCTG
GTGCAGTCGC GCGACGCCAT CGACGCCCCC GGCGACGACC CCTCCACCTG GACCCTGGCC
ACCGGCGAGC CCGCCGACGA GGCCACCCTG GCCGACCTGG CCTTCGCCTG GAAGGCCGTG
CGCGCGGTCA AGTCCAACGC CATCCTCCTG GCCTCCGGCG GCGCCACCGT GGGAGTGGGC
ATGGGCCAGG TCAACCGCGT GGACTCCGCA CGCCTGGCGG TCACGCGCGC GGGCGAGAGG
GTGACGGGGT CCGTCGCGGC CAGCGACGCC TTCTTCCCCT TCCCCGACGG CCTGGAGATC
CTCACCGGGG CGGGGGTCCG CGCCGTCGTC CAGCCCGGCG GCTCGGTCCG CGACGAGGAG
GTCGTGGCCG CGGCCAAGGC GGCCGGCGTG ACCATGTACC TCACCGGGAC CCGGCACTTC
TTCCACTGA
 
Protein sequence
MTQQAIRRAL ISVYDKTGLE ELGIGLAEAG VEIVSTGSTA ARLRAADIPV TPVEDVTGFP 
EIMEGRVKTL HPSVHAGLLA DQNNPEHVAR IKELGIAPFD LVVVNLYPFQ DTVASGASEA
DCIEKIDIGG PAMVRASAKN HGSVAIVVDP GSYDAALEAV RDGGFTLEQR KRLAGLAFQH
TAAYDAAVAD WFAADYAPDT EAAESGWPGF LAAVHHRRDV LRYGENPHQK AALYTVAGAP
RTGLAGAEQL HGKAMSYNNY VDADAALRAA HDFDQPCVAI IKHANPCGIA VGADNAEAHR
RAHACDPVSA FGGVIATNRP VGEELAGQIA EIFTEVVVAP AFEPAAVEIL SRKKNIRLLV
AQGSGPGAGV EHRQISGGLL VQSRDAIDAP GDDPSTWTLA TGEPADEATL ADLAFAWKAV
RAVKSNAILL ASGGATVGVG MGQVNRVDSA RLAVTRAGER VTGSVAASDA FFPFPDGLEI
LTGAGVRAVV QPGGSVRDEE VVAAAKAAGV TMYLTGTRHF FH