Gene Ndas_4388 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNdas_4388 
Symbol 
ID9248263 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardiopsis dassonvillei subsp. dassonvillei DSM 43111 
KingdomBacteria 
Replicon accessionNC_014210 
Strand
Start bp5222392 
End bp5223456 
Gene Length1065 bp 
Protein Length354 aa 
Translation table11 
GC content73% 
IMG OID 
Productphosphoribosylformylglycinamidine cyclo-ligase 
Protein accessionYP_003682283 
Protein GI297563309 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGGCAGCCG AAAGCACGGG GACGTCGGGC GCGTACGCGG CCGCCGGGGT GGACATCGCC 
GCCGGCGAAC GCGCCGTCGA CCTGATGAAG CGCCACGTGG CGCGCACCCG CAGGCCCGAG
CAGGTGACCG ACGCCAGCGG CTTCGCGGGC CTGTTCCGGC TCGACACGAA CAAGTACAAG
GACCCGGTCC TGGCGACCTC CACCGACGGC GTGGGCACCA AGGTGATGCT CGCCCAGCAG
ATGGACCGGC ACGACACGAT CGGCATCGAC CTGGTGGCGA TGGTCGTCGA CGACCTCGTG
GTCAGCGGCG CCGAGCCCCT GTTCATGACC GACTACGTCG CCTGCGGCGC GGTGGTGCCC
GAGCGCATCG CCGAGATCGT CGGCGGCATC GCCGAGGGCT GCCACCAGGC GGGCTGCGCG
CTGGTCGGCG GTGAGACCGC CGAGCACCCG GGCGCCATGG AGCCGGACGA GTACGACCTG
GCCGGTGCGG GCACCGGCGT GGTGGAGGGC GACGCGATCC TGGGCCAGGA CCGGGTCCGC
GAGGGCGACG CCGTCATCGC GATGGGCTCC TCGGGCCCGC ACTCCAACGG CTACTCGCTC
GTCCGCAGCA TCGTGGACCG GGCGGACCTG GACCTGTTCG CGCACGTCCC TGAGCTGGAC
GGGGTGCTGG GCGAAGTGCT GCTCACCCCG ACCCGGGTGT ACGCCAAGGA CTGCGTGGCG
CTGACCGCGG CAGTGGAGGT GCACGCCTAC GCGCACATCA CCGGCGGCGG GCTGGCGGCC
AACCTGGCGC GCTCACTGCC CGACCACCTG GACGCGGAGC TGGACCGCTC CACCTGGGCA
CCCGCCCCCG TGTTCGGCTA CCTGGCCGAC AAGGGGGGCG TGGGCCGGGA GGACATGGAG
GCCACGTTCA ACATGGGTGT GGGCATGGCG GCGATCGTCG CGGCGGACGA CGCCGAGCGC
GCCCTGCGGG TACTGTCCGA CCGCGGCGTC CCGGCCTGGC GGCTGGGCAC GGTGACGGCC
GGTTCGGGAC GGGCCGTCCT GACCGGCGAG TACCGCGGCG CGTGA
 
Protein sequence
MAAESTGTSG AYAAAGVDIA AGERAVDLMK RHVARTRRPE QVTDASGFAG LFRLDTNKYK 
DPVLATSTDG VGTKVMLAQQ MDRHDTIGID LVAMVVDDLV VSGAEPLFMT DYVACGAVVP
ERIAEIVGGI AEGCHQAGCA LVGGETAEHP GAMEPDEYDL AGAGTGVVEG DAILGQDRVR
EGDAVIAMGS SGPHSNGYSL VRSIVDRADL DLFAHVPELD GVLGEVLLTP TRVYAKDCVA
LTAAVEVHAY AHITGGGLAA NLARSLPDHL DAELDRSTWA PAPVFGYLAD KGGVGREDME
ATFNMGVGMA AIVAADDAER ALRVLSDRGV PAWRLGTVTA GSGRAVLTGE YRGA