Gene Ndas_4636 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNdas_4636 
Symbol 
ID9248517 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardiopsis dassonvillei subsp. dassonvillei DSM 43111 
KingdomBacteria 
Replicon accessionNC_014210 
Strand
Start bp5506846 
End bp5508096 
Gene Length1251 bp 
Protein Length416 aa 
Translation table11 
GC content74% 
IMG OID 
Productphosphoribosylamine/glycine ligase 
Protein accessionYP_003682528 
Protein GI297563554 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.728715 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value0.385617 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGAAAGCCC TCGTTCTCGG CGGCGGAGGC CGCGAGCACG CTCTGGTCCG CGCCCTGTCC 
CTGGACCCGG GTGTCACCAG CATCCACAGC GCCCCGGGCA ACCCCGGCAT CTCGGAGCTG
GCCGAGAACC ACGTGCTCAA CGTGACCGAC GGCCTGGCCG TCACCGAGCT GGCCGCGCGC
ATCCGCGCCG AGCTGGTCGT CATCGGACCG GAGGCCCCGC TGGTCTCCGG TGTGGCGGAC
GCCCTGCGCG ACCGGGGCAT CCCGGTGTTC GGCCCCGACC AGGAGGCCGC ACGCCTGGAG
GGCTCCAAGG CCTTCGCCAA GGAGGTCATG GAGGCCGCCG GGGTGCCCAC CGCCAAGGCG
CGGGTGTGCA GGACCGCCAG CCAGGTGTCC GAGGCCCTCG ACGAGTTCGG CACGCCCTAC
GTGGTCAAGA ACGACGGCCT GGCCGCGGGC AAGGGCGTCG TGGTGACCGA GGACCGCGCC
CTCGCCGAGC AGCACGCCCG GGAGTGCGGC CGCGTGGTCA TCGAGGAGTT CCTCGACGGC
CCCGAGGTGT CCCTCTTCGT GCTGAGCGAC GGGCTGCACG CCCTGCCGCT GCTGCCCGCC
CAGGACTTCA AGCGCGCCTA CGACGGCGAC CAGGGCCCCA ACACGGGCGG CATGGGCGCG
TACGCGCCGC TGCCGTGGGC CCCGGCGGGC CTGGTGGACG AGGTGATGGA GTCGGTCGTG
CGGCCGACCC TGGTGGAGAT GAACCGGCGC GGTAAGCGCT ACCAGGGCCT GCTGTACGTG
GGGCTGGCGC TCACGTCGCG GGGTCCGCGC GTGGTGGAGT TCAACGCCCG GTTCGGCGAC
CCGGAGACCC AGGTGGTCCT GGACAGGCTG GCCACCCCGA TCGGCGCCGT CCTCCAGGCC
ACCGACACCG GCGGCCTGGG GGGCATCGGC TCCCTCCAGT GGAAGTCGGG CGCCGCGGTC
ACCGTGGTGG TCGCCGCCGA GAACTACCCG GGCGACCCGG TCAAGGGCGA CGTCATCGGC
GGCCTGGACC AGGCCAACGC GATGGAGGGC GCGTACGTGC TGCACGCGGG CACCGACTGG
GAGGGCTCGG GCGGCGTCAA GGCGAGCGGA GGCCGGGTGC TCAACGTGGT CGGCACCGGG
ATCGACCTGC GCCAGGCGCG CGAGCGGGCC TACGAGGCCG TGGCGCGCAT CGAGCTGCGC
GGCTCGTTCC ACCGCACCGA CATCGCCGAG CGCGCCGCGG CCGAACTGTA G
 
Protein sequence
MKALVLGGGG REHALVRALS LDPGVTSIHS APGNPGISEL AENHVLNVTD GLAVTELAAR 
IRAELVVIGP EAPLVSGVAD ALRDRGIPVF GPDQEAARLE GSKAFAKEVM EAAGVPTAKA
RVCRTASQVS EALDEFGTPY VVKNDGLAAG KGVVVTEDRA LAEQHARECG RVVIEEFLDG
PEVSLFVLSD GLHALPLLPA QDFKRAYDGD QGPNTGGMGA YAPLPWAPAG LVDEVMESVV
RPTLVEMNRR GKRYQGLLYV GLALTSRGPR VVEFNARFGD PETQVVLDRL ATPIGAVLQA
TDTGGLGGIG SLQWKSGAAV TVVVAAENYP GDPVKGDVIG GLDQANAMEG AYVLHAGTDW
EGSGGVKASG GRVLNVVGTG IDLRQARERA YEAVARIELR GSFHRTDIAE RAAAEL