Gene Ndas_2044 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNdas_2044 
Symbol 
ID9245894 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardiopsis dassonvillei subsp. dassonvillei DSM 43111 
KingdomBacteria 
Replicon accessionNC_014210 
Strand
Start bp2465706 
End bp2467076 
Gene Length1371 bp 
Protein Length456 aa 
Translation table11 
GC content72% 
IMG OID 
ProductGlutamate dehydrogenase (NADP(+)) 
Protein accessionYP_003679976 
Protein GI297561002 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.859407 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.109712 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCCTTCG CCGACGAGGC AGGCCGTGAC CACCACCCCG GAGTCCGATC CGCCTACGAG 
CACGTCGTGC GCCGCAACCC CGACGAGCCG GAGTTCCAGC AGGCGGTGCT GGAGGTCCTG
GACGACCTCT CCCCGGCCCT GACCCGGCAC CCCGAGTACG CCGAACAGCG CATCCTGGAG
CGCCTGTGCG AGCCCGAGCG CCAGGTGGTC TTCCGGGTGC CCTGGCGCGA CGACCAGGGC
CGGGTGCACG TCAACCGGGG CTTCCGGGTG GAGTTCAACA GCGCGCTGGG CCCCTACAAG
GGCGGGCTGC GCTTCCACCC CAGCGTCAAC CTCGGCGTGA TCAAGTTCCT GGGCTTCGAG
CAGATCTTCA AGAACGCCCT GACCGGCATG AACATCGGCG GCGGCAAGGG CGGCAGCGAC
TTCGACCCCA AGGGCCGCTC CGAGGCGGAG ATCGAGCGGT TCTGCCAGTC GTTCATGACC
GAGCTGCACC GCCACCTGGG TGAGCACACC GACGTCCCGG CCGGGGACAT CGGCGTGGGC
GGCCGCGAGA TCGGCTACCT GTTCGGCCAG TACCGGCGCC TGGCCAACCG CTGGGAGGCC
GGGGTGATCA CCGGCAAGGG CCTGGAGTGG GGCGGCTCAC GGGTGCGCAC CGAGGCCACC
GGGTACGGCA GCGTGCTCTT CACCCAGCGG ATGCTGGAGC GGTCGGACCG CGCTCTGGAC
GGCCAGCGCG TGGTGGTCTC CGGGTCGGGC AACGTGGCGA TCTACTCGGT GGAGAAGGCC
CAGCAGCTGG GCGCGAACGT GGTGGCCGTC TCCGACTCCG GCGGCTACGT GGTGGACGAC
AAGGGCCTGG ACCTGGACCT GCTCAAGCAG GTCAAGGAGG TCGAACGCGC CCGGATCAGC
GTCTACGCCG AGCGTCGCGG CGGGGGCGCC CACTACGTCG AGGGCGGCAG CATCTGGGAC
GTGCCGTGCG ACGTGGCGCT GCCCTCGGCC ACCCAGAACG AGCTGGACGC CGACGCGGCG
CGCACACTGG TGCGCAACGG CGTGGTGGCG GTGGCGGAGG GCGCGAACAT GCCCACCACA
CCCGAGGCGG TCAAGGTGCT GCGCGAGGCC GGTGTGCTGT TCGCGCCGGG CAAGGCGGCC
AACGCCGGCG GTGTGGCCAC CAGCGTGCTG GAGATGCGCC AGAACGCGCG GCGCACGTCG
TGGTCCTTCG AGCACGCCGA GGCCGAGCTG GCCGAGACGA TGGCGGGCAT CCACGACCGC
TGCGAGGAGG CGGCCGAGCG CTACGGGGCG CCGGGCGACT ACGTGCTGGG CGCCAACGTG
GCCGGGTTCG AGCGGGTGGC GGGCGCGATG CTGGCCCAGG GCCTGATCTA A
 
Protein sequence
MPFADEAGRD HHPGVRSAYE HVVRRNPDEP EFQQAVLEVL DDLSPALTRH PEYAEQRILE 
RLCEPERQVV FRVPWRDDQG RVHVNRGFRV EFNSALGPYK GGLRFHPSVN LGVIKFLGFE
QIFKNALTGM NIGGGKGGSD FDPKGRSEAE IERFCQSFMT ELHRHLGEHT DVPAGDIGVG
GREIGYLFGQ YRRLANRWEA GVITGKGLEW GGSRVRTEAT GYGSVLFTQR MLERSDRALD
GQRVVVSGSG NVAIYSVEKA QQLGANVVAV SDSGGYVVDD KGLDLDLLKQ VKEVERARIS
VYAERRGGGA HYVEGGSIWD VPCDVALPSA TQNELDADAA RTLVRNGVVA VAEGANMPTT
PEAVKVLREA GVLFAPGKAA NAGGVATSVL EMRQNARRTS WSFEHAEAEL AETMAGIHDR
CEEAAERYGA PGDYVLGANV AGFERVAGAM LAQGLI