Gene Ndas_2998 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNdas_2998 
Symbol 
ID9246851 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardiopsis dassonvillei subsp. dassonvillei DSM 43111 
KingdomBacteria 
Replicon accessionNC_014210 
Strand
Start bp3582023 
End bp3583561 
Gene Length1539 bp 
Protein Length512 aa 
Translation table11 
GC content78% 
IMG OID 
Productformiminoglutamate deiminase 
Protein accessionYP_003680914 
Protein GI297561940 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value0.558818 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACGACCG GCGCGGACTC CGGCCCCGCC GCCTCCGCCC CCGGCGGGAG CGCCGGTCCT 
GCGGCATGCG GAACCGCCTC CGGGCGTTCC CGGGGGCCGC GCGACCACCG CCTGTGGTGC
GAGTGGGCGT GGACCGGCGC CGAGGACGGC ACTCCCGAGC ACGGCGTGCT CGTGGAGGTG
GCCGACGGCC GCATCACGTC GGTGACGGTC GCGACGCCGC GCCCCGGGGA CGCCGAGACC
CTCACCGGCC TCACCCTGCC CGGGCTCGCC AACGCCCACT CGCACGCCTT CCACCGGGCG
CTGCGGGGGC GCACCCACGC CGGCGGCGGA TCCTTCTGGA CCTGGCGCGA GACCATGTAC
CGCGTTGCCG AGCGGCTGGA CCCCGACACC TACCACCGAC TCGCCCGGGC GGTCTACGTC
GAGATGGCCC TGGCCGGGAT CACCTGCGTG GGCGAGTTCC ACTACCTGCA CCACGCACCC
GGCGGGGATC GCTACGCCGA CCCCAACGCC ATGGGCCACG CCCTGGCGGC CGCCGCGGCC
GACGCGGGGA TCCGGATCAC CCTGCTGGAC GTGTGCTACC TGTCCGGCGG GCTGGACGGG
AACGGCGTCC ACCAGCCGCT GGCCGGGCCC CAGCTGCGCT TCGGCGACGG GGACGCGGAC
GGGTGGGCCG AACGCGCCGC CGCCTTCCGT CCCGGGGGCG GGCACGTGCG CACGGGAGCG
GCCGCCCACT CGGTGCGCGC CGTCCCCGCC GCGCAGCTGC CCGAGGTGGC CGCTTTCGCC
GCCGGGCGCG ACGCCGTGCT GCACGTCCAC GTCTCCGAGC AGCCCGGTGA GAACGCCGCC
TGCCTGGCCG CCTACGGCCG CACCCCCACG GCCGTGCTCG CCGACGCGGG CGCGCTCACC
CCGCGCACCA GCCTGGTGCA CGCCACCCAT CTGAGCGACG CCGACGTGGC GGCCGTCCGC
GCGGCCGGGT CCACGGTGTG CCTGTGCCCC ACGACCGAGC GCGACCTGGC CGACGGCCTG
CCGCGCACCG GCGACCTGCT GCCCGCCCCG CTCAGCCTGG GCACCGACCA GCACGCCCTG
ACCGACATGT TCGAGGAGGC CAGGGCGGTC GAACTCCACG AGCGCCTGCG CACCCACCGG
CGCGGCACCC TGGGCGCCGG GGAGCTGCTG CGCGCGGCCA CCGCGCACGG GCACGCCAGC
CTCGGCTGGA CCCGCGAACC GGGGGCCGCC GCCCCCGGGG CGTCCGAGGG GTCGGCGCAC
GTCGGAGCGG GACCACAGGA ACCCCCGCGC GGGGCTTCCG ACGCCGGTGT CCTGGCCCCC
GGGGCGCGGG CCGACCTGGT CAACGTCCCC CTGGACGGAA CCCGCCTGGC CGGGGCCGAC
CCCGCCCGGG CCGCCGACGC CGTCGTCTTC GCCGCCGCCT CCGCCGACGT GCGGCACGTG
ATGGCCGACG GGCGCTGGAC CGTCCGCGAC GGCGTCCACA CCCTGGTTCC CGACACCGCG
CGCGAACTCG ACACGGTCAT CAAGGAGGTC CTCACATGA
 
Protein sequence
MTTGADSGPA ASAPGGSAGP AACGTASGRS RGPRDHRLWC EWAWTGAEDG TPEHGVLVEV 
ADGRITSVTV ATPRPGDAET LTGLTLPGLA NAHSHAFHRA LRGRTHAGGG SFWTWRETMY
RVAERLDPDT YHRLARAVYV EMALAGITCV GEFHYLHHAP GGDRYADPNA MGHALAAAAA
DAGIRITLLD VCYLSGGLDG NGVHQPLAGP QLRFGDGDAD GWAERAAAFR PGGGHVRTGA
AAHSVRAVPA AQLPEVAAFA AGRDAVLHVH VSEQPGENAA CLAAYGRTPT AVLADAGALT
PRTSLVHATH LSDADVAAVR AAGSTVCLCP TTERDLADGL PRTGDLLPAP LSLGTDQHAL
TDMFEEARAV ELHERLRTHR RGTLGAGELL RAATAHGHAS LGWTREPGAA APGASEGSAH
VGAGPQEPPR GASDAGVLAP GARADLVNVP LDGTRLAGAD PARAADAVVF AAASADVRHV
MADGRWTVRD GVHTLVPDTA RELDTVIKEV LT