Gene Ndas_0714 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNdas_0714 
Symbol 
ID9244556 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardiopsis dassonvillei subsp. dassonvillei DSM 43111 
KingdomBacteria 
Replicon accessionNC_014210 
Strand
Start bp876067 
End bp877416 
Gene Length1350 bp 
Protein Length449 aa 
Translation table11 
GC content76% 
IMG OID 
Productallantoinase 
Protein accessionYP_003678665 
Protein GI297559691 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.866042 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones23 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCCAGC ACGATCTCGC CGTCCGCGCC CGCCGCGCCC TCACCCCCGA GGGGGAGAGG 
CCCGTCACCG TCGGGGTGCG CGACGGGCGC GTGGTCAACG TCCTGGAGGG GGAGTCCGCG
CCGCTCATCG CCCGCCGCGA GATCACCCTG GCCGGGGACG AGGTGCTGCT GCCGGGCCTG
GTCGACACCC ATGTGCACGT CAACGAGCCC GGGCGCACCG AGTGGGAGGG CTTCGCCACC
GCCACGCGGG CCGCCGCCCT GGGCGGGATC ACGACCCTGG TGGACATGCC GCTCAACAGC
GTGCCGCCGA CCACCACCGT CGCCGGGCTC GACGCCAAGC AGGCCGCCGC CGATGGCAAG
CTCGCCGTGG ACGTGGGCTT CTGGGGCGGG GCGGTCCCCG AGAACAGCCG CTCCGGCGCC
ACGAAGGAGC TCGCCGCGCT CTGGGAGCGC GGGGTGTTCG GGTTCAAGGC GTTCCTGTCC
CCGTCGGGGG TGGAGGAGTT CGGCCACCTG TCCCAGGAGG AGCTGTACTC GGCCGCGGAG
GCGATCGGGG AGTTCGGCGG ACGGCTCATC GTGCACGCCG AGGACCCCGG CGTCCTGGAC
GGGGCGCCGC CCGCGGCCGG GCGCGACTAC GCCTCCTTCC TGGCCTCGCG GCCCGACACC
GCCGAGAACG AGGCGATCGC CCGGGTGATC GACGTGGCCA GGGCCACCGG CACCCACGCG
CACGTGCTGC ACCTGTCCAG CGCCTCCGCC CTGCCGCTGA TCGCCCGGGC CAGGGCCGAG
GGCGTGCCGC TGACGGTGGA GACCTGCCCG CACTACCTCA CCCTGGAGGC CGAGGGCGTG
CCCGCGGGGG CCACCCAGTA CAAGTGCTGC CCGCCCATCC GGGACGCGGC CAACCGCGAC
CTGCTGTGGC GCGCGCTGGC CGACGGGCTG ATCGACTGCG TGGTCAGCGA CCACTCGCCC
AGCACCCCCG ACCTCAAGGA TCTGGACACC GGCGACTTCG GCACCGCGTG GGGCGGGGTG
TCCGGCCTCC AGGTGGGCTT CTCCGCGGTG TGGACCGAGG CCCGCCGCCG CGGCGGATCC
CTGGCGGACG TGGTGCGGTG GATGTCGTCC GGGCCCGCGC GGGTGGCCGG ACTGCGCGGC
AAGGGCGCCA TCGCCGAGGG CGCCGACGCC GACTTCGCCG TGGTCGCCCC GGAGGAGTCC
TTCCGGGTGG ACGTGCGCGC GCTGGAGCAC CGCAACCCGG TGAGCCCCTA CGACGGGGCC
GAACTGCTCG GGCGGGTGCG CCGCACCGTC CTGCGCGGTC GGGACGTGGG GCCCGGGGAC
AGGGCCGGGC GGATGCTCGT CCGCGGGTAG
 
Protein sequence
MTQHDLAVRA RRALTPEGER PVTVGVRDGR VVNVLEGESA PLIARREITL AGDEVLLPGL 
VDTHVHVNEP GRTEWEGFAT ATRAAALGGI TTLVDMPLNS VPPTTTVAGL DAKQAAADGK
LAVDVGFWGG AVPENSRSGA TKELAALWER GVFGFKAFLS PSGVEEFGHL SQEELYSAAE
AIGEFGGRLI VHAEDPGVLD GAPPAAGRDY ASFLASRPDT AENEAIARVI DVARATGTHA
HVLHLSSASA LPLIARARAE GVPLTVETCP HYLTLEAEGV PAGATQYKCC PPIRDAANRD
LLWRALADGL IDCVVSDHSP STPDLKDLDT GDFGTAWGGV SGLQVGFSAV WTEARRRGGS
LADVVRWMSS GPARVAGLRG KGAIAEGADA DFAVVAPEES FRVDVRALEH RNPVSPYDGA
ELLGRVRRTV LRGRDVGPGD RAGRMLVRG