Gene Ndas_5106 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNdas_5106 
Symbol 
ID9248998 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardiopsis dassonvillei subsp. dassonvillei DSM 43111 
KingdomBacteria 
Replicon accessionNC_014211 
Strand
Start bp252896 
End bp253945 
Gene Length1050 bp 
Protein Length349 aa 
Translation table11 
GC content74% 
IMG OID 
Productadenosine deaminase 
Protein accessionYP_003682993 
Protein GI297564020 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.582125 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value0.494241 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGAGACAC CGACCAGCGC CCGCCGGTTG GACCGGCTGC CCAAAGCACA CCTGCACCTG 
CACTTCACCG GATCGATGCG CCATCCCACC CTGGTGGAGC TGGCCGCCGA GCACGGCATC
CACCTCCCCC AGGCCCTGGT CGAGGAGTGG CCGCCCAGGC TCCGCGCCAC GGACGAGCGC
GGCTGGTTCC GCTTCCAGCG CCTGTACGAC ATCGCCCGGT CGGTGCTGCG CAGACCCGAG
GACGTGTACC GGCTGCTGCG CGAGGCGGCC GAGGACGAGC GCGCGGCCGG GTCCGGCTGG
CTGGAGATCC AGGTGGACCC GAGCGGTTAC GCGTCGCTCT TCGACGGGCT CACCGCCACC
CTGGAGCTGT TCCTGGACGC GGCCCGCGCC GCCGAGCGCG AGACCGGCGT GCACGTCGGC
CTGATGGTGG CGGCCAACCG CACCAGGCAC CCGCTGGACG CCAAGGTGCT GGCCCGCCTG
GCCCGCCAGT ATGCGGGCAG GGGCGTGGTG TCGTTCGGCC TGAACAACGA CGAGCGGCGT
GGCCGCGCCC TGGAGTTCGA GGGGGCGTTC CGGATCGCGC GGCGGGCCGG GCTGCTCTCC
GCTCCGCACG GCGGCGAGCT CCAGGGACCC CGCAGCGTGC GCGAGTGCCT GGACGTGCTG
GACGCCGACC GGATCGGGCA CGGTGTGCGG GCCGTGGAGG ACCCGCGGCT GGTGGAGCGG
ATCGCCGAAC GCGGGGTGAC CCTGGAGGTC TGCCCGACCT CCAACGTGGG CCTGGGGGTG
TACGACGACC TGGGGCAGCT GCCGCTGCGC ACGCTCTTCG ACGCCGGGGT TCCGGTCGCT
CTGGGCACCG ACGACCCGCT GCTGTTCGGA CCGCGCCTGG TGGAGCAGTA CCGGATCGCC
CGCGAGGTGC TCGGGTTCTC CGACCCGGAG CTGGCCGAGC TGGCGCGGAT GTCGGTCCGC
GGCTCGGGCG CGCCGGAGTC GCTGCGCAAG GAGCTGCTGG CCGGGGTGGA CGCGTGGCTG
GCCGCCGACC CGGAGCCCGT CGGGGACTGA
 
Protein sequence
METPTSARRL DRLPKAHLHL HFTGSMRHPT LVELAAEHGI HLPQALVEEW PPRLRATDER 
GWFRFQRLYD IARSVLRRPE DVYRLLREAA EDERAAGSGW LEIQVDPSGY ASLFDGLTAT
LELFLDAARA AERETGVHVG LMVAANRTRH PLDAKVLARL ARQYAGRGVV SFGLNNDERR
GRALEFEGAF RIARRAGLLS APHGGELQGP RSVRECLDVL DADRIGHGVR AVEDPRLVER
IAERGVTLEV CPTSNVGLGV YDDLGQLPLR TLFDAGVPVA LGTDDPLLFG PRLVEQYRIA
REVLGFSDPE LAELARMSVR GSGAPESLRK ELLAGVDAWL AADPEPVGD