Gene Noca_3533 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNoca_3533 
Symbol 
ID4595715 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardioides sp. JS614 
KingdomBacteria 
Replicon accessionNC_008699 
Strand
Start bp3743754 
End bp3744875 
Gene Length1122 bp 
Protein Length373 aa 
Translation table11 
GC content71% 
IMG OID639778141 
Productadenosine deaminase 
Protein accessionYP_924720 
Protein GI119717755 
COG category[F] Nucleotide transport and metabolism 
COG ID[COG1816] Adenosine deaminase 
TIGRFAM ID[TIGR01430] adenosine deaminase 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.0451471 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACCGTGT CGCCGAGCCC GTCCCCGAAC CCGTCCCCGA CCCTCGACCA GGTGCAGCGG 
GCGCCGAAGG CGCTGCTGCA CGACCACCTG GACGGCGGGC TGCGGCCCCA GTCGATCATC
GAGCTCGCAG CGGAGATCGG GCACCAGCTG CCGGCCGTGG AGGGCGACCT GTCCGCCGAG
TCGCTCGGCC GCTGGTTCGC GGAGGCCGCC GACTCCGGCT CGCTGGAGCG CTACCTGGAG
ACCTTCGACC ACACGGTCTC GGTGATGCAG ACGGCCTCGG CGCTGACCCG GGTGGCGCGG
GAGTGCGTGG AGGACCTGGT CGCCGACGGC GTGGTGTACG CCGAGGTCCG CTACGCGCCC
GAGCAGCACG TGGTCCAGGG GCTGAGCCTC GACGAGGTCG TCGCGGCGGT CCAGGAGGGC
TTCGACCAGG GCGTGGAGGC GGCCGGCGGG CGGATCGTCG TCCGCCAGCT GCTCACCGCG
ATGCGGCACC AGGCTCGGTC GATGGAGATC GCCCACCTCG CGGTCGCGTG GCGCGATCGC
GGCGTCGCCG GCTTCGACAT CGCCGGTGCC GAGGCCGGCT ATCCCCCCAC CCGCCACCTG
GACGCGTTCG AGTACCTGCA GCGGGAGAAC GCCCACTTCA CGATCCACGC CGGCGAGGGC
TTCGGGCTGC CGTCGATCTG GCAGGCCATC CAGTGGTGTG GCGCCGACCG GCTCGGGCAC
GGCGTCCGGA TCATCGACGA CATCACGGTC GCCGAGGACG GGGCCGTGAG CCTCGGCCTG
CTGGCGGCGT ACGTCCGCGA CAAGCGGATC CCGCTGGAGA TGTGCCCCTG GTCGAACGTG
CAGACCGGCG CGGCCACCTC GATCGCCGAG CACCCGATCG GGCTGCTGAA GCGGCTCGGC
TTCCGGGTGA CGGTGAACAC CGACAACCGG CTGATGAGCC GCACCTCCGT GACCCACGAG
CTGTGGTCGT TGGTCGAGGC GTTCGGCTAC GGGTTGAAGG ACCTGGAGTG GTTCACGATC
AACGCGATGA AGTCGGCGTT CCTGCCCTTC GACGAGCGGC TGGCGCTGAT CACCGATGTG
ATCAAGCCGG AGTACGCCGT GCTCAAGGCC GAGCACGCGT GA
 
Protein sequence
MTVSPSPSPN PSPTLDQVQR APKALLHDHL DGGLRPQSII ELAAEIGHQL PAVEGDLSAE 
SLGRWFAEAA DSGSLERYLE TFDHTVSVMQ TASALTRVAR ECVEDLVADG VVYAEVRYAP
EQHVVQGLSL DEVVAAVQEG FDQGVEAAGG RIVVRQLLTA MRHQARSMEI AHLAVAWRDR
GVAGFDIAGA EAGYPPTRHL DAFEYLQREN AHFTIHAGEG FGLPSIWQAI QWCGADRLGH
GVRIIDDITV AEDGAVSLGL LAAYVRDKRI PLEMCPWSNV QTGAATSIAE HPIGLLKRLG
FRVTVNTDNR LMSRTSVTHE LWSLVEAFGY GLKDLEWFTI NAMKSAFLPF DERLALITDV
IKPEYAVLKA EHA