Gene Noca_3933 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNoca_3933 
Symbol 
ID4598068 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardioides sp. JS614 
KingdomBacteria 
Replicon accessionNC_008699 
Strand
Start bp4139472 
End bp4140890 
Gene Length1419 bp 
Protein Length472 aa 
Translation table11 
GC content76% 
IMG OID639778539 
Producthydrolase 
Protein accessionYP_925118 
Protein GI119718153 
COG category[R] General function prediction only 
COG ID[COG0546] Predicted phosphatases
[COG1606] ATP-utilizing enzymes of the PP-loop superfamily 
TIGRFAM ID[TIGR00268] conserved hypothetical protein TIGR00268 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.158663 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCACGCGG CGCCCCTGGT GGTCGGTTTC GACCTCGACA TGACGCTGAT CGACACCGTG 
CCGGGCTTCA GTGCGACGCT GCTGGCGCTC GGCGCGGAGC TCGGCGTCGA GTTCCCGGTC
GAGGACCTGA TCACCCGTCT CGGACCGCCG CTCGACCTGC TGCTGGGGGA GCACCTCGCC
GCGGACGCGG TGGCGCCCGC CGGCGACCGG TTCCGGGCGC TCTACCCCGA CCACGCCATC
GCCCCGGTCC CGCTCCTCGC CGGCGCCGAG GACGCCCTGG CGGCTGTCCG CCGGCACCGT
GGCCGGGTGC TGGTCGTGAC CGGGAAGTAC CCCGCCAACG CGCGCCTGCA CCTCGATCAC
CTCGGCGTGG AGGTGGACCA CCTCGAGGGG TGGGTGTGGG GCGTCGGCAA GGCGGACGTG
CTGCGCCGCG AGGGCGCGAC CATCTACGTC GGCGACCACG TCCACGACGT CGAGGGCGCC
AAGGCCGCCG GCGCGCTGAG CGTCTCGGTG CTCACCGGCG GGTCGACGCG GGAGGAGCTG
GTCGCGGCCG GCACCGACGT GCTCCTCGGC AGTCTCGCGG AGTTCCCGGA CTGGCTCGAG
GAGCACCTGC TGCAGACCCG GCTCGACGCG CTCGCGGCCG ACCTGCGCGA GCGTGGCTCG
GTGCTGGTCG CCTACAGCGG GGGCGCAGAC AGCGCGTTCC TCCTGGCCGC TGCCGTCCGC
GCGCTGGGCG CCGACCGCGT CGCGGCCGCC ACCGGCTACT CGCACTCGCT GCCGCTGGCC
GAGCGTGACC CGGCACGCGA CTTCGCGGCC GCGCTCGGCG TCGAGGTGCT CACCCCGGCC
ACCCACGAGA TCGAGCGCGA GGGCTACCGG TCCAACGGGG CGGACCGCTG CTACTTCTGC
AAGGCCGAGC TGCTCGACGT GCTCACCCCG CTCGCCGCCG GTCGCGGGCT GGCCCACGTC
GCCACCGGCA CCAACGCCGA CGACGCCGTC GCGGGCTTCC GGCCCGGCAT CCGCGCAGCC
GACGAGCGCG GCGCGATCGC GCCCCTGCGC GACGCCGGGC TGACCAAGGC CCAGGTCAGG
GAGGCCTCCC GCCGCTGGGA CCTGCCGACC TGGGACAAGC CGGCGGCCGC CTGCCTGTCC
TCGCGGGTCG CGTACGGCGT CGAGGTGACG CCGTACCGAC TGGGCCGGGT GGAGCGGGCC
GAGACCGCGG CGCGGGCGCT GCTGGCGGCC GTCGGGCTGC GCAACCTGCG GGTCCGTGAC
CTCGGCGAGC GCGCCTGCGT GGAGGTCGAC GCGGCCCTGC TGCCGCTGGC CGCCGACGTG
GAGGCCCGGC TGCTGGACGC GGTGCGCGGG GCCGGGTTCG CGAGTGCCGA GGTGGACCGG
CGCGGGTTCC GTTCGGGGTC GATGAACGAG GCGCTATAG
 
Protein sequence
MHAAPLVVGF DLDMTLIDTV PGFSATLLAL GAELGVEFPV EDLITRLGPP LDLLLGEHLA 
ADAVAPAGDR FRALYPDHAI APVPLLAGAE DALAAVRRHR GRVLVVTGKY PANARLHLDH
LGVEVDHLEG WVWGVGKADV LRREGATIYV GDHVHDVEGA KAAGALSVSV LTGGSTREEL
VAAGTDVLLG SLAEFPDWLE EHLLQTRLDA LAADLRERGS VLVAYSGGAD SAFLLAAAVR
ALGADRVAAA TGYSHSLPLA ERDPARDFAA ALGVEVLTPA THEIEREGYR SNGADRCYFC
KAELLDVLTP LAAGRGLAHV ATGTNADDAV AGFRPGIRAA DERGAIAPLR DAGLTKAQVR
EASRRWDLPT WDKPAAACLS SRVAYGVEVT PYRLGRVERA ETAARALLAA VGLRNLRVRD
LGERACVEVD AALLPLAADV EARLLDAVRG AGFASAEVDR RGFRSGSMNE AL