Gene Noca_4680 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNoca_4680 
Symbol 
ID4598224 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardioides sp. JS614 
KingdomBacteria 
Replicon accessionNC_008699 
Strand
Start bp4964017 
End bp4965606 
Gene Length1590 bp 
Protein Length529 aa 
Translation table11 
GC content72% 
IMG OID639779289 
Productprotein of unknown function DUF853, NPT hydrolase putative 
Protein accessionYP_925862 
Protein GI119718897 
COG category[R] General function prediction only 
COG ID[COG0433] Predicted ATPase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0218814 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACTGCGG ACATGACGTC GCCGATCGCC GAGCAGGTCG CCGCGGGCTA CCGCTTCGAG 
GGTCCGGCCC TCGAGCTCGG CGCCCTGATG CTCGCCGCCG ACCAGCTGGT CGACGTACCG
GTCCGGATTC CGTTGGCGAT GCTGAACCGG CACGGCCTGG TGGCCGGGGC GACCGGCACC
GGCAAGACCA GGACTCTTCA GCTGCTCGTC GAGCAGCTCA GCGCCCAGGG CGTCCCGGTC
TTCGCCGCGG ACATCAAGGG CGACCTGTCC GGGCTGGCCC AGCCGGGCAC CGCGAGCGAG
AAGCTCAGCG CCCGGGCCGC CACCGTCGGC CAGGAGTGGG CGGCCGCCGG CTTCCCCGTG
GAGTTCTACG CGATCGGCGG CGTCGGGCCC GGGCTGCCGC TGCGGGTCAC CATGAGCGCG
TTCGGGCCGA CCCTGCTGAG CAAGGTGCTG GGCCTCAACG ACACCCAGGA GTCCAGCCTG
GGGCTGGTCT TCCACTACGC CGACCGGGCC GGCCTGCCGC TGCTCGACCT CGCCGACCTC
CGCGCGGTGC TCGCGCACCT GCTCAGCGAC GAGGGCAAGG CCGAGCTCAA GGCGCTGGGT
GGGCTGTCGT CGGCGACCGC CGGGGTGATC CTGCGCGAGC TGATCGGCCT GGAGGACCAG
GGCGGCGACG TGTTCTTCGG CGAGCCGGAG TTCGAGTCGG CGGACCTGCT CCAGCTCGCC
CCCGACGGCC GCGGCCTCGT CTCGCTGGTC GAGCTGCCGC AGCTGCAGGA CCGGCCGGCG
ATCTTCTCGA CGTTCCTGAT GTGGCTGCTC GCCGACCTGT TCCACGACCT TCCCGAGGTC
GGGGACGTGG ACAAGCCGAG GCTGGTGTTC TTCTTCGACG AGGCGCACCT GCTCTTCGCC
GACGCGTCCA AGGCGTTCCT CGACCAGGTC GCCCAGACCG TGCGGCTGAT CCGGTCGAAG
GGGGTCGGGG TGTTCTTCGT GACCCAGAGC CCCACCGACG TGCCCGACGC GGTGCTCGCC
CAGCTCGGTT CGCGGATCCA GCACCAGCTG CGCGCGCACA CCCCCAACGA CGCCAAGGCG
CTCAAGGCGA CCGTGGCGAC CTACCCGACC AGTGGGTACG ACGACCTCGG GCAGGTCATC
ACCGGCCTCG GGATCGGCGA GGCCGTGGTG ACCGTGATGA ACGAGCGCGG TGCGCCGACG
CCGGTGGCCT GGACCCGCCT GCGGGCGCCC CAGTCGCGGA TGGATCCGTG CGATCCCGAC
GTCCTCACCG CCACCGTCGC GGCCAGCCCG CGGGCCGCGA AGTACCAGGC CGCGATCGAC
CGGGAGTCCG CGCGCGAGAT CCTCGCCGAC CGGCTCGAGC AGGGTGCCGC GAAGCAGGAC
CGCGAGCAGG CGGGCGCCCC CGGCCCGGAC CCTGATCCGG CGCCGCGCCC GGTGCCGGTC
CCGAAGCCCA GCACCGACAA GCCCAGCAGC AGGCCCCCGA AGGACGACAG CGTGGTCGAG
CAGGTCGTGA AGTCCGACGC GTTCAAGGAC TTCATGCGTA CCGCCGCCCG CGAGATCGCG
CGGGGGATGT TCAAGACCGG CCGGCGCTGA
 
Protein sequence
MTADMTSPIA EQVAAGYRFE GPALELGALM LAADQLVDVP VRIPLAMLNR HGLVAGATGT 
GKTRTLQLLV EQLSAQGVPV FAADIKGDLS GLAQPGTASE KLSARAATVG QEWAAAGFPV
EFYAIGGVGP GLPLRVTMSA FGPTLLSKVL GLNDTQESSL GLVFHYADRA GLPLLDLADL
RAVLAHLLSD EGKAELKALG GLSSATAGVI LRELIGLEDQ GGDVFFGEPE FESADLLQLA
PDGRGLVSLV ELPQLQDRPA IFSTFLMWLL ADLFHDLPEV GDVDKPRLVF FFDEAHLLFA
DASKAFLDQV AQTVRLIRSK GVGVFFVTQS PTDVPDAVLA QLGSRIQHQL RAHTPNDAKA
LKATVATYPT SGYDDLGQVI TGLGIGEAVV TVMNERGAPT PVAWTRLRAP QSRMDPCDPD
VLTATVAASP RAAKYQAAID RESAREILAD RLEQGAAKQD REQAGAPGPD PDPAPRPVPV
PKPSTDKPSS RPPKDDSVVE QVVKSDAFKD FMRTAAREIA RGMFKTGRR