Gene Noca_1809 details

Gene Information

Locus tag: Noca_1809
Symbol:
ID: 4597646
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nocardioides sp. JS614
Kingdom: Bacteria
Replicon accession: NC_008699
Strand:
Start bp: 1923920
End bp: 1925932
Gene Length: 2013 bp
Protein Length: 670 aa
Translation table: 11
GC content: 70%
IMG OID: 639776408
Product: alpha amylase, catalytic region
Protein accession: YP_923008
Protein GI: 119716043
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG0366] Glycosidases
TIGRFAM ID:
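
The gene and protein lengths listed above follow from the start and end coordinates. The sketch below shows that arithmetic, assuming the coordinates are 1-based and inclusive and that the unspliced CDS ends in a stop codon that is not counted in the protein length. The function names are hypothetical and are not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_length_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the last codon is a stop."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


length_bp = gene_length(1923920, 1925932)
print(length_bp)                  # 2013
print(protein_length(length_bp))  # 670
```

Both results agree with the Gene Length and Protein Length fields of this record.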


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value: 0.341723
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Fosmid covering clones (Num covering fosmid clones): n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGTCGGAC GCATCCCCGT CATGAACGTC ACGCCACTGG TCGACCTCGG CCGGCAACCA 
GCGAAGGCCA CGGTCGGAGA GCCCTTCCCG GTGACCGCGA CGATCTTCCG CGAGGGCCAC
GACAAGCTGG GCGCCGAGGT GGTGCTCACC GGTCCCGACG GCCGCCGCCG GGCGCCGGTC
CGGATGACCA AGCACGCGAC GGTCCCGGAC CACTACGAGG CCTGGGTGAC CCCCGACACG
GGCGGCGCCT GGAGCTTCGA GGTGATGGCC TGGTCCGACC CGCTCGCGAC CTGGCAGCAC
AACGCCGCGC TCAAGGTCCC GGCCGGCGTC GACGTCGACC TGATGTTCAC CGAGGCCCGG
ATCCTGCTCG AGAAGGTGAT CGCCTCGCTC GACCCCCGCG ACCCGCCCGC GGCCGACGCC
GCCCAGGTCC TCCAGGGTGC CCTGGACACG GCCACCGACA CCAAGCGGCC ACCGGCCGCC
CGGCTGGCCG TGCTCGAGGC CCCCGACGTC ACCGGCGTGC TCGCCACCCA CCCGATCCGC
GAGCTGGTCA CGGTCGAGGG CCCCTACCCG GCGTACGCCG ACCGGCCGAA GGCGCTCACC
AGCAGCTGGT ACGAGTTCTT CCCGCGCTCG GAGGGCGCCA CCAAGGACCT GAAGACCGGC
AAGGTCACCA GCGGCACATT CGCGACCGCC GCCAAGCGCC TGGACGCCGT GGCGGCGATG
GGGTTCGACG TCATCTACCT GCCGCCGATC CACCCGATCG GCGAGGTCAA CCGCAAGGGC
CCGAACAACA CCGTCGACCC CGGTCCCGAC GCAGCACCGA AGGACTGGGC GGGGTCGCCG
TGGGCGATCG GCTCCAAGGA CGGCGGCCAC GACGCGATCC ACCCCGACCT CGGCACCTTC
GACGACTTCG ACGTGTTCGT CGCGAAGGCC CGGTCCCTCG ACCTCGAGGT CGCGCTCGAC
CTCGCACTCC AGGCGGCGCC CGACCACCCG TGGGTGACCA CCCACCCGGA GTGGTTCACG
ACCCGCGCCG ACGGCACCAT CGCGTACGCC GAGAACCCGC CGAAGAAGTA CCAGGACATC
TACCCGATCA ACTTCGACAA CGACCCGACC GGCATCTGCC AGGAGGTGCT GCGGATCGTC
CGGCTGTGGA TGTCGCACGG CGTGCGCATC TTCCGGGTCG ACAACCCGCA CACCAAGCCG
GTGGCGTTCT GGGAGTGGCT GCTCAAGGAG ATCCGGCGCA CCGACCCCGA CGTGATCTTC
CTGGCCGAGG CGTTCACCCG GCCGGCGATG ATGCGCGGCC TCGGCGCGAT CGGCTTCCAC
CAGTCCTACA CGTACTTCAC CTGGCGCAAC GCGAAGTGGG AGCTCGAGGA GTACCTCCGC
GAGCTGTCCC GCGAGACCGA CCACCTGATG CGGCCGAACT TCTTCGTGAA CACGCCGGAC
ATCCTGCACG CCTACCTGCA GTACGGCGGG CCCGCGGCGT TCAAGATCCG CGCCGCGATT
GCCGCGACCG GGTCGCCCAG CTGGGGCGTC TACGCCGGCT ACGAGCTGTA CGAGCACGTC
GCGGTCCGCC CCGGCAGCGA GGAGTACCTC GACTCGGAGA AGTACCAGAT CCGGATCCGC
GACTGGGACG CCGCCGAGCG CGAGCACCGC ACGCTCGCGC CGTACCTGAC GCGCCTCAAC
GAGATCCGCC GCCGGCACCC GGCGCTCCAG CTGCTGCGCA ACGTGTCGAT CCACTGGAGC
GACGACGAGA ACATCCTGGT GTTCAGCAAG CGGCGGGCGC TCCCCGACGG CCCCGACGAT
GTCGTGATCG TGGTCGTCAA CGTCGACCCG CACGCAGCTC GCGAGACCAC GGTCCACCTC
GACCTGGGCG CACTCGGGCT GTCGCCGACC GACTCGTTCC TGGTGCACGA CGAGATCACC
GGCGCCGACT GGAGCTGGGG CGAGCACAAC TACGTGCGGC TCGACCCGTA CCACGAGCCG
GCGCACATCT TGAGCGTGAG GAGGCCCCGG TGA
 
Protein sequence
MVGRIPVMNV TPLVDLGRQP AKATVGEPFP VTATIFREGH DKLGAEVVLT GPDGRRRAPV 
RMTKHATVPD HYEAWVTPDT GGAWSFEVMA WSDPLATWQH NAALKVPAGV DVDLMFTEAR
ILLEKVIASL DPRDPPAADA AQVLQGALDT ATDTKRPPAA RLAVLEAPDV TGVLATHPIR
ELVTVEGPYP AYADRPKALT SSWYEFFPRS EGATKDLKTG KVTSGTFATA AKRLDAVAAM
GFDVIYLPPI HPIGEVNRKG PNNTVDPGPD AAPKDWAGSP WAIGSKDGGH DAIHPDLGTF
DDFDVFVAKA RSLDLEVALD LALQAAPDHP WVTTHPEWFT TRADGTIAYA ENPPKKYQDI
YPINFDNDPT GICQEVLRIV RLWMSHGVRI FRVDNPHTKP VAFWEWLLKE IRRTDPDVIF
LAEAFTRPAM MRGLGAIGFH QSYTYFTWRN AKWELEEYLR ELSRETDHLM RPNFFVNTPD
ILHAYLQYGG PAAFKIRAAI AATGSPSWGV YAGYELYEHV AVRPGSEEYL DSEKYQIRIR
DWDAAEREHR TLAPYLTRLN EIRRRHPALQ LLRNVSIHWS DDENILVFSK RRALPDGPDD
VVIVVVNVDP HAARETTVHL DLGALGLSPT DSFLVHDEIT GADWSWGEHN YVRLDPYHEP
AHILSVRRPR
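
The protein sequence can be cross-checked against the gene sequence by translating it codon by codon. The following is a minimal sketch, not the database's own method: it builds the standard codon table, whose amino-acid assignments translation table 11 shares (table 11 differs only in its permitted start codons, which this sketch does not model). The `translate` helper and the constant names are hypothetical.

```python
from itertools import product

# Standard genetic code in NCBI order (bases T, C, A, G); '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: aa
    for (a, b, c), aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string in frame 1, stopping at the first stop codon.

    Whitespace (such as the 10-base grouping used in this record) is ignored.
    """
    seq = "".join(dna.split()).upper()
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First 30 bases and final 33 bases of the gene sequence in this record.
print(translate("ATGGTCGGAC GCATCCCCGT CATGAACGTC"))      # MVGRIPVMNV
print(translate("GCGCACATCT TGAGCGTGAG GAGGCCCCGG TGA"))  # AHILSVRRPR
```

The two fragments reproduce the first and last ten residues of the protein sequence listed above.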