Gene Noca_0197 details


Gene Information

Locus tag: Noca_0197
Symbol:
ID: 4599015
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nocardioides sp. JS614
Kingdom: Bacteria
Replicon accession: NC_008699
Strand:
Start bp: 209959
End bp: 212736
Gene Length: 2778 bp
Protein Length: 925 aa
Translation table: 11
GC content: 71%
IMG OID: 639774810
Product: alpha amylase, catalytic region
Protein accession: YP_921429
Protein GI: 119714464
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG0366] Glycosidases
TIGRFAM ID:
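
The gene and protein lengths listed above follow from the start and end coordinates. The sketch below reproduces that arithmetic, assuming the coordinates are 1-based and inclusive and that the gene length includes the terminal stop codon. The function names are hypothetical and are not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose length includes one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # subtract the stop codon


length = gene_length_bp(209959, 212736)
print(length)                     # 2778
print(protein_length_aa(length))  # 925
```

Both values agree with the Gene Length and Protein Length fields of this record.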


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCGCCAAC TGATTTCCGC ACCGCTCGTC CTCGCGGGGG CCATGACCCT CGGCGTCACC 
CTCGCGGCCC CGGCCGGCGC CGCGGCTCCT GCACGAGCGG ACGGGCCGGG CCACCGCCCC
TCGGCGTACG CCCAGCACGC CCCGCGCCCC GGTGTCACCG ACGAGGACTT CTACTTCGTC
ATGGGCGACC GGTTCGAGAA CGGCGACCCG GACAACGATC GCGGCGGCCT CACCGGCGAC
CGGCTCACCA CCGGCTTCGA CCCGGCCGCC AAGGGGTTCT ACAACGGCGG GGACCTGGAC
GGGCTGCGCT CCCGGCTGGA CTACATCCAG GGCCTCGGCA CCGACTCGAT CTGGCTGACG
CCGATCTTCA AGAACAAGCC GGTCCAGCTC GAGGACGGAC CGTCGGCCGG CTATCACGGC
TACTGGATCA CCGACTTCAC CACCGTCGAC CCGCACCTGG GCACCAACCA GGACCTCGCC
GACCTCGTGG ACGCCGCCCA CGACCGGGGG ATGAAGGTCT ACTTCGACAT CATCACCAAC
CACACGGCCG ACGTGATCGG CTACGAGGAG GGCGACCGCA CGGCGTACGT CTCCAAGGAC
GCGGTGCCCT ACCGCGACGC CGCCGGCAAC CCCTTCGACG ACCGCGACTA CGCCGGCACC
TCGCAGTTCC CGGTGCTGGA CCCGGCCACG TCCTTCCCGT ACACCCCGGT GCTCGACGCG
GGCGAGCAGG ACCTCAAGGT GCCGGCCTGG CTCAATGACC TGACGAACTA CCACAACCGC
GGCAACACGA CGTTCTCCGG CGAGGACAGC CAGTACGGCG ACTTCTTCGG CCTCGACGAC
CTGTTCACCG AGAAGCCGGC GGTGGTCGAC GGGATGATCG ACATCTACCG CACCTGGGTC
GAGGACTTCG ACATCGACGG CTTCCGGATC GACACGATGA AGCACGTCGA CGACGCGTTC
TGGCAGCGGT TCGGTCCGAC CATGCAGCAG CTCGCCGCGG CCGACGGCAA CCCCGGGTTC
TTCATGTTCG GAGAGGTCGC CCTCGACGGC AGCGACGCCG CCGCCAAGGC GTTCACCTCG
CGCTACACCA CCACCGATCG GATGCAGGCG GTGCTGGACT TCCCGTTCCA GGACGCCGCC
CGGGGGTTCG CGTCGAAGAG CCTCGACAGC GACAGGCTGG CGCGGTTCTT CCGCAACGAC
GACTGGTACA CGGACGCGGA CTCGAACGCC TACTCGCTGC CGACCTTCCT CGGCAATCAC
GACATGGGCC GGATCGGGCA CTTCCTGGAG GCCGACAACC CCGGCGCCGA CGACGCCGAA
CTGGTGGCGC GCGACCGGCT GGCCCACGAG CTGATGTACC TCTCCCGGGG CAACCCGGTC
GTCTACTACG GCGACGAGCA GGGCTTCACC GGCGACGGCG GGGACCAGCT CGCCCGGCAG
ACGATGTTCG CCAGCCAGGT GCCGGAGTAC CGGGACGACG ACCAGATCGG CACCGACCGC
ACCGGCGCCG ACGACAACTT CGTCACCGAC GCCCCGCTGT ACCGAGCGAT CCGGGACCTC
GCCGACCTGA CCGAGCGGGA CCCGGCCCTG CGCGACGGGG CGCAGCAGGT CCGCTACTCC
TCGCCGGGCG CCGGGCTGTT CGCGTTCTCG CGGGTCGACC GCGAGCATCG GGTCGAGTAC
GTCGTGGCGC TGAACAACAG CGAGCAGGCC GACTCGGGGC GGGTGCCGAC GTACCTCAAG
AAGGGCGAGT TCCGGAAGGT GTACGGGCCG GGGCCGGCCC GCCTGGTGAG CCGCGCGGAC
CGGGGCCTGG ACGTCGCGCT CCCGGCGCTG TCGACCTCCG TCTACCGCTC GGTCGACCGG
ATCCCCCGCG CCGAGCGCGC TCCGCAGATC TCGGTGGCCC GGCCGGTGCC GTCGTCGATC
GCGCACGGCC GGATGCACGT GCGGGCCAAG GTGTCGGGCT CCTCGTTCTA CGAGGTGACC
TTCCAACGCC GCATCGGCGA CGGTGCCTGG CAGACGATCG GGGTCGACGA CAACGCGCCG
TACCAGGTGT TCGACGACGT CGCCGACCAG ACCCCGGGGA CGGCCGTGAG CTATCGGGCC
GCCGTGCTCG ACAACGCCGG GCACACCCGC GGGTCCCGGG CCCGCACCAC CGCGGTGCCA
CGGACCTCCG TGGTGGTGAC GGGCCCGGCC GACGGCGGCA CGGTCAGCGT GGTCGACCCG
GTGACCGTGA CCGCCTCGGT CGACCCGGAG CGGCCGCTGC AGTCGGTGGC GTTCGAGCGC
AGCGTCGGCG ATGGGCCCTG GACGAGCCTG GGCACCGACA GCTCCGCTGC GGCCTACACC
GTGACCGACG ACGTGTCGGA CCTGCCGCTC GGCACCGAGG TGCGCTACCG GGCCACGCTC
AGCGAGGCCG GCGTCGTGCA GGGCACGAGC GCCCCGGTCA GGGTGACCAC CGCCGAGCCC
GCGCCGGCGG TCGACTCGGT GACGGTCGCC GGCAGCCTGC AGTCCGAGCT CGGCTGCCCC
GAGGACTGGC AGCCCGGCTG CGCTGCCACG CACCTGACCT TCGATCGCAC CGACGGGCGC
TGGCACGGCA CCTTCACCGT GCCCGCCGGC GACTACGAGT GGAAGGTCGC GATCAACGAC
TCCTGGGACG TCAACTACGG TGCGGGTGGC GCCGCCGGCG GCAGCAACCT CGCCCTCCCG
GTCCCGGCCG GTGGCGCGAC GTACGTGTTC ACCTGGGACC AGGTCAGCCA CGAGCCCTCG
GTCGCACCAG CCGGCTGA
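
The gene sequence above is laid out in space-separated blocks of ten bases. A minimal sketch for joining those blocks into one string and computing a GC percentage is shown here; the helper names are hypothetical, and the short strings in the usage lines are illustrative rather than the full sequence.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide string."""
    seq = clean_sequence(seq)
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# Illustrative usage on the first three blocks of the gene sequence.
first_blocks = "ATGCGCCAAC TGATTTCCGC ACCGCTCGTC"
print(len(clean_sequence(first_blocks)))  # 30
print(gc_percent("ATGC"))                 # 50.0
```

Applying `gc_percent` to the full gene sequence would give a value to compare against the GC content field of this record.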
 
Protein sequence
MRQLISAPLV LAGAMTLGVT LAAPAGAAAP ARADGPGHRP SAYAQHAPRP GVTDEDFYFV 
MGDRFENGDP DNDRGGLTGD RLTTGFDPAA KGFYNGGDLD GLRSRLDYIQ GLGTDSIWLT
PIFKNKPVQL EDGPSAGYHG YWITDFTTVD PHLGTNQDLA DLVDAAHDRG MKVYFDIITN
HTADVIGYEE GDRTAYVSKD AVPYRDAAGN PFDDRDYAGT SQFPVLDPAT SFPYTPVLDA
GEQDLKVPAW LNDLTNYHNR GNTTFSGEDS QYGDFFGLDD LFTEKPAVVD GMIDIYRTWV
EDFDIDGFRI DTMKHVDDAF WQRFGPTMQQ LAAADGNPGF FMFGEVALDG SDAAAKAFTS
RYTTTDRMQA VLDFPFQDAA RGFASKSLDS DRLARFFRND DWYTDADSNA YSLPTFLGNH
DMGRIGHFLE ADNPGADDAE LVARDRLAHE LMYLSRGNPV VYYGDEQGFT GDGGDQLARQ
TMFASQVPEY RDDDQIGTDR TGADDNFVTD APLYRAIRDL ADLTERDPAL RDGAQQVRYS
SPGAGLFAFS RVDREHRVEY VVALNNSEQA DSGRVPTYLK KGEFRKVYGP GPARLVSRAD
RGLDVALPAL STSVYRSVDR IPRAERAPQI SVARPVPSSI AHGRMHVRAK VSGSSFYEVT
FQRRIGDGAW QTIGVDDNAP YQVFDDVADQ TPGTAVSYRA AVLDNAGHTR GSRARTTAVP
RTSVVVTGPA DGGTVSVVDP VTVTASVDPE RPLQSVAFER SVGDGPWTSL GTDSSAAAYT
VTDDVSDLPL GTEVRYRATL SEAGVVQGTS APVRVTTAEP APAVDSVTVA GSLQSELGCP
EDWQPGCAAT HLTFDRTDGR WHGTFTVPAG DYEWKVAIND SWDVNYGAGG AAGGSNLALP
VPAGGATYVF TWDQVSHEPS VAPAG
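
The protein sequence can be checked against the gene sequence by translating it codon by codon. The sketch below assumes the standard codon assignments, which translation table 11 shares for all 64 codons (table 11 differs in which codons may act as start codons). It does not handle alternative start codons such as GTG or TTG, which is acceptable here because this gene begins with ATG. All names are hypothetical.

```python
from itertools import product

# Standard codon assignments in TCAG order; "*" marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop block spacing and line breaks
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE.get(seq[i:i + 3], "X")  # "X" for unrecognized codons
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases and last 18 bases of the gene sequence in this record.
print(translate("ATGCGCCAAC TGATTTCCGC ACCGCTCGTC"))  # MRQLISAPLV
print(translate("GTCGCACCAG CCGGCTGA"))               # VAPAG
```

The two outputs match the first ten and last five residues of the protein sequence listed above.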