Gene Noca_0954 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNoca_0954 
Symbol 
ID4597487 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardioides sp. JS614 
KingdomBacteria 
Replicon accessionNC_008699 
Strand
Start bp1003517 
End bp1006012 
Gene Length2496 bp 
Protein Length831 aa 
Translation table11 
GC content75% 
IMG OID639775557 
Productglycoside hydrolase family protein 
Protein accessionYP_922164 
Protein GI119715199 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1501] Alpha-glucosidases, family 31 of glycosyl hydrolases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGTCGTGCG AGCATGCCGC CATGCGTGCC CGCTCCTTGC GCTCGACCGC TGCCGGACTC 
GCCCTCGTCG CCGGCCTGCT GGTGACCGGT GCGCCGGCCA CCTCGGCCGC CGACGGTCCG
GACGGCCGGA CGTCGGGCGG GTCGGCGCCG GACCGGGCCG CGCCCGACGG GTCCGACGGG
ATGCACGCCG TGGTGAGCGC CCCCATCGGC GGCGACTGGT CGGTGCGCTT CGTCGACCGC
GACGGCACCG TGTTGGCCAC GGTCGCGCGG GACGCGATCG CGCTGGTCAC GGCGGCCGGG
CGGGTGCCCG CCGACCACGT GGTGGCCGTG CACGACGACG ACGTCGTCGA GCTCGGCACG
GCCGATCCCG CGCTCACCGC GAGCGTGCGG GTCAGCCCTG CGGGCGACGG CGCGTACGCC
GTGGCGGTGG CCGGGTCGGG CAGCGGGATC ACTGCGGTGT CGATCGACTT CCGCGCGCCC
CGCGACGAGC GCTACCTCGG CCTGGGCGAG CGCTCGGACG CCGTGGACCA TCGCGGCCGT
GAGGTGCAGA ACCGGGTCCT GGACGGCCCC TACACGACCA GCCAGGCGCT GCTCGTGTCC
GGCTTCGTCC CGCAGCCCGG CTACAGCAGC CGCGCGGACG CGACGTACTT CCCGGTGCCG
TGGGTGCTCT CGACCGCCGG GTACGGCGTG CTCGTCGACA ACGACGAGGA CAGCTCCTTC
GAGCTCGCGA CGCGGCAGCA CCCGGACGTG AGCCGGCTGG TGGTCCGGTC CGACCGGCTC
GACCTGCGCG TCTTCTCGGG ACCGACGCCG GCGCGGGCCC TGGCCCGGAT GACCGGTGCG
GTCGGCCGGC AGCCCGCCCC GGCCAGCCCG ATGGTCTACG GCGCCTGGTG GCAGCCGGTC
GGTGATGCCG TGGCCGGGCT CGCGGAGCAG CGCGACCGCG ACGTCGCGAT CTCGGTCGCC
CAGACCTACG TCCACTACCT GCCCTGCGGG GCCCAGGACT CCGCGCGCGA GCGCGCCCTC
ACCGAGGCGC TGCACGGCCG GGGCGTGGGG GTGACGACGT ACTTCAACCC GATGGTCTGC
ACGAGCTACC AGCCGGTCTA CGACGAGGGC GTCGCCGCGG GCGCGTTCAC CCGCAACCCC
GACGGCTCAC CGCTGGTCTA CCGCTACTCG ACCGCGACGA ACTTCCGGGT CTCCCAGCTC
GACTTCTCCG CCGCGCCCGG CCGCGACCTC TTCCACCAGC TGCTCGACGA GGCGGTCGCG
GACGGCTACG ACGGCTGGAT GGAGGACTTC GGCGAGTACA CCCCCGCGAC CGCGGTGTCG
GCCGACGGCA CCCCGGGGCC GACGATGCAC AACCGGTACG TCGAGCAGTA CCACGCCGCC
GCCCGCGACT TCGAGACCCA GGCATCGCGG CCGCTGCTGC GGTTCAACCG CTCCGGGTGG
ACCGATGCCA TCAAGGAGTC CTCGATCGTG TGGGGCGGTG ACCCCACCAC CTCGTGGGAC
TTCGACGGCC TGTCCTCCTC GGTCCGCCAG GGCCTGACCA GCGGCACCTC CGGGCTGTCC
TTCTGGGGGC CGGACATCGG CGGCTTCTTC ACCCTGCCCG GCGACCCGAC GCTGACACCC
GAGCTGCTGG CCCGATGGAT CGAGTACGGC GCCTTCACCG GCGTGATGCG GCTGCAGTCC
GGAGGCATCT CGATCGGTGT CTCCGGCGAA CGGCCCATGG TCACCGACCC GACGGTTGCC
CCGGTGTGGA AGCGCTACAC CCGCCTGCGC ACGATGCTCT ACCCCTACAT CGCCGGCAGC
CAGGACGCCT ACCTGCAGCG GGGACTGCCG CTGATGCGGC ACCTCGCCCT GGTCCATCCC
GCTGACGGGC AGGCCGTGCG CGCGGACGAC GAGTACCTCT TCGGCCGCGA CCTGCTCGTG
GCCCCGGTGA CCAGTCCGGG CGCCAGCACG CGACCGGTGT ACCTCCCCCG CGGTCACTGG
ATCGAGCTCG CCCGTGCCTG GCGGCTGCGC GACGACGGGC GCTTCGGCCT GCGGCACGCC
GACGTGGTGC GCGGGCACCG CACCGTGACC GCCCGGGCAC CGCTCGGCAC CATCCCGCTG
TTCCTGCGCG CCGGCGCGGT CGTGCCGCTG CTGCCGCGCG GCGTCGACAC CCTCAGCGAC
TACGGCGACG GCATCGTCCG GCTGGCCGAC CGGGCCGCTC GGCGTACCCT CCTCGCCGCG
CCGCGGTCCG GGACCTGGCG GGGCGCCCTC GGCCCGGGCG AGACGCTGCG CTCGGAGGTC
ACCCGTGGGT CCTGGACGCT ACGGCTCGAC GCGGCGGACG CGCGGACGTA CGCCGTGCGG
GCGACGCTGG CGGGCCTCGA CCCCGCCTGG CGGCCGTGCG AGGTCCGGGC CGACGGCGCC
CGGGTGCGGT TCGAGTACGC GCCCGGCCGG CAGGTGCTGC GGTTCAGCGC CGGTCCCGCC
GCCGGAGGCG AGGTGCGGGT CAGCGCCTGC CGCTAG
 
Protein sequence
MSCEHAAMRA RSLRSTAAGL ALVAGLLVTG APATSAADGP DGRTSGGSAP DRAAPDGSDG 
MHAVVSAPIG GDWSVRFVDR DGTVLATVAR DAIALVTAAG RVPADHVVAV HDDDVVELGT
ADPALTASVR VSPAGDGAYA VAVAGSGSGI TAVSIDFRAP RDERYLGLGE RSDAVDHRGR
EVQNRVLDGP YTTSQALLVS GFVPQPGYSS RADATYFPVP WVLSTAGYGV LVDNDEDSSF
ELATRQHPDV SRLVVRSDRL DLRVFSGPTP ARALARMTGA VGRQPAPASP MVYGAWWQPV
GDAVAGLAEQ RDRDVAISVA QTYVHYLPCG AQDSARERAL TEALHGRGVG VTTYFNPMVC
TSYQPVYDEG VAAGAFTRNP DGSPLVYRYS TATNFRVSQL DFSAAPGRDL FHQLLDEAVA
DGYDGWMEDF GEYTPATAVS ADGTPGPTMH NRYVEQYHAA ARDFETQASR PLLRFNRSGW
TDAIKESSIV WGGDPTTSWD FDGLSSSVRQ GLTSGTSGLS FWGPDIGGFF TLPGDPTLTP
ELLARWIEYG AFTGVMRLQS GGISIGVSGE RPMVTDPTVA PVWKRYTRLR TMLYPYIAGS
QDAYLQRGLP LMRHLALVHP ADGQAVRADD EYLFGRDLLV APVTSPGAST RPVYLPRGHW
IELARAWRLR DDGRFGLRHA DVVRGHRTVT ARAPLGTIPL FLRAGAVVPL LPRGVDTLSD
YGDGIVRLAD RAARRTLLAA PRSGTWRGAL GPGETLRSEV TRGSWTLRLD AADARTYAVR
ATLAGLDPAW RPCEVRADGA RVRFEYAPGR QVLRFSAGPA AGGEVRVSAC R