Gene Information |
Locus tag | Noca_4874 |
Symbol | |
ID | 4595250 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardioides sp. JS614 |
Kingdom | Bacteria |
Replicon accession | NC_008697 |
Strand | + |
Start bp | 206357 |
End bp | 208153 |
Gene Length | 1797 bp |
Protein Length | 598 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 639772659 |
Product | glycoside hydrolase 15-related |
Protein accession | YP_919319 |
Protein GI | 119714177 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG3387] Glucoamylase and related glycosyl hydrolases |
TIGRFAM ID | |
|
|
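The length fields in this record are consistent with the coordinates. Assuming 1-based inclusive coordinates, gene length is End bp minus Start bp plus one, and for an unspliced CDS the protein length is the codon count minus one stop codon. A minimal sketch of that check follows; the function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming one terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Start bp and End bp as listed for Noca_4874.
length_bp = gene_length(206357, 208153)
print(length_bp, protein_length(length_bp))  # prints: 1797 598
```

Both values match the Gene Length (1797 bp) and Protein Length (598 aa) fields.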
Plasmid Coverage information |
Num covering plasmid clones | 26 |
Plasmid unclonability p-value | 0.0766872 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 21 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCTCCTG ACCTGCCAAT CGAGGACTAC GGCCTGCTCG GAGACACCCG GACCGCCGCG CTGGTGAGCT CGCACGGATC CATTGACTGG ATGTGCTTCC CGCGGTTCGA TAGCCAGCCC GTCTTCGGCC GACTCATCGG CGGACCCGCC GCGGGCAGCT ACCGCCTCGG TCCGGCGGGA CCGGCGACGC TGATCAGCCG CCGGTACCAC GACCACTCCG CCACCATCGA GACGACCTGG CGAACCACTG CAGGACGACT CACCCTCACC GAAGGCATGG TCGCCGAGGT CAGCGGCCAG CTCCTGCCCT CGACCATGCT CGTGCGACGC GTGACCGCCC CTGATGCCCC GGTCGATGCC GTCATCGAGA TCGACCTCCG CCTCGGTGAC GGGCACCAGC GCCCGCGCAC CCAGTTCCGT GGCAACGCCT TGGTCTGTGA CTGGCGCGGG CTCGCCACTG CCCTGACAAC CAGCTCGAAC ATGACCGTCG AACCCAGCGT CCCCCACATC GTGACCGTCA CCCCCGGTCG CCCTTTCACC GCCGTACTGA GCTTCGCGGA GCGAGAGCCC TTCGTCCACA TCGATCCGGA CGCCGCCTAC GACGTACTCG AACGTGACGA ACTGCGCTGG CAGGCATGGT CGCGCGACAT CGACCCCGAC CTACCCCACC GCGACGCCGT AGTCCGCAGC CTGCTCACCC TTCGCCTGCT GACCTACTCA CCCTCCGGCG CTCCGGTCGC TGCACCGACC GCCTCCCTTC CGGAAGACCT CGGCGGCGTG CGCAACTGGG ACTACCGGTA CGCGTGGCCG CGAGACGCCA GCATCGGCAT CGGCGCGTTC CTGGGAGTCG GCAAACATGA CGAGGCGCGT GCCTTCCTGG CCTGGCTGCT CAGCGGCACT CGCCTCGACC GTCCCCGACT ACCCGTGCTG CTCACCCTGC ACGGCAAAAC CCCGACGCAC GAACGGACAC TTCCGGGCTG GCCCGGATAC GCCAGCAGCG CACCCGTCCG GATCGGCAAC GCTGCCGCCG ACCAACACCA GCTCGACGGG TACGGCTGGG TGATCGACGC CGCGTGGCTG CTCACCCAGG CAGGGCACCG GCTCTACTCC GAGACCTGGC GCACCATGTC CGGATTCGCC GACACCGTCG CGAGCCGGTG GCGCGAACCG GACGCCGGAA TCTGGGAGGT GCGCACTGAT CCCGCCCACC ACGTGCACTC CAAGATGATG GCTTGGCTGG CTCTCGACCG CGCACTCCGT ATCGCCGCAA GCCACCGAAC GAGCGCGGTG CGACTCACTC GATGGGCCCT CGCCCGTGCG GCCCTGCACC AAGAGATCTC GCAGCGCGGC TTCAACCCCG ACACGAGCAC CTACACGCGC ACCTACGGAT CGGCCGATAC CGACGCCGCA CTGCTGATCC TTCCGCTGCT CAACTTCGAC CCACCAGACT CCCCACGTGT CCGCGGAACG ATCGACGCCA TCACCCGCGA CCTCGACGCC GGCACACCTC TCCTGTACCG ATACCCACCT GGACAAGACG GGCTTCCTGG TAAGGACGGC GCCTTCCTGC CGTGCTCATT CTGGCTCGTC CAAGCGCTCG CCCGAACCGG ACGACAAGAA GAGGCGGAGG AGCTTTTCCA AGAACTCCTG ACGCTGGCAA GCCCCCTTGG GCTATACGCA GAAGAGATGG ATCCCGTTAC GCGTCACCAC CTCGGCAACT ACCCCCAGTC CCTCACCCAC GCAGCCGTGG TTCAGGCGGC GCTCGCCCTC CGAGACGGCG CGGCCGGAAT CCCCTAG
|
Protein sequence | MSPDLPIEDY GLLGDTRTAA LVSSHGSIDW MCFPRFDSQP VFGRLIGGPA AGSYRLGPAG PATLISRRYH DHSATIETTW RTTAGRLTLT EGMVAEVSGQ LLPSTMLVRR VTAPDAPVDA VIEIDLRLGD GHQRPRTQFR GNALVCDWRG LATALTTSSN MTVEPSVPHI VTVTPGRPFT AVLSFAEREP FVHIDPDAAY DVLERDELRW QAWSRDIDPD LPHRDAVVRS LLTLRLLTYS PSGAPVAAPT ASLPEDLGGV RNWDYRYAWP RDASIGIGAF LGVGKHDEAR AFLAWLLSGT RLDRPRLPVL LTLHGKTPTH ERTLPGWPGY ASSAPVRIGN AAADQHQLDG YGWVIDAAWL LTQAGHRLYS ETWRTMSGFA DTVASRWREP DAGIWEVRTD PAHHVHSKMM AWLALDRALR IAASHRTSAV RLTRWALARA ALHQEISQRG FNPDTSTYTR TYGSADTDAA LLILPLLNFD PPDSPRVRGT IDAITRDLDA GTPLLYRYPP GQDGLPGKDG AFLPCSFWLV QALARTGRQE EAEELFQELL TLASPLGLYA EEMDPVTRHH LGNYPQSLTH AAVVQAALAL RDGAAGIP
|
| |
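The sequences in this record are displayed in blocks of ten characters separated by spaces, so they need to be joined before any analysis. The sketch below removes the spacing and computes GC percentage. The helper names are hypothetical, and the fragment is only the first three blocks of the gene sequence, used for illustration.

```python
def clean_sequence(raw: str) -> str:
    """Strip the whitespace used for 10-character display blocks; uppercase."""
    return "".join(raw.split()).upper()


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in a DNA string."""
    if not dna:
        raise ValueError("empty sequence")
    gc = sum(1 for base in dna if base in "GC")
    return 100.0 * gc / len(dna)


# First three display blocks of the gene sequence (illustration only).
fragment = clean_sequence("ATGTCTCCTG ACCTGCCAAT CGAGGACTAC")
print(len(fragment), round(gc_percent(fragment), 1))
```

Applied to the full gene sequence, the resulting length and GC percentage can be compared against the Gene Length and GC content fields. The record does not state how the GC value was rounded.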