Gene Noca_4874 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNoca_4874 
Symbol 
ID4595250 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardioides sp. JS614 
KingdomBacteria 
Replicon accessionNC_008697 
Strand
Start bp206357 
End bp208153 
Gene Length1797 bp 
Protein Length598 aa 
Translation table11 
GC content68% 
IMG OID639772659 
Productglycoside hydrolase 15-related 
Protein accessionYP_919319 
Protein GI119714177 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG3387] Glucoamylase and related glycosyl hydrolases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones26 
Plasmid unclonability p-value0.0766872 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCTCCTG ACCTGCCAAT CGAGGACTAC GGCCTGCTCG GAGACACCCG GACCGCCGCG 
CTGGTGAGCT CGCACGGATC CATTGACTGG ATGTGCTTCC CGCGGTTCGA TAGCCAGCCC
GTCTTCGGCC GACTCATCGG CGGACCCGCC GCGGGCAGCT ACCGCCTCGG TCCGGCGGGA
CCGGCGACGC TGATCAGCCG CCGGTACCAC GACCACTCCG CCACCATCGA GACGACCTGG
CGAACCACTG CAGGACGACT CACCCTCACC GAAGGCATGG TCGCCGAGGT CAGCGGCCAG
CTCCTGCCCT CGACCATGCT CGTGCGACGC GTGACCGCCC CTGATGCCCC GGTCGATGCC
GTCATCGAGA TCGACCTCCG CCTCGGTGAC GGGCACCAGC GCCCGCGCAC CCAGTTCCGT
GGCAACGCCT TGGTCTGTGA CTGGCGCGGG CTCGCCACTG CCCTGACAAC CAGCTCGAAC
ATGACCGTCG AACCCAGCGT CCCCCACATC GTGACCGTCA CCCCCGGTCG CCCTTTCACC
GCCGTACTGA GCTTCGCGGA GCGAGAGCCC TTCGTCCACA TCGATCCGGA CGCCGCCTAC
GACGTACTCG AACGTGACGA ACTGCGCTGG CAGGCATGGT CGCGCGACAT CGACCCCGAC
CTACCCCACC GCGACGCCGT AGTCCGCAGC CTGCTCACCC TTCGCCTGCT GACCTACTCA
CCCTCCGGCG CTCCGGTCGC TGCACCGACC GCCTCCCTTC CGGAAGACCT CGGCGGCGTG
CGCAACTGGG ACTACCGGTA CGCGTGGCCG CGAGACGCCA GCATCGGCAT CGGCGCGTTC
CTGGGAGTCG GCAAACATGA CGAGGCGCGT GCCTTCCTGG CCTGGCTGCT CAGCGGCACT
CGCCTCGACC GTCCCCGACT ACCCGTGCTG CTCACCCTGC ACGGCAAAAC CCCGACGCAC
GAACGGACAC TTCCGGGCTG GCCCGGATAC GCCAGCAGCG CACCCGTCCG GATCGGCAAC
GCTGCCGCCG ACCAACACCA GCTCGACGGG TACGGCTGGG TGATCGACGC CGCGTGGCTG
CTCACCCAGG CAGGGCACCG GCTCTACTCC GAGACCTGGC GCACCATGTC CGGATTCGCC
GACACCGTCG CGAGCCGGTG GCGCGAACCG GACGCCGGAA TCTGGGAGGT GCGCACTGAT
CCCGCCCACC ACGTGCACTC CAAGATGATG GCTTGGCTGG CTCTCGACCG CGCACTCCGT
ATCGCCGCAA GCCACCGAAC GAGCGCGGTG CGACTCACTC GATGGGCCCT CGCCCGTGCG
GCCCTGCACC AAGAGATCTC GCAGCGCGGC TTCAACCCCG ACACGAGCAC CTACACGCGC
ACCTACGGAT CGGCCGATAC CGACGCCGCA CTGCTGATCC TTCCGCTGCT CAACTTCGAC
CCACCAGACT CCCCACGTGT CCGCGGAACG ATCGACGCCA TCACCCGCGA CCTCGACGCC
GGCACACCTC TCCTGTACCG ATACCCACCT GGACAAGACG GGCTTCCTGG TAAGGACGGC
GCCTTCCTGC CGTGCTCATT CTGGCTCGTC CAAGCGCTCG CCCGAACCGG ACGACAAGAA
GAGGCGGAGG AGCTTTTCCA AGAACTCCTG ACGCTGGCAA GCCCCCTTGG GCTATACGCA
GAAGAGATGG ATCCCGTTAC GCGTCACCAC CTCGGCAACT ACCCCCAGTC CCTCACCCAC
GCAGCCGTGG TTCAGGCGGC GCTCGCCCTC CGAGACGGCG CGGCCGGAAT CCCCTAG
 
Protein sequence
MSPDLPIEDY GLLGDTRTAA LVSSHGSIDW MCFPRFDSQP VFGRLIGGPA AGSYRLGPAG 
PATLISRRYH DHSATIETTW RTTAGRLTLT EGMVAEVSGQ LLPSTMLVRR VTAPDAPVDA
VIEIDLRLGD GHQRPRTQFR GNALVCDWRG LATALTTSSN MTVEPSVPHI VTVTPGRPFT
AVLSFAEREP FVHIDPDAAY DVLERDELRW QAWSRDIDPD LPHRDAVVRS LLTLRLLTYS
PSGAPVAAPT ASLPEDLGGV RNWDYRYAWP RDASIGIGAF LGVGKHDEAR AFLAWLLSGT
RLDRPRLPVL LTLHGKTPTH ERTLPGWPGY ASSAPVRIGN AAADQHQLDG YGWVIDAAWL
LTQAGHRLYS ETWRTMSGFA DTVASRWREP DAGIWEVRTD PAHHVHSKMM AWLALDRALR
IAASHRTSAV RLTRWALARA ALHQEISQRG FNPDTSTYTR TYGSADTDAA LLILPLLNFD
PPDSPRVRGT IDAITRDLDA GTPLLYRYPP GQDGLPGKDG AFLPCSFWLV QALARTGRQE
EAEELFQELL TLASPLGLYA EEMDPVTRHH LGNYPQSLTH AAVVQAALAL RDGAAGIP