Gene Noca_4631 details

Gene Information

Locus tag: Noca_4631
Symbol:
ID: 4596087
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nocardioides sp. JS614
Kingdom: Bacteria
Replicon accession: NC_008699
Strand:
Start bp: 4911500
End bp: 4912660
Gene Length: 1161 bp
Protein Length: 386 aa
Translation table: 11
GC content: 74%
IMG OID: 639779240
Product: glycoside hydrolase 15-related
Protein accession: YP_925813
Protein GI: 119718848
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG3387] Glucoamylase and related glycosyl hydrolases
TIGRFAM ID:
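
The length fields above can be cross-checked against the coordinates with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the protein length excludes the stop codon; both assumptions are consistent with the values in this record but are not stated by it. The variable names are illustrative only.

```python
# Coordinates and lengths copied from the Gene Information fields above.
start_bp = 4911500
end_bp = 4912660

# Assumption: coordinates are 1-based and inclusive, so add 1.
gene_length = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not counted
# in the reported protein length.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(f"Gene length: {gene_length} bp")
print(f"Protein length: {protein_length} aa")
```

Under these assumptions the computed values agree with the reported Gene Length (1161 bp) and Protein Length (386 aa).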


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGTCCTCCC TGCACCCCGA CCCGACCACG CTCGACCTGA GTACCGAGGA GCGCCGCCGA 
TTCGCCGACC TCGCGACGGC GAGCCACCGG ATCATCACCG GCCACCAGGA CCCCGGTGGC
GCCTACCCCG CCAGCCCGGG CTTCTCGGCG TACGCCGGCT ACGCCTGGCT GCGCGACGGC
GCGTTCACTG CCGAGGGCAT CTCCCGGTAC GGCGACGCCG ACTCGGCCGG CCGGTTCCAC
GACTGGGCCG CGCGCACCCT GGCGCGGCGG CGCGAGCAGG TGGACGGCCT CCTGGCGATC
CTGGCTGAGG GCCGGAGCCC GGCGCGCGTG GCGATGCTGC CCACGCGGTT CACGTTCGCC
GGCGAAGACG GAACCGACGA CTGGTGGGAC TTCCAGACCG ACGGCTACGG CACCTGGCTG
TGGGCGGTGG TCGCGTTCGC CCGCCGGCAC GGCCAGGGCC TGGACCGGTG GCGCAGCGGC
GTCGAGGTCG CCGTCGACTA CCTGACCGGC TTCTGGTCCT CACCGTGTTA CGACTGGTGG
GAGGAGCACG CCGAGCACCG GCATGTCTCG ACCCTCGCCG CCATCCACGG CGGCCTCCGG
TCGGTTCTCG GGGCCGATGC GTTGGACCCT GCGCGGGCCG ACGCCGCGGC GTCCGCGATC
GCCGAGATCC GCGAGCTGGT CCGCCGGCGG GGTGTTGCGG ACGGCCATCT CACCAAGTGG
CTGGGAACCG ACGCGGTCGA TGCCTCGCTG GCGTCCGCGG TCGTGCCGTT CGGTCTGGTC
TCCGACCACG ACCCGCTCGC GGCCGGAACG CTTCGGGCGG TCGCTGAACA GCTCGACAAC
GGTGGCGGGG TCCACCGGTT CCGCGACGAC GTCTTCTACG GCGGTGGCCA GTGGCTGCTC
CTCTCGGCCC TGCTCGGCTG GAACCTCGCC GAGCGCGGCG AGACCGATGC CGCGCTCCGC
TACCTGCGCT GGGTCGCCGG CCAGGCAACC GCGGCCGGCG AGCTCCCCGA GCAGGTGTCC
GGCCACCTCC TGCACCCCGG GCACCGTCAG GAGTGGATCG ACCGCTGGGG GCCGGTAGCG
ACCCCGCTGC TGTGGTCGCA CGGCATGTAC CTGATCCTCG CCGACACCCT CGGCCTGCTG
CGCGAGGAGG GCGACCGGTG A
 
Protein sequence
MSSLHPDPTT LDLSTEERRR FADLATASHR IITGHQDPGG AYPASPGFSA YAGYAWLRDG 
AFTAEGISRY GDADSAGRFH DWAARTLARR REQVDGLLAI LAEGRSPARV AMLPTRFTFA
GEDGTDDWWD FQTDGYGTWL WAVVAFARRH GQGLDRWRSG VEVAVDYLTG FWSSPCYDWW
EEHAEHRHVS TLAAIHGGLR SVLGADALDP ARADAAASAI AEIRELVRRR GVADGHLTKW
LGTDAVDASL ASAVVPFGLV SDHDPLAAGT LRAVAEQLDN GGGVHRFRDD VFYGGGQWLL
LSALLGWNLA ERGETDAALR YLRWVAGQAT AAGELPEQVS GHLLHPGHRQ EWIDRWGPVA
TPLLWSHGMY LILADTLGLL REEGDR
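
The relationship between the gene sequence and the protein sequence can be sketched in a few lines of Python. This is a minimal illustration, not the pipeline that produced this record: the helper names (`clean`, `gc_percent`, `translate`) are hypothetical, and the codon table uses the standard amino acid assignments, which translation table 11 shares (table 11 differs from the standard code in its permitted start codons, which this sketch does not model). Only the first and last lines of the gene sequence are used, to keep the example short.

```python
from itertools import product

# Standard codon assignments in TCAG order. Translation table 11 uses
# the same amino acid assignments; alternative start codons are ignored here.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display and uppercase."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    s = clean(seq)
    return 100.0 * (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    s = clean(seq)
    protein = []
    for i in range(0, len(s) - len(s) % 3, 3):
        aa = CODON_TABLE.get(s[i:i + 3], "X")  # X marks an unrecognized codon
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First and last lines of the gene sequence shown above.
first_line = "ATGTCCTCCC TGCACCCCGA CCCGACCACG CTCGACCTGA GTACCGAGGA GCGCCGCCGA"
last_line = "CGCGAGGAGG GCGACCGGTG A"

print(translate(first_line))  # MSSLHPDPTTLDLSTEERRR
print(translate(last_line))   # REEGDR
print(round(gc_percent(first_line), 1))
```

The two translated fragments match the start and end of the protein sequence listed above. Applying `gc_percent` to the full gene sequence would be the way to compare against the reported GC content.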