Gene Noca_0267 details


Gene Information

Locus tag: Noca_0267
Symbol:
ID: 4596151
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nocardioides sp. JS614
Kingdom: Bacteria
Replicon accession: NC_008699
Strand:
Start bp: 285767
End bp: 287506
Gene Length: 1740 bp
Protein Length: 579 aa
Translation table: 11
GC content: 73%
IMG OID: 639774880
Product: malto-oligosyltrehalose trehalohydrolase
Protein accession: YP_921499
Protein GI: 119714534
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG0296] 1,4-alpha-glucan branching enzyme
TIGRFAM ID: [TIGR02402] malto-oligosyltrehalose trehalohydrolase
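
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends with a single stop codon; the function names `span_length` and `coding_codons` are hypothetical helpers, not part of any database tool.

```python
# Values copied from the Gene Information fields above.
START_BP = 285767
END_BP = 287506
GENE_LENGTH_BP = 1740
PROTEIN_LENGTH_AA = 579


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def coding_codons(gene_length_bp: int) -> int:
    """Number of amino-acid codons, assuming one terminal stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


if __name__ == "__main__":
    # Both comparisons should print True if the record is self-consistent.
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(coding_codons(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions the listed gene length (1740 bp) and protein length (579 aa) agree with the start and end coordinates.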


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value: 0.640878
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGACCCACG GACCGTTCGA CGTCTGGGCG CCGCTGCCCC GCCGGTTGCG CCTGGCCGTC 
GCCGGCGAGA CCGTCGCGAT GCGCCGCGGC GAGGGCGACT GGTGGACCCC CGCGGGGCCG
GTCCCTGCCG GCGCAGAGCT CGACTACGGC TACCTCGTGG ACGACGGCGA GGCACCGCGG
CCCGATCCGC GCTCGCGCCG CCAGCCGGGT GGGGTCCACC AGCTGTCGCG GACCTTCGAC
CCCGCGTCGT ACCCCTGGAC GGACCAGGCC TGGACCGGCC GCCAGCTGGC CGGCTCGGTG
GTCTACGAGC TGCACGTCGG CACGTTCACG CCCGAGGGCA CCTTCGACGC CGCGCTCGAG
AAGCTCCCTC ACCTGCGCGA GATCGGCGTC GACCTGGTCG AGCTGATGCC GGTCAACGCC
TTCAACGGCA CCCACAACTG GGGCTATGAC GGCGTCGGCT GGTTCGCCGT CCACGAGGGG
TACGGCGGGC CCGCGGGCTA CCAGCGCTTC GTCGACGGCT GCCACGCGGC CGGGCTGGGC
GTCATCCAGG ACGTGGTCTA CAACCACCTC GGTCCGTCCG GGAACTACCT GCCGGAGCTC
GGCCCGTACC TCAAGCAGGG GGCGAACACC TGGGGCGACT TCTTGAACCT CGACGGCCCG
GGCTCGGACG AGGTGCGGCG CTACATCCTC GACAACGTGC GGATGTGGCT CGCGGACTAC
CACGTCGACG GCCTGCGGCT CGATGCCGTG CACGCGCTCA ACGACGCCGG CCACCAGCAC
CTGCTCGAGG AGCTGGCGGC CGAGGTCGCG GCCCTGTCCG CCCACCAGCG GCGGCCGTTG
ACGCTGATCG CCGAGTCCGA CCTCAACGAC ACCCGGCTGG TCCGCCCGCG CGAGGCCGGC
GGCTACGGCC TGGACGCGCA GTGGAGCGAC GACTTCCACC ACGCCGTGCA CGTCGCCCTG
ACCGGCGAGA CCGCCGGCTA CTACGCCGAC TTCGAGCCGC TCTCGGCGCT GGCGAAGGTC
TGCGAGCGCG GGTTCTTCCA CGACGGCACC TGGTCGTCGT TCCGCGGCCG CGACCACGGC
TTCCCCGTCG ACCTGCGCAC GATGCCGACC TGGCGGCTGG TGGTCTGCAG CCAGAACCAC
GACCAGATCG GCAACCGCGC CCGGGGCGAC CGGATCACCG AGGTGCTCGA CGACGACCAG
CTCGCCTGCG CCGCGCTGCT GACCCTGTGC GGACCGTTCA CGCCGATGCT GTTCATGGGC
GAGGAGTGGG CGGCCTCGAC GCCGTTCCAG TTCTTCACCT CCCACCCCGA GCCCGACCTC
GGGACGGCGA CCGCCGAGGG CCGGATCGCC GAGTTCGAGC GGATGGGCTG GGACCCCGCC
GTCGTACCCG ACCCCCAGGA CCCGGCAACG TTCGAGCGCT CGAAGCTCGA CTGGGCGGAG
GCGACCCGGG GCCGGCACGC CCGGATGCTC GACGTCTACC GCCGGTTGGC GCGGCTGCGC
CGCGAGCACG AGGCGCTCAC GGACCCGTCG TTCGGCTCGG TGCGGTGCAC GGCCGACGAG
CACACCCGGG TCTTCACGAT GCGGCGCGGC GACCTGCTGG TCGCGGTGAA CTTCGGCGAC
GCCGCGGCGA CGGTGGAGAC CGCCGCCGCG ACGCTGCTGT TCCAGACCGG CGCCGGCCTC
ACGCTCGACG CCGGCCGGCT GTCGCTGCCC CCGCACGCGG GAGCCCTGGT CGGACCCTGA
 
Protein sequence
MTHGPFDVWA PLPRRLRLAV AGETVAMRRG EGDWWTPAGP VPAGAELDYG YLVDDGEAPR 
PDPRSRRQPG GVHQLSRTFD PASYPWTDQA WTGRQLAGSV VYELHVGTFT PEGTFDAALE
KLPHLREIGV DLVELMPVNA FNGTHNWGYD GVGWFAVHEG YGGPAGYQRF VDGCHAAGLG
VIQDVVYNHL GPSGNYLPEL GPYLKQGANT WGDFLNLDGP GSDEVRRYIL DNVRMWLADY
HVDGLRLDAV HALNDAGHQH LLEELAAEVA ALSAHQRRPL TLIAESDLND TRLVRPREAG
GYGLDAQWSD DFHHAVHVAL TGETAGYYAD FEPLSALAKV CERGFFHDGT WSSFRGRDHG
FPVDLRTMPT WRLVVCSQNH DQIGNRARGD RITEVLDDDQ LACAALLTLC GPFTPMLFMG
EEWAASTPFQ FFTSHPEPDL GTATAEGRIA EFERMGWDPA VVPDPQDPAT FERSKLDWAE
ATRGRHARML DVYRRLARLR REHEALTDPS FGSVRCTADE HTRVFTMRRG DLLVAVNFGD
AAATVETAAA TLLFQTGAGL TLDAGRLSLP PHAGALVGP
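
The gene and protein sequences above are printed in blocks of ten characters, so the whitespace has to be removed before they are processed. The sketch below is one possible way to compute the GC fraction and to translate the coding sequence. It assumes the codon-to-amino-acid assignments of translation table 11, which match the standard table, and it does not handle alternative start codons or ambiguous bases. The names `clean`, `gc_fraction`, and `translate` are hypothetical helpers.

```python
BASES = "TCAG"
# Amino acids for all 64 codons, listed in T, C, A, G order at each position.
# Translation table 11 assigns the same amino acids as the standard table.
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used in the blocked layout."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    s = clean(seq)
    if not s:
        raise ValueError("empty sequence")
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon.

    Raises KeyError on codons containing characters other than A, C, G, T.
    """
    s = clean(seq)
    protein = []
    for pos in range(0, len(s) - 2, 3):
        amino_acid = CODON_TABLE[s[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


if __name__ == "__main__":
    # First three blocks of the gene sequence listed above.
    first_blocks = "ATGACCCACG GACCGTTCGA CGTCTGGGCG"
    print(translate(first_blocks))  # MTHGPFDVWA
    print(round(gc_fraction(first_blocks), 2))
```

Pasting the full gene sequence into `translate` and comparing the result with the cleaned protein sequence is a quick consistency check on the record.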