Gene Noca_1303 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNoca_1303 
Symbol 
ID4598925 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardioides sp. JS614 
KingdomBacteria 
Replicon accessionNC_008699 
Strand
Start bp1377044 
End bp1378342 
Gene Length1299 bp 
Protein Length432 aa 
Translation table11 
GC content68% 
IMG OID639775897 
Productextracellular solute-binding protein 
Protein accessionYP_922504 
Protein GI119715539 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1653] ABC-type sugar transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.313179 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAATCGCA CGCGACGCGC GCTGAGCGCG CTGGTTGCGG CTGGTGCCCT CGTGCTCCTG 
GCTGCTTGCG GGGGAAGCGA CAGCGAAGGC AACAACGACA CCTCGTCCAA CGGCGACGCC
GCCAGTGGCG AGGTCGAGGT CTTCACCTGG TGGGCCGAGG GCAGCGAGAA GGCCGGCCTC
GACGCGCTGG TGGCGGTCTT CAACGACCAG TACCCGGACA TCAAGTTCGT CAACGGTGCG
GTCGCCGGCG GCGCCGGTAG CGACGCGAAG AACGTGTTGG CCTCCCGGCT GCAGACCAAC
GACCCGCCCG GAACCTTCCA GGCGCACGCG GGCGCGGAGC TGACCGACTA CATCAACAAC
GGGCAGATCG AGGACCTGAC CCAGATGTAC GAGGACAACG GCTGGAACGA CTACTTCCCG
CAGACCCTGC TCGACCGGCT GGAGCAGGAC GGCAAGATCT ACTCCGTCCC GTCCAACATC
CACCGGGCGA ACGTGGTCTG GGCCAACCCG GCCGTGCTGA AGGACGCCGG CGTCGACCCC
GAGGCGACGT ACGCGAGCCT CGACGACTGG ATCTCCGACC TGCAGAAGAT CAAGGCCAAA
GGCTTGATCC CGCTCTCGAT CGCGACCGAC TGGACCCAGG TCCACCTGTT CGAGACCGTG
CTGCTCGCCG ACCTCGGCGC CGACGCCTAC AACGGCCTCT GGGACGGCAC GACCGACTGG
GCCGGCGACG AGGTCGGCGC CGCGCTCGAG GACTACCAGA CCCTGTTCGA GCTCACCAAC
CAGGACCGGC AGTCGCTCGA CTGGCCGGAC GCCACCCAGC TCGTGATCGA CGGCGACGCC
GCGTTCAACG TGATGGGTGA CTGGGCCGTC GCGGCCTTCG AGGAGCAGGG CAAGAAGCTG
GGCACGGACT ACCTGGCCTA CCCGGTGCCC GGCACCGACG GTGTCTTCGA CTTCCTCGCG
GACTCCTTCA CCCTCCCGGT CGGGGCTCCC GACCCCGCCA CCACCGAGGC CTGGCTCTCC
ACCGTGGCCA GCCCCGAGGG TCAGACCGCC TTCAACATGA AGAAGGGCTC GATCCCGGCC
AACACGCAGG CCGACACCTC CGGGTTCGGT GACTACCAGC AGACCGCGAT CGAGTCGTAC
GCCAACGACG ACATCGTCTC GTCGCTGGCC CACGGCGCAG CGGCCCCGAT CAGCTGGCTG
ACCGACATCA CCTCGGCCGT CGCCAAGTAC GGTTCGACCG GTGACCTCGG TGGCTTCCAG
GACGACCTGG CTGCCGCCGC TGAGAAGAAC GCCGGCTGA
 
Protein sequence
MNRTRRALSA LVAAGALVLL AACGGSDSEG NNDTSSNGDA ASGEVEVFTW WAEGSEKAGL 
DALVAVFNDQ YPDIKFVNGA VAGGAGSDAK NVLASRLQTN DPPGTFQAHA GAELTDYINN
GQIEDLTQMY EDNGWNDYFP QTLLDRLEQD GKIYSVPSNI HRANVVWANP AVLKDAGVDP
EATYASLDDW ISDLQKIKAK GLIPLSIATD WTQVHLFETV LLADLGADAY NGLWDGTTDW
AGDEVGAALE DYQTLFELTN QDRQSLDWPD ATQLVIDGDA AFNVMGDWAV AAFEEQGKKL
GTDYLAYPVP GTDGVFDFLA DSFTLPVGAP DPATTEAWLS TVASPEGQTA FNMKKGSIPA
NTQADTSGFG DYQQTAIESY ANDDIVSSLA HGAAAPISWL TDITSAVAKY GSTGDLGGFQ
DDLAAAAEKN AG