Gene Noca_3914 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNoca_3914 
Symbol 
ID4598049 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardioides sp. JS614 
KingdomBacteria 
Replicon accessionNC_008699 
Strand
Start bp4119473 
End bp4120846 
Gene Length1374 bp 
Protein Length457 aa 
Translation table11 
GC content68% 
IMG OID639778520 
Productextracellular solute-binding protein 
Protein accessionYP_925099 
Protein GI119718134 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1653] ABC-type sugar transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.313486 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCGATCAC GCAAGACGCG CCGGGGGATG GCAGTCGCTG CCGCTCTCGT GAGCGCAGGA 
CTCGTTCTCG CCGGCTGTGG CGGGAGCGAC GACGGTGGTA GTGGCGAGGC CAGCGGCAAG
TCGCCGGGCG AGGGCAAGGC CGAGTGCGAG CAGCTCACGC AGTTCGGTGA CCTGACCGGC
AAGGACGTCA CGGTCTACAC CTCGATCGTG GCGCCCGAGG ACAAGCCCCA CATCGACTCC
TGGAAGGTCT TCGAGGACTG CACCGGCGCC GATGTGAAGT ACGAGGGCTC GAAGGAGTTC
GAGACCCAGC TGCAGGTGCG CGTCCAGTCG GGCAACCCGC CGGACATCGC GTACGTCCCG
CAGCCCGGCC TGCTCCAGAC CCTGGTCGGC ACCGGCAAGG TCGTCGAGGC CCCCGACACG
GTCTCGGCCA ACGTCGACAA GTGGTTCGGT GAGGACTGGC GCTCGTACGG CAGCGTGGAC
GGCAAGCTGT ACGCCGCCCC GCTGGGCGCG AACGTGAAGT CCTTCGTGTG GTACTCCCCC
AAGATGTTCG CCGAGAACGG CTGGGAGATC CCGACGACGT GGGACGACAT GCTCGCCCTG
TCCGACACGA TCACCGCGAC CGGCATCAAG CCGTGGTGCG CGGGCATCGA GTCCGGCGAG
GCCACCGGCT GGCCGGCCAC CGACTGGCTC GAGGACGTGC TGCTCCGCTC GGTCGGTCCG
GACGTCTACG ACCAGTGGGT CGCCCACGAG ATCCCCTTCA ACGACCCCGC GGTCGTCGAG
AGCCTCGACA ACGTCGGCGC GATCCTGAAG AACGACAAGT ACGTCAACGG CGGCATCGGT
GACGTCAGCT CGATCGCCAC GACCGCGTTC CAGGACGGCG GCCTGCCGAT CCTCGACGGC
AAGTGCGCCC TGCACCGCCA GGCGAGCTTC TACGCCGCCA ACTGGCCCGA GGGCACCGAC
GTCTCGGAGA ACGGCGACGT GTTCGCGTTC TACCTGCCGG CCATGGGCGA CGAGTTCGGC
AACCCGGTCC TCGGCGGCGG CGAGTTCGTC GCAGCGTTCT CGGACGCGAT CGAGGTCCAG
GCCTTCCAGA CCTACCTGTC CAGCGACCAG TGGGCCAACG AGAAGGCCAA GGCCACCCCG
AACGGCGGCT GGGTCAGCGC GAACAAGGGC CTGGACATCG CCAACCTGGC GAGCCCGGTC
GACAAGCTCT CCGGCGAGAT CCTGCAGGAC CCGGACGCGG TCTTCCGCTT CGACGGGTCC
GACATGATGC CGGGTGAGGT CGGCGCTGGT TCGTTCTGGA AGGAAATGAC CAACTGGATC
ACCGGCGAGA GCACCCAGGA CGCGCTCGAC AAGATCGAGG CCTCCTGGCC GTGA
 
Protein sequence
MRSRKTRRGM AVAAALVSAG LVLAGCGGSD DGGSGEASGK SPGEGKAECE QLTQFGDLTG 
KDVTVYTSIV APEDKPHIDS WKVFEDCTGA DVKYEGSKEF ETQLQVRVQS GNPPDIAYVP
QPGLLQTLVG TGKVVEAPDT VSANVDKWFG EDWRSYGSVD GKLYAAPLGA NVKSFVWYSP
KMFAENGWEI PTTWDDMLAL SDTITATGIK PWCAGIESGE ATGWPATDWL EDVLLRSVGP
DVYDQWVAHE IPFNDPAVVE SLDNVGAILK NDKYVNGGIG DVSSIATTAF QDGGLPILDG
KCALHRQASF YAANWPEGTD VSENGDVFAF YLPAMGDEFG NPVLGGGEFV AAFSDAIEVQ
AFQTYLSSDQ WANEKAKATP NGGWVSANKG LDIANLASPV DKLSGEILQD PDAVFRFDGS
DMMPGEVGAG SFWKEMTNWI TGESTQDALD KIEASWP