Gene Noca_3688 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNoca_3688 
Symbol 
ID4597605 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardioides sp. JS614 
KingdomBacteria 
Replicon accessionNC_008699 
Strand
Start bp3911251 
End bp3912651 
Gene Length1401 bp 
Protein Length466 aa 
Translation table11 
GC content72% 
IMG OID639778296 
Productextracellular solute-binding protein 
Protein accessionYP_924875 
Protein GI119717910 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1653] ABC-type sugar transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGAGGGCGA GTCGGGGGCG CCGCGCGTTG CTGGGCGCCG TGGCCGTGGT CGGGCTGGCC 
CTCTCGGCGG TGGGCTGCAC GGGCGACGGC GCGACGCCGG GGCCGAAGCC GGCCACCAGC
CCGGTCCAGC CGGAGGTGTC CCGGCTGACG TTCGGGGTCT ACGGAGCGCC GGCCGAGATC
GCGGCGTACC GCGCCACCGT CGACGCCTGG AACGCCGCCG GCGCGGAGCA GGACCGACCC
GAGGTCAAGC TGCGCTCCTG GCCCGATCAC GCGGCGATGC GTGCCGACAT CGACTCCGGT
GCCCCGGTGC CCGACGTGTT CCTGGCCTCG CGGTCGGACC TGAGCTGGCT GCTCGAGAAC
CGGCACAACC AGCCGGTCGA CGAGCTGCTC GACGAGCGTG GCGTCGAGTT CGGCGACCAG
TACTCCCGCG ACTCGATCCA GGCGTTCAGC GCCGATGACC GGCTGCAGTG CATGCCGTAC
GCCGTCTCCC CGATGGTGAT CTACTACAAC CGCGACCTGG TGAACTTCAA CCGGATGCGC
AAGCGCGGCC TGGACGCCCC GGACCAGGAC GCCAAGAGCT GGTCGTTCGA TCAGTTCGCC
GCCGCCGCCG ACTTCGCCAC CCGTCCCGGG CGCGGCACCA AGGGCGTGCA CATCGCGGCG
ACCCTGCCCG GGCTGGCGCC GTTCATCGGC TCCGGTGGCG GCTCGGTGTA CGACGACAAC
ACCGACCCGA CCTCGCTGGC CTTCTCCAGT GACGGGACCC GCTCGGCGTT GGAGCGCACC
CTCGAGCTGC TCCGCAACCC GCAGGTCACC CTCGACGACG ACCAGCTCGC CGAGGCCAGT
CCGCTGACCT GGTTCGAGCG CGGCCGCCTC GGCATGATCG CCGGCTACCG CTCGCTGGTG
CCGGAGTTGC GCGGGGTCGA CGACCTGGAC TTCGACGTGA TGCCGATGCC GGTGCTCGAC
AGCTCCTCGA CCGTCGGCGA CGTCACCGGC CTGTGCCTGT CCCGCACCTC CGACAGCGTC
CCGCTGGCGG CGGACTTCTT GATCCACGAG ATCTCCACCG AGGCCGTGAG CCGGGTGACT
CGCACCGGCT ACCTCGCCCC CGCCAACCTG GAGGTGGCGC TCTCGGACAC GTTCCTCCAG
CCCGGCCGGG AGCCGCTCCA CGCGGCGTTC TTCAACTCGA CGGTCCGCTC GATCGACCTG
CCGCCGCTGA TCGACACCCT CGGCCGGCTC GAGGCGGCGG TGCAGCCGAG CCTCGAGCAG
CTCGTCTACG GCATCGGCGT ACTCGACCTG GAGGGCCTCA CCGAGCAGAT CGACGAGGAG
TCCCGGGCGG TCCTCAGCCC GCCCGAGCCC AGCGAACCAC CCAGCCCGAC AGGGCGATCT
GCGGCGACGC CCTCGTCCTA G
 
Protein sequence
MRASRGRRAL LGAVAVVGLA LSAVGCTGDG ATPGPKPATS PVQPEVSRLT FGVYGAPAEI 
AAYRATVDAW NAAGAEQDRP EVKLRSWPDH AAMRADIDSG APVPDVFLAS RSDLSWLLEN
RHNQPVDELL DERGVEFGDQ YSRDSIQAFS ADDRLQCMPY AVSPMVIYYN RDLVNFNRMR
KRGLDAPDQD AKSWSFDQFA AAADFATRPG RGTKGVHIAA TLPGLAPFIG SGGGSVYDDN
TDPTSLAFSS DGTRSALERT LELLRNPQVT LDDDQLAEAS PLTWFERGRL GMIAGYRSLV
PELRGVDDLD FDVMPMPVLD SSSTVGDVTG LCLSRTSDSV PLAADFLIHE ISTEAVSRVT
RTGYLAPANL EVALSDTFLQ PGREPLHAAF FNSTVRSIDL PPLIDTLGRL EAAVQPSLEQ
LVYGIGVLDL EGLTEQIDEE SRAVLSPPEP SEPPSPTGRS AATPSS