Gene Noca_3683 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNoca_3683 
Symbol 
ID4597600 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardioides sp. JS614 
KingdomBacteria 
Replicon accessionNC_008699 
Strand
Start bp3905693 
End bp3906991 
Gene Length1299 bp 
Protein Length432 aa 
Translation table11 
GC content64% 
IMG OID639778291 
Productextracellular solute-binding protein 
Protein accessionYP_924870 
Protein GI119717905 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1653] ABC-type sugar transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAGAATC GAACTGTGCG CCGCACGGTG GCTGTCACAG CTGCCGTCGC GGCGCTGTCC 
CTGACGGCGG CATGCTCTGG GGGCAAGGGT GCGCCGGGGT CCGAGACCCC CAGCGGTGAC
GGAGACGTGA GCTCGAACGT CGAAGGGACC GTCCGGGTCC TCATGGAGGG CGTGCCCGAC
ACCGACATCG TCGAGGGGAT GATCGGCGAG TTCAACGAGC AGTACCCCAA CGTGAAGGTC
CAGATCGAGA CCGCCGTCTA CGACCAGATG CGTGACAAGT ACGTGGCGTC CTTCACCGCA
CCTGAGTCGT CCTACGATCT CGCGATCATC GACAACCCCT GGATGGGCGA CTTCGCGAAG
GCCGGATTCC TGACCCCACT GGACTCGTAC ATCGAGTCGA CGTCGGGTTA CGACTACGAG
GACTTCGCCG AACCTCTGCG CCAGATCAAC GAGGTCGACG GCAAGACCTA CGGTATTCCG
TTCTACAACT ACGGCTTGGG GCTGATCTAC CGCACCGATC TCCTCTCGGC AGCGCCCTCC
ACACTGGATG AACTCGTTGC TGCCGCCCAG GAGAACACCA CTGACACTCG GGCTGGCATT
GCGATGCAGC CCCAGCGCGG CTACAAGGCA TTTGAGGAAT GGGCCAACTT CCTCTTTGCC
GCTGGTGGCT CCATCTACGA CGACGAAGGA AATCTCAGCC TGGACACCCC CGAGGCGAAG
GAGGCCCTCG AGACCTACAT CGAGCTCTAC GAGACCGCGG CTCCCGCCAG CAGCCTCAAC
TGGGCCTTCG ACGAGGCGCT GCGATCCGTG AGCAGCGACA AGGCGGCCAT GATGGTCTCC
TACAACTGGA TGCTCCCCAC CCTCAACGCT GACGACTCGC CCGCCGGCGA TCTCGCTGGC
AAGTTCGCTT TGGCGACCAT GCCGGGCGGC AAGCAAGTCC TCGGTTCATG GAGTTGGGCC
ATCCCCTCCA ACAGTGAGAC GGACGATGCC GACTGGGCGT TCATCTCTTG GCTGACCTCT
GCCGACGGTG AGAAGCAGCG AGTGGAGGCC GGTGGCGCAC CCGTCCGGCA GAGCGTCCTG
ACTGATCCGC AGGTGGCGGC CCAGGGCTTC GGTGCTGACT ACTACGCCAC TGTCGGTGAC
ATCCTCGCCA ACTCGGCCCC CCTGTGCCAG GGCGCCAACT GCGACGAGAT GATCCAGGCG
GTCGGAACCG AGCTCAGCGC CGCAGTCTCC GGACAGAAGA GCGTGGCAGA CGCCCTCTCT
GCGGCCCAGG AGCAGGCGAC TCGGATCCAG TCCAGCTGA
 
Protein sequence
MKNRTVRRTV AVTAAVAALS LTAACSGGKG APGSETPSGD GDVSSNVEGT VRVLMEGVPD 
TDIVEGMIGE FNEQYPNVKV QIETAVYDQM RDKYVASFTA PESSYDLAII DNPWMGDFAK
AGFLTPLDSY IESTSGYDYE DFAEPLRQIN EVDGKTYGIP FYNYGLGLIY RTDLLSAAPS
TLDELVAAAQ ENTTDTRAGI AMQPQRGYKA FEEWANFLFA AGGSIYDDEG NLSLDTPEAK
EALETYIELY ETAAPASSLN WAFDEALRSV SSDKAAMMVS YNWMLPTLNA DDSPAGDLAG
KFALATMPGG KQVLGSWSWA IPSNSETDDA DWAFISWLTS ADGEKQRVEA GGAPVRQSVL
TDPQVAAQGF GADYYATVGD ILANSAPLCQ GANCDEMIQA VGTELSAAVS GQKSVADALS
AAQEQATRIQ SS