Gene Noca_0869 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNoca_0869 
Symbol 
ID4599876 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardioides sp. JS614 
KingdomBacteria 
Replicon accessionNC_008699 
Strand
Start bp907548 
End bp908882 
Gene Length1335 bp 
Protein Length444 aa 
Translation table11 
GC content69% 
IMG OID639775470 
Productextracellular solute-binding protein 
Protein accessionYP_922079 
Protein GI119715114 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1653] ABC-type sugar transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.509231 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAGCGCA CATCACCCGT CTCGGGCTCG CTCGCCGGGA TCACCGCCCC CCGGCCGACG 
CGTCGTGGCC TGCTCGTGGG CGGAGGTGCG CTCGGCCTGT CCGGCCTGCT GGTCGCGTGC
GGGGTCGGCG GTGGCGGCGG CGAGAAGGGC GGTGGTGGCA GCGGCAGCGG CAGCATCCGC
GCCCTGTTCA TGCAGCAGGC GGGCTACTCC GAGGACAACA TCAAGGAGAT GACGGCGGCC
TTCATGAAGG CCAACACCGA CATCAAGGTC ACCGCCGACT TCGTCTCCTA CGAGGCGCTG
CACGACAAGA TCGTGGCGGC CGCGCCGGCG GGCACCTATG ACGTCGTGCT GATCGACGTG
ATCTGGCCGG CCGAGTTCGG CACCAAGAAC ATCGTCGCCG ACGTCACCGA CCGGTGGCCC
GACGAGTGGA AGCAGCAGAT GCTCGGCGGC GCGGTCGCGA CGCCGCAGTA CGACGGCAAG
TTCTACGGGG TCCCGTGGAT CCTGGACACC AAGTATCTCT TCTACAACAC CGCCCAGCTC
GAGAAGGCGA AGGTCGACGC CGGCGAGCTC GACACCTGGG ACGGCGTCCT CAGCGCGGCC
CGCGCGCTCA AGCAGAGCGG TGTCCAGTAC CCGCTGATCT GGTCCTGGCA GCAGGCGGAG
GCCTTGATCT GCGACTACAC CCAGCTCCTC GGTGCCTTCG GCGGAACCTT CCTCGACGAC
GCGGGCCAGC CCGCGTTCAA CCAGGGAGGC GGCGTCGCTG CGCTGGAGTT CATGCGGCAG
AGCATCGTCG ACGGGCTCAC CAACCCCGCC TCGACGCAGT CGCTCGAGGA GGACGTGCGG
CGCGTGTTCT CCTCCGGTCA GGCCAGCATC GCCCTGAACT GGACCTACAT GTACGGCCTC
GCCAACGACC CCAAGGAGAG CCAGATCCCC GGCGACGTCG CGGTGCTGCA GACCCCGAGC
GGCCCGGTCG GCCGCCCCGG CGTGAACGGC AGCATGGCGC TCTCCCTCTC CGCGACCAGT
GAGAACCAGG ATGCCGGCTG GAAGTACATC GAGTACCTCA CCAGCCAGCC GGTCCAGGAC
AAGTACGCCC TCAGCTCGCT GCCCGTGTGG TCGTCGTCGT ACGACGACCC CAAGGTCGTC
GACACGAACC CCGCCGTCGT GCCGCAGGCC AAGAAGCAGC TCGGCGACAT GATCCTGCGG
CCCCAGGTCG CCAGCTACAA CGCGATGTCC CAGGTGCTCC AGGCCGAGAT CCAGAAGGCC
CTGCTCGGTG ACAAGGAGCC GCAGCAGGCG CTGGACGACG CAGCCTCCCA GGCGGCCGAC
CTGCTGGAGT CCTGA
 
Protein sequence
MKRTSPVSGS LAGITAPRPT RRGLLVGGGA LGLSGLLVAC GVGGGGGEKG GGGSGSGSIR 
ALFMQQAGYS EDNIKEMTAA FMKANTDIKV TADFVSYEAL HDKIVAAAPA GTYDVVLIDV
IWPAEFGTKN IVADVTDRWP DEWKQQMLGG AVATPQYDGK FYGVPWILDT KYLFYNTAQL
EKAKVDAGEL DTWDGVLSAA RALKQSGVQY PLIWSWQQAE ALICDYTQLL GAFGGTFLDD
AGQPAFNQGG GVAALEFMRQ SIVDGLTNPA STQSLEEDVR RVFSSGQASI ALNWTYMYGL
ANDPKESQIP GDVAVLQTPS GPVGRPGVNG SMALSLSATS ENQDAGWKYI EYLTSQPVQD
KYALSSLPVW SSSYDDPKVV DTNPAVVPQA KKQLGDMILR PQVASYNAMS QVLQAEIQKA
LLGDKEPQQA LDDAASQAAD LLES