Gene Noca_1420 details


Gene Information

Locus tag: Noca_1420
Symbol:
ID: 4597327
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nocardioides sp. JS614
Kingdom: Bacteria
Replicon accession: NC_008699
Strand:
Start bp: 1504169
End bp: 1505443
Gene Length: 1275 bp
Protein Length: 424 aa
Translation table: 11
GC content: 67%
IMG OID: 639776018
Product: extracellular solute-binding protein
Protein accession: YP_922621
Protein GI: 119715656
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG1653] ABC-type sugar transport system, periplasmic component
TIGRFAM ID:
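
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the unspliced CDS ends in a single stop codon; the variable names are illustrative and not part of the record.

```python
# Values copied from the record above.
start_bp = 1504169
end_bp = 1505443
reported_gene_length = 1275      # bp
reported_protein_length = 424    # aa

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length = end_bp - start_bp + 1

# Assumption: one codon per residue plus one terminal stop codon.
codon_count, remainder = divmod(gene_length, 3)
protein_length = codon_count - 1

print(gene_length, remainder, protein_length)  # prints: 1275 0 424
```

Under these assumptions the computed values agree with the reported gene length (1275 bp) and protein length (424 aa).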


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value: 0.979516
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGATGCGTT TCCCGTTTGG TCGCCGTGGG GGGCGGACGC GCCTGATCGC GGCGGCTGCG 
CTGGCGGTCG CCAGCGTGGC CACCGTCGCA GGATGCGGCG CCGGCTCCGG AGCCAGTGCC
GGTGACGAGG TCAAGCTGCG CTTCCTGTTC CCGGAGTACA GCGCGAAGAC CGCCCCGCTG
ATGCAGGCGA TGGTCGACGA CTTCAACCAG GCCAACGACG GCAAGATCGA GGTGAGCCTC
GAGATCGCCC CCTGGGACAA GATGCACGAC AAGCTCGCGG TCTCCATGGG TTCGGGGCAG
GCGCCCGACG TGTTCGGCTA TGCCACCCGC TGGATCTCCG AGTTCGCGGG CCTCGACCAG
CTCGCACCCC TCGACGAGCA CCTCAGCGAG GACTTCAAGA GCACGTTCAA CCAGAAGGTG
CTCGAGGCCG GCACGTACGA CGGGAAGACC TACGGCCTTC CGGCCGCCGT CTCGGCGCGG
CTGCTGTTCT ACCGCGCGGA CGTGTTCGAG GAGGCCGGCC TGCAGGCGCC CCAGACCTGG
GACGACCTGA TGGAGGCCGC GACGACCACC GGGCAGCCGC CGGAGCGCTA CGGCCTCGGC
GTACCCGCCA GCGGGATCGA GGTCGACACG TTCTTCAACT ACTTCTTGTA CAACAACGGG
GGGGACATCC TCGACGAGAA CGGCAAGTCG ATGCTCAGTG CCCCGGAGAG CGTCGAGGCC
CTGCAGTACC TCACCGACCT GGTCAAGGCC GGGGGCAGTG AGCCGAAGCC CACCGGCTTC
ACCCGCGAGC AGGTCATCGA GAACTTCAAG GCCGGCCAGC TGTCCATGTA TCCCACCGGT
CCGTGGCTCA ACGCCATGAT CGAGGCCGAC AACCCGGACC TCGAGTACTC CGCGGTGCCG
TTCCCGACCA ACGACGGCAA GCCACAGCAG ACCGTCTCCG TGACCGACTC GCTCGGTCTG
TCGGCGAACA CCGAGCACCC CGACGAGGCG TGGAAGTTCG TCGAGTTCAT GTACCAGACG
AAGTACCGCC AGGCCTTCGA CGAGGGCGAG GGCATGCTCC CGGAGCTGAT CGCCGTGAGC
CAGTCGGACT ACTTCCAGAG CCCCGAGTAC AAGCCGTTCG TCGACGCCCT CGACACTGCC
AAGTTCCAGC CGCAGCACCC CAAGTTCGAG CAGATCCAGC AGATCGAGAC CGTCGCCGTG
CAGAAGGCGC TCAGCGGCCA GGCCACGCCC CAGGAGGCGC TGGACGAGGC CACCGAGCAG
ATCGACAAGC TCTGA
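
A GC fraction like the one listed under Gene Information can be computed from a sequence in this space-separated layout. This is a minimal sketch; the function name `gc_content` is hypothetical, and whether the database rounds or truncates its percentage is not stated in the record.

```python
def gc_content(sequence: str) -> float:
    """Return the fraction of G and C bases, ignoring whitespace and case."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return gc_count / len(bases)


# First ten bases of the gene sequence above: 4 of 10 are G or C.
print(gc_content("ATGATGCGTT"))  # prints: 0.4
```

Passing the full gene sequence as one string (spaces and line breaks included) gives the fraction for the whole gene.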
 
Protein sequence
MMRFPFGRRG GRTRLIAAAA LAVASVATVA GCGAGSGASA GDEVKLRFLF PEYSAKTAPL 
MQAMVDDFNQ ANDGKIEVSL EIAPWDKMHD KLAVSMGSGQ APDVFGYATR WISEFAGLDQ
LAPLDEHLSE DFKSTFNQKV LEAGTYDGKT YGLPAAVSAR LLFYRADVFE EAGLQAPQTW
DDLMEAATTT GQPPERYGLG VPASGIEVDT FFNYFLYNNG GDILDENGKS MLSAPESVEA
LQYLTDLVKA GGSEPKPTGF TREQVIENFK AGQLSMYPTG PWLNAMIEAD NPDLEYSAVP
FPTNDGKPQQ TVSVTDSLGL SANTEHPDEA WKFVEFMYQT KYRQAFDEGE GMLPELIAVS
QSDYFQSPEY KPFVDALDTA KFQPQHPKFE QIQQIETVAV QKALSGQATP QEALDEATEQ
IDKL
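
The protein sequence corresponds to a codon-by-codon translation of the gene sequence. The sketch below assumes that translation table 11 assigns amino acids to codons the same way the standard genetic code does (the tables differ in which start codons are permitted); the names `CODON_TABLE` and `translate` are illustrative.

```python
from itertools import product

# Standard codon assignments in TCAG order; assumed to match table 11.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): residue
    for codon, residue in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(sequence: str) -> str:
    """Translate a DNA string in frame 1, stopping at the first stop codon."""
    bases = "".join(sequence.split()).upper()
    protein = []
    for i in range(0, len(bases) - 2, 3):
        residue = CODON_TABLE[bases[i:i + 3]]
        if residue == "*":  # stop codon ends the protein
            break
        protein.append(residue)
    return "".join(protein)


# First seven codons and last five codons of the gene sequence above.
print(translate("ATGATGCGTT TCCCGTTTGG T"))  # prints: MMRFPFG
print(translate("ATCGACAAGC TCTGA"))         # prints: IDKL
```

Both fragments match the start (MMRFPFG) and end (IDKL) of the protein sequence listed above.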