Gene Noca_0433 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNoca_0433 
Symbol 
ID4597745 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardioides sp. JS614 
KingdomBacteria 
Replicon accessionNC_008699 
Strand
Start bp462413 
End bp463492 
Gene Length1080 bp 
Protein Length359 aa 
Translation table11 
GC content71% 
IMG OID639775047 
ProductABC transporter periplasmic-binding protein 
Protein accessionYP_921662 
Protein GI119714697 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG4143] ABC-type thiamine transport system, periplasmic component 
TIGRFAM ID[TIGR01254] ABC transporter periplasmic binding protein, thiB subfamily 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.127437 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGACCCA TCCATGCCCT GGCGCCACTG GCACCGCTCG TCGCGCTCGC CCTGGCGGCG 
ACCGGCTGCT CGACCATCGG AAACGGCTCG TCCGGCGACG AGTCGACGGC CAGCCCGGGC
GGCGAGCCCG GGCGGGTCGT GCTGGTCACC CACGAGTCGT TCTCGCTGCC CAAGCCGCTG
ATCAAGCAGT TCGAGCAGCA GTCCGGCTAC GACCTGGTGG TCCGCGCGTC CGGCGACGCC
GGGATCCTCA CCAACAAGCT GGTGCTGACC AAGGGCGACC CGCTCGGCGA CGTGGCGTTC
GGCATCGACA ACACCTTCGC CTCGCGGGCG CTCGACGAGG GCGTGTTCGC GCCGTACGAC
GCTCCGCTGC CGGACGGCGC CGACCAGTAC CGGTTGCCGG GTGACGACGA GCACGACCTC
ACGCCGATCG ACAACGCCAG CGTGTGCGTG AACGTCGACG ACACCTGGTT CGCCGACCAC
CATCTCGCCC CGCCGAAGAC CCTGGACGAC CTGGCCGCCC CGGAGTACCA GGGCCTGTTC
GTCACCCCGG GCGCGGCCAC CAGCTCGCCC GGCCTGGCGT TCCTGCTGAG CACGATCGCG
GCGTACGGCG AGGACGGCTG GCAGGACTAC TGGGCGCGGC TGATGGAGAA CGGGACCAAG
CTGACGGCCG GCTGGTCCGA CGCCTACGAG GTCGACTTCA CCCAGGGCGG CGGCCAGGGT
GACCGGCCGA TCGTGCTGTC CTACGACTCC TCGCCGGCGT TCACCGTCGC CGACGGGGAG
TCGTCGACGA GCGCCCTGCT CGACACCTGC TTCCGCCAGG TGGAGTACGC CGGCGTGCTG
GACGGTGCCG CGAACCCCGC CGGCGCCGAG CAGCTGGTCG ACTTCCTGCT CTCGCCGGAG
GTGCAGGCCG CACTGCCCGA GAGCATGTAC GTGTTCCCGG TCGACTCCTC GGTCCAGCTG
CCGAAGGAGT GGGCCCGGTT CGCGAAGCAG CCGAGCGACC CGTTCTCGGT GGACCCCGCG
TCGATCGACG AGCACCGCGA CGAGTGGCTG CGCGAGTGGA CCGACGTGAC CTCTCGATGA
 
Protein sequence
MRPIHALAPL APLVALALAA TGCSTIGNGS SGDESTASPG GEPGRVVLVT HESFSLPKPL 
IKQFEQQSGY DLVVRASGDA GILTNKLVLT KGDPLGDVAF GIDNTFASRA LDEGVFAPYD
APLPDGADQY RLPGDDEHDL TPIDNASVCV NVDDTWFADH HLAPPKTLDD LAAPEYQGLF
VTPGAATSSP GLAFLLSTIA AYGEDGWQDY WARLMENGTK LTAGWSDAYE VDFTQGGGQG
DRPIVLSYDS SPAFTVADGE SSTSALLDTC FRQVEYAGVL DGAANPAGAE QLVDFLLSPE
VQAALPESMY VFPVDSSVQL PKEWARFAKQ PSDPFSVDPA SIDEHRDEWL REWTDVTSR