Gene RoseRS_3940 details


Gene Information

Locus tag: RoseRS_3940
Symbol:
ID: 5210924
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Roseiflexus sp. RS-1
Kingdom: Bacteria
Replicon accession: NC_009523
Strand:
Start bp: 4932077
End bp: 4933396
Gene Length: 1320 bp
Protein Length: 439 aa
Translation table: 11
GC content: 67%
IMG OID: 640597536
Product: extracellular solute-binding protein
Protein accession: YP_001278242
Protein GI: 148658037
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG2182] Maltose-binding periplasmic proteins/domains
TIGRFAM ID:
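
The coordinates and lengths listed above are consistent with one another, which can be checked with a short sketch. It assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a stop codon that contributes no residue; neither convention is stated in the record itself. The function names are hypothetical helpers, not part of any database API.

```python
# Values copied from the Gene Information fields above.
START_BP = 4932077
END_BP = 4933396


def gene_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(cds_length: int) -> int:
    """Residue count for an unspliced CDS, assuming a terminal stop codon."""
    if cds_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length // 3 - 1  # the stop codon encodes no residue


length_bp = gene_length(START_BP, END_BP)
print(length_bp, protein_length(length_bp))  # prints: 1320 439
```

Under these assumptions the result matches the listed Gene Length (1320 bp) and Protein Length (439 aa).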


Plasmid Coverage information

Num covering plasmid clones: 20
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 13
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGAAACTC CTGCCCTGAC CATCCGCCGG TATGCCTGTG CGCTGGCGAT CATGCTGCTG 
GCGGCATGCA GTCAACCCGG TGCAGCGCCG CCAACACCGG CGCCTGCTCC AACGGTTGCG
CCGACAACGC CGGCAACGCC GCTGCCTTCG CCGACCGCTC CACCGACCGC AACGCCGCCA
CCCACACCCA CTCCGGCGCC CGAACCGCTG ACGGTGTGGG TGGCGGCTGA CGAAGCGCAC
CGCGACGCAC TGACCCGCCT GCTCACCGAC GCCGCTGCCG AAACCGGCGT TCCGGTCCGC
ATCATGAGCG GCAGCCCCGA CGCCATGATC GCCCGGTTGC GCGTTGATCA ACTCGACGGG
CGCCCGCCGC CAGACCTGAT CTGGGGGAAC GGCAATGACC TGGCAATCCT GCGCACGATG
GGGCTGATTC AGGCGACGCG TCGGACGGCT TCTCCCAACG ACACGCTGCC CGCTGTCATC
ACCGGCGCCA CTACCGACGG TCAACAGTGG GGTGAGCCGG TTGCGGCGCA GGGATTCTTG
CTGCTGCTCT ACAATCGGAA ACTCGTGGAA CATCCGCCGC GCACCGTCGA TTCACTGATC
GCCACTGCGC GTGCCAACAC TGGCGGCAAT CGGGTCGGAC TGGTCGCCGG ATGGACCGAA
GCGCGCTGGT TTGCGTTGTG GCTGGATATG ACCGGTGGAA CGATGCTTGA TGCTGATGGG
ATGCCGCTAC TCGACACACC CGCAGTTATC GCAGCGCTCG ATCTGCTGCG CACCATGCGA
CGGTATGGAC CGACATCCCC CTCGACCTAT GACGAAGGCG CACGGTTGTT CCGTCGCGGC
AGGGCAGCGC TGGCAATCGA CGGCGACTGG GCGCTGGAGA GTTATCGCGG ATTGACCGAG
ACGCTGGAGT TGGGCATTGC GCCGCTGCCG TTGACCAGCC GCGGAACGCC AGCGACAGCG
CCGCTGACCG GCGTCTACCT GATGTACGGC GCCGCGCTCG ACGCCTCACG CCTGGCACAG
GCGGAAGCGC TTGCACAGAC CCTGCGCGAA CCGGCATGGC AGGCGCGCAT TGCCCGCGAC
ACAGGGATGC TCCCGGCTTC CATCGCTGCA CTGAGCGACC CGGCGGTCAC TGACGATCCG
GCGCTTGCCG CCGCAGCGCA GTACGCCAGA AACGCACCCG GCATCCCGCC CGACCGCCCG
ATCCGCTGCG CCTGGGATGC TATCGAAGCG GCGCTCTCCC CGTTTTTGCT TGGCAAACGC
ACCGCCGCCG AAACCGCATC AGCGATGCAA CAGCGCGCCG ACGCCTGTGC ACGTCAGTAG
 
Protein sequence
METPALTIRR YACALAIMLL AACSQPGAAP PTPAPAPTVA PTTPATPLPS PTAPPTATPP 
PTPTPAPEPL TVWVAADEAH RDALTRLLTD AAAETGVPVR IMSGSPDAMI ARLRVDQLDG
RPPPDLIWGN GNDLAILRTM GLIQATRRTA SPNDTLPAVI TGATTDGQQW GEPVAAQGFL
LLLYNRKLVE HPPRTVDSLI ATARANTGGN RVGLVAGWTE ARWFALWLDM TGGTMLDADG
MPLLDTPAVI AALDLLRTMR RYGPTSPSTY DEGARLFRRG RAALAIDGDW ALESYRGLTE
TLELGIAPLP LTSRGTPATA PLTGVYLMYG AALDASRLAQ AEALAQTLRE PAWQARIARD
TGMLPASIAA LSDPAVTDDP ALAAAAQYAR NAPGIPPDRP IRCAWDAIEA ALSPFLLGKR
TAAETASAMQ QRADACARQ
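
Both sequences above are printed in blocks of 10 characters, 60 per line, so the spacing has to be removed before the sequences are measured or analyzed. The sketch below shows one way to do that and to compute a GC fraction comparable to the GC content field. The helper names `clean_sequence` and `gc_fraction` are hypothetical, and the excerpt is only the last line of the protein sequence, not the full record.

```python
def clean_sequence(text: str) -> str:
    """Remove the spaces and line breaks used to group the sequence into blocks."""
    return "".join(text.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in an already cleaned DNA string."""
    if not dna:
        raise ValueError("empty sequence")
    return (dna.count("G") + dna.count("C")) / len(dna)


# Last line of the protein sequence above, used as a short excerpt.
excerpt = "TAAETASAMQ QRADACARQ"
print(len(clean_sequence(excerpt)))  # prints: 19
```

Applying `clean_sequence` to the full gene sequence and passing the result to `gc_fraction` would give a value to compare against the listed GC content; applying it to the full protein sequence and taking `len` would give a value to compare against the listed Protein Length.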