Gene RoseRS_4005 details


Gene Information

Locus tag: RoseRS_4005
Symbol:
ID: 5210988
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Roseiflexus sp. RS-1
Kingdom: Bacteria
Replicon accession: NC_009523
Strand:
Start bp: 5010167
End bp: 5011336
Gene Length: 1170 bp
Protein Length: 389 aa
Translation table: 11
GC content: 60%
IMG OID: 640597594
Product: extracellular solute-binding protein
Protein accession: YP_001278300
Protein GI: 148658095
COG category: [E] Amino acid transport and metabolism; [T] Signal transduction mechanisms
COG ID: [COG0834] ABC-type amino acid transport/signal transduction systems, periplasmic component/domain
TIGRFAM ID: [TIGR01096] lysine-arginine-ornithine-binding periplasmic protein
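
The coordinates and lengths in this record can be cross-checked with simple arithmetic. The sketch below is illustrative only and not part of the original record. It assumes the start and end positions are 1-based and inclusive, and that the reported gene length includes the stop codon; both assumptions are consistent with the values listed, but the record does not state them. The function names `span_length` and `coding_length_aa` are hypothetical.

```python
# Values copied from the record above.
START_BP = 5010167
END_BP = 5011336
GENE_LENGTH_BP = 1170
PROTEIN_LENGTH_AA = 389


def span_length(start, end):
    """Length of a coordinate range, assuming 1-based inclusive positions."""
    return end - start + 1


def coding_length_aa(gene_length_bp, has_stop_codon=True):
    """Number of amino acids encoded by a CDS of the given length.

    Assumes the length is a whole number of codons and, by default,
    that the final codon is a stop codon that is not translated.
    """
    codons, remainder = divmod(gene_length_bp, 3)
    if remainder != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return codons - 1 if has_stop_codon else codons


print(span_length(START_BP, END_BP))     # 1170
print(coding_length_aa(GENE_LENGTH_BP))  # 389
```

Under these assumptions, the span from start to end is 1170 bp, which is 390 codons, and dropping the stop codon gives the 389 aa reported as the protein length.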


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value: 0.431376
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 19
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAACGTG CGCTCGGATG GCTGCTTGTT ATTCCGCTGT TGCTGGCAGC ATGCGGTCAG 
CAGGCAGCAC AGACGGGGCA ACCCGTCGAA GTCACCCGCA TTGTCGAAGT CACCCGCGTC
GTTGAAGTCA CCCCTGCAGG CGGAGCGGCT CTGGTTCAAC CGGCAGCAAC CCCCGCTCCC
GCTCCGGCGC CTGCCGCGCC AGCCGGTTTC GGTGAGACGC TCAAGGCGAT CCAGGCGCGC
GGCAAACTGA TCTGCGGCGT CAACAGCCAG GTGCCCGGTT TTGGTTTCGT TGACCCGACC
GGTGCGTTTA GCGGGTTCGA CATCGACTAC TGCAAGGCGC TGGCAGCAGC GATCTTCAAC
GATGTCAGCA AAGTGGAGTA TCGCCCGCTG ACTGCCGAGC AGCGTTTTGC CGCACTCCAG
AGCGGTGAAA TCGATGTGCT CATCCGCAAC ACCACCTGGA CGCTCACCCG TGATACCGAT
AACGGCGGCA ACTTTGTCGC TACCACGTTC TACGATGGTC AGGGCATCAT GGTACCGAAA
GCCTCGAACA TCACGAAACT CGAAGATCTG AACGGTGCGA CCATCTGTGT GCAGAAGGGG
ACCACGACAG AGTTGAACCT GGCGGATCAG ATGGCGGCGC GTAAACTTCA GTACACCCCT
GCCGTCTTTG AAGACGCCAA CAGCACCTTC GCCGCATATG CAGAAGAGCG CTGCGATGCG
GTGACAACCG ATAAATCCGG TCTGGTATCG CGCCGGTCGG TGCTGCCGAA CCCGGATGAT
CACGTCATCC TCGATGTCAC CCTGTCGAAG GAGCCGCTCG GTCCAATGGT GCGCCAGGGT
GATGATCAAT GGTTCGACAT TGTGCAGTGG ACGGTGTTTG CCACCTTCGC CGCCGAGGAG
TTCGGCATCA CGTCACGGAA TGTCGATCAG GCGAAGGAGA GCGATACGCG CCCCGAAGTG
CGGCGGTTGC TCGGCGCCGA TCCGAATGTG GACCTGGGCG CCAAACTGGG CTTGAGCAAG
GATTGGGCTG CGAATGTGAT CAAGTCGGTC GGCAACTATG CCGAAATCTA CGACCGCAAC
CTGGGACCGA ATACGAAGAC GGCGATTCCG CGCGGTATTA ATAACCTGTA CACGCAGGGC
GGGTTGCTCT ACGCGCCGCC GTTCCGGTAA
 
Protein sequence
MKRALGWLLV IPLLLAACGQ QAAQTGQPVE VTRIVEVTRV VEVTPAGGAA LVQPAATPAP 
APAPAAPAGF GETLKAIQAR GKLICGVNSQ VPGFGFVDPT GAFSGFDIDY CKALAAAIFN
DVSKVEYRPL TAEQRFAALQ SGEIDVLIRN TTWTLTRDTD NGGNFVATTF YDGQGIMVPK
ASNITKLEDL NGATICVQKG TTTELNLADQ MAARKLQYTP AVFEDANSTF AAYAEERCDA
VTTDKSGLVS RRSVLPNPDD HVILDVTLSK EPLGPMVRQG DDQWFDIVQW TVFATFAAEE
FGITSRNVDQ AKESDTRPEV RRLLGADPNV DLGAKLGLSK DWAANVIKSV GNYAEIYDRN
LGPNTKTAIP RGINNLYTQG GLLYAPPFR
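
The sequences above are printed in space-separated blocks of ten characters, so they need to be joined before any length or composition check. The following is a minimal sketch of that cleanup, not part of the original record. The helper names `clean_sequence` and `gc_fraction` are hypothetical, and the sample is only the first line of the gene sequence, so the printed GC value describes that line alone and not the whole gene.

```python
def clean_sequence(text):
    """Join a block-formatted sequence by removing spaces and line breaks."""
    return "".join(text.split()).upper()


def gc_fraction(dna):
    """Fraction of G and C bases in a DNA string (0.0 for an empty string)."""
    if not dna:
        return 0.0
    gc_count = sum(1 for base in dna if base in "GC")
    return gc_count / len(dna)


# First line of the gene sequence from the record, used as a small sample.
sample = "ATGAAACGTG CGCTCGGATG GCTGCTTGTT ATTCCGCTGT TGCTGGCAGC ATGCGGTCAG"
dna = clean_sequence(sample)

print(len(dna))                    # number of bases in the sample line
print(round(gc_fraction(dna), 2))  # GC fraction of the sample line only
```

Applied to the full gene sequence, the cleaned length can be compared with the reported 1170 bp and the GC fraction with the reported 60%.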