Gene RoseRS_1420 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRoseRS_1420 
Symbol 
ID5208372 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRoseiflexus sp. RS-1 
KingdomBacteria 
Replicon accessionNC_009523 
Strand
Start bp1729990 
End bp1731237 
Gene Length1248 bp 
Protein Length415 aa 
Translation table11 
GC content62% 
IMG OID640595031 
Productextracellular solute-binding protein 
Protein accessionYP_001275770 
Protein GI148655565 
COG category[E] Amino acid transport and metabolism
[T] Signal transduction mechanisms 
COG ID[COG0834] ABC-type amino acid transport/signal transduction systems, periplasmic component/domain 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0000217844 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0146408 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCACACCC GCTTTTCGCC CGCTGCTCGA GTTCCACCGA TAGCCCTGAT AGTGCTGGCG 
TTCGTCCTTG CCGGGTGTGG CAACCTCGGC GGACTGCTCG GCGGACAACC GACCCCTGAA
CCGATCATTC TGATCGCAAC GGCGACTCCC GCACCGGTGT CTCAGGCGAC CGCCACGCTG
ACCGTTGCAC CATCGCCGAC CAGCGCGCCC TCCGACGATG CTACCGCCAC CCCTGCACCG
CCAACTGCGG AACCGCCCAC TCCCCCGCCG GCGCCGCAAA AGATCCTGGC GCGCGTCAAG
GAGCGCGGCT ACCTGATCTG TGGAACGAAC GCCGACCTGC CTGGGTTTGG CTTCTACGAT
AGCGTGCGCC AGACCTGGAG CGGCTTCGAT GTCGATTTCT GCCGCGCTGT CGCTGCCGCG
ATCTTCGGGG ATGCCACAAA AGTAGAGTTC GTCGCCCTCG GCACCGGACC AGGACCGAAC
AACCGGTTCG ATGCCGTGCG CGAGGGACGG GTCGACGTCC TGTTCCGCAA TACCACCTGG
ACATTGGGAC GGAACATCAG CGGTCTGGCG TTTGGTCCCA CGACCTTTCA CGACGGTCAG
ACCTTCATGG TACGCGCCAA AGACCGGATC ACGAAACTTG AAGATCTCGA AGGCAAGGTG
ATCTGTGTTG CAAAAGGCAC CACCAGCGAG CAGAACCTGA ACGACGACTT CGCCGCGCGC
GGCATCAGGT TCACTGCCCG CGTGCTTGAT GGCGAAGATG AACTCTACCC CGCCTACGAC
GAAGGCGAAT GCGATGCGGT GACCAGCGAC AGTTCCCAAC TGGCTGCCAA ACGTCAGCAA
CTCAAGAATC CTGCCGACCA CATCATTCTC GGCGACCGCA TCTCGCGCGA GCCGCTCGGT
CCCGTCATCG CCCGCGACGA CAACCAGTGG CTCGACGTGA TCAGCTGGAC GGTCTTTGCG
ACGATCTATG CAGAGGAGTT GCGTGTCGAC CAGCGCAACG TTGATCGGTT GCGCACCAGC
ACAACCGATC CGCGCATCAA ACGGTTGCTG GGGTTGGAAG GAAACTTTGG CGAGGGATTG
GGGCTACCGA ACGACTTCGC CTACCAGATC ATCAAGCAGG TCGGCAACTA CGGCGATATT
TACAACCGCA ACCTGGGACC AAACACTGTT ATCAACCTTG ACCGCGGACC GAACAAGGTC
TGGAATCTCG GCGCCGGCGG CGTGCTGGCG TCCCCGCCGT TCCGCTGA
 
Protein sequence
MHTRFSPAAR VPPIALIVLA FVLAGCGNLG GLLGGQPTPE PIILIATATP APVSQATATL 
TVAPSPTSAP SDDATATPAP PTAEPPTPPP APQKILARVK ERGYLICGTN ADLPGFGFYD
SVRQTWSGFD VDFCRAVAAA IFGDATKVEF VALGTGPGPN NRFDAVREGR VDVLFRNTTW
TLGRNISGLA FGPTTFHDGQ TFMVRAKDRI TKLEDLEGKV ICVAKGTTSE QNLNDDFAAR
GIRFTARVLD GEDELYPAYD EGECDAVTSD SSQLAAKRQQ LKNPADHIIL GDRISREPLG
PVIARDDNQW LDVISWTVFA TIYAEELRVD QRNVDRLRTS TTDPRIKRLL GLEGNFGEGL
GLPNDFAYQI IKQVGNYGDI YNRNLGPNTV INLDRGPNKV WNLGAGGVLA SPPFR