Gene RoseRS_4004 details


Gene Information

Locus tag: RoseRS_4004
Symbol:
ID: 5210987
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Roseiflexus sp. RS-1
Kingdom: Bacteria
Replicon accession: NC_009523
Strand:
Start bp: 5008907
End bp: 5010097
Gene Length: 1191 bp
Protein Length: 396 aa
Translation table: 11
GC content: 60%
IMG OID: 640597593
Product: polar amino acid ABC transporter, inner membrane subunit
Protein accession: YP_001278299
Protein GI: 148658094
COG category: [E] Amino acid transport and metabolism
COG ID: [COG4597] ABC-type amino acid transport system, permease component
TIGRFAM ID: [TIGR01726] amine acid ABC transporter, permease protein, 3-TM region, His/Glu/Gln/Arg/opine family
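
The coordinates and lengths in the record above can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the CDS ends with a single stop codon that is not counted in the protein length. The constant and function names are hypothetical and are not part of the record.

```python
# Values copied from the Gene Information record.
START_BP = 5008907
END_BP = 5010097
GENE_LENGTH_BP = 1191
PROTEIN_LENGTH_AA = 396


def span_length(start, end):
    """Length of a coordinate range, assuming 1-based inclusive positions."""
    return end - start + 1


def expected_protein_length(gene_length_bp):
    """Number of codons in the CDS minus one terminal stop codon (assumed)."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


# Both checks should agree with the values listed in the record.
print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions, the span 5008907 to 5010097 covers 1191 bp, and 1191 / 3 = 397 codons, of which one is the stop codon, leaving 396 amino acids.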


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value: 0.0588648
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 18
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGAACACC CTCCGCGTTC GACATCCCCA ATGCGGTTGA GTCGCGCTTT CTGGCGTGAT 
GAGCGCGTCA TCCAGGTCGC TGCCCAGATC CTGTTTCTGG CGCTGGTTAT CTGGGTGGGA
TCGATTGCGC TGCGCAACAT GCTGACGTCG TTGCAGCAGC AGGGTTTGGT GCTGGGGTTC
GACTTTTTGA ACAGCGGCGC CGGGTTCGAT ATCAGTGATG CTCCAATCCC GTACAGCAGC
ACCGATACGT TCGCTCGCGC CCTTCAGGTC GGGTTACTCA ACACGATCCT GGTGAGTGTG
CTCGGCATCG TTCTGTCAAC GCTGCTCGGC ATTGTGGTTG GGGTGGCGCG TCTTTCGAGC
AACTGGCTCG TCAACCGCAT GGCGTGGGTG TTTGTCGAGG TGATGCGCAA TGTGCCGCTG
CTGGTGCTGC TCGTGTTCAT TTACACGGCA TTCTTTCTGA AACTTCCCCG CGCGCGGCAG
GCGGTGAGCC TCGGTCCGAT CTACCTGAGC AATCGTGGGG TGGCGATACC CTGGGGCGAG
CCGACCGACA CCTGGTCGCT CTATGTTTCG GTTCTGATCG GCGCCCTGAT CGCCGGTGCA
GTGGTGGGTC TGGCAATGCG CTGGTGGCAG AACCGGAGTG GTCGTCCGCG TCCGCAGGTT
GTGCCATCAC TGCTGACGAT GACGCTGATT GCGCTGATCG GATGGTTCGC CCTGCCGCAA
CCGCCGCTTG CGCTTTCACT GCCGGAGATC GCCGGCTTCA ACTTTCGCGG TGGTCAGGTG
CTCTCACCCG AATTTATGGC GCTCCTGATC GGTCTCGTTA TCTACACCGC AGCATTTATC
GGCGAAGTGG TGCGAGCCGG CATTCAGGCG GTGCCGAAAG GTCAGGTCGA AGCGGCGCGG
GCGCTGGGTC TCAACCCATC GCGCACGCTG CGGCTGGTCG TGTTTCCGCA GGCGTTGCGC
GTGATCATTC CGCCGGTCAC CAATCAGTAC CTGAACCTCA CCAAAAACTC GACGCTGGCG
GTCGCCATCG GATACCCTGA TCTGTTCGCA ATATCGGGAA CGATCATCAA TCAGACCGGG
CGCGCTGTTG AGATGATTGC AGTGGTCATG GCGGTGTATC TCATGCTGAG TCTGATCACG
TCGCTGGTGA TGAACTGGTA CAACCGTCGT GTACGTCTGG TGGAGCGTTG A
 
Protein sequence
MEHPPRSTSP MRLSRAFWRD ERVIQVAAQI LFLALVIWVG SIALRNMLTS LQQQGLVLGF 
DFLNSGAGFD ISDAPIPYSS TDTFARALQV GLLNTILVSV LGIVLSTLLG IVVGVARLSS
NWLVNRMAWV FVEVMRNVPL LVLLVFIYTA FFLKLPRARQ AVSLGPIYLS NRGVAIPWGE
PTDTWSLYVS VLIGALIAGA VVGLAMRWWQ NRSGRPRPQV VPSLLTMTLI ALIGWFALPQ
PPLALSLPEI AGFNFRGGQV LSPEFMALLI GLVIYTAAFI GEVVRAGIQA VPKGQVEAAR
ALGLNPSRTL RLVVFPQALR VIIPPVTNQY LNLTKNSTLA VAIGYPDLFA ISGTIINQTG
RAVEMIAVVM AVYLMLSLIT SLVMNWYNRR VRLVER
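
As a sanity check on the two sequences above, the gene sequence can be translated and compared with the listed protein. The following is a minimal sketch, not the database's own method. It assumes the sequence is given 5' to 3' on the coding strand, relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code (it differs in its permitted start codons), and does not handle alternative start codons, which is sufficient here because the gene begins with ATG. The names `CODON_TABLE` and `translate` are hypothetical.

```python
# Codon table in the conventional TCAG ordering; the amino acid
# assignments are shared by the standard code and translation table 11.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODONS = [a + b + c for a in BASES for b in BASES for c in BASES]
CODON_TABLE = dict(zip(CODONS, AMINO_ACIDS))


def translate(dna):
    """Translate a DNA string, ignoring whitespace, up to the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[i:i + 3]]
        if amino_acid == "*":  # stop codon ends translation
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases of the gene sequence, with the page's 10-base grouping.
print(translate("ATGGAACACC CTCCGCGTTC GACATCCCCA"))  # MEHPPRSTSP
# Last 12 bases of the gene sequence, ending in the TGA stop codon.
print(translate("GTGGAGCGTTGA"))  # VER
```

The two fragments reproduce the first ten residues (MEHPPRSTSP) and the last three residues (VER) of the protein sequence. Passing the full gene sequence to `translate` would allow the whole 396-residue protein to be compared in the same way.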