Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | RoseRS_4226 |
Symbol | |
ID | 5211211 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Roseiflexus sp. RS-1 |
Kingdom | Bacteria |
Replicon accession | NC_009523 |
Strand | + |
Start bp | 5294601 |
End bp | 5295770 |
Gene Length | 1170 bp |
Protein Length | 389 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 640597815 |
Product | membrane protein-like protein |
Protein accession | YP_001278519 |
Protein GI | 148658314 |
COG category | [S] Function unknown |
COG ID | [COG3503] Predicted membrane protein |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 8 |
Fosmid unclonability p-value | 0.279429 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCGCTTC ACAGTACTGC GGGTGTTCTG ACGCAGACCG AAGTGACAAT GCAGGTCGGC GCCGGTAGTC GCATCACCGC CATCGACGCG CTGCGCGGCG TGGCGCTGGC GCTCATGGCG CTGACCCACG CGGCGTTTTT CATCGGCGTC GGCATGCAGG CCGAGTCGTA TGGCGGGCAG CGCGTGTATC TGCAAAGCCC GCCCTACTGG ATCTCCGGCC TGCTCACCAC CCTGGCATCG CCGATCTTTT TCTGCCTGGC GGGCGTCAGC CTGGCGCTGC TCGAACAGTC GCGGGTGCGC AAAAGCGCGT CACCGTGGGC AGTGAGCCGC TTCATTCTGG CGCGCGCTGG GGTCATCATG GCGCTCGATC TGACGATCTG CGCCTGGTTG TGGCTGGGGA AGATGCCGTA CATCCACGTG CTGACTGCCA TGGGGCTGGG GATGATCATC CTGTCCGGGT TGCGCCTGTT GCCGACCCGC GCGATTCTGG CGGTTGCGAT TGCAACATTG CTCGTTCACC AGGGCATGAT CGAGGTCCTT CGTCCGCAAC TCGAAGCGGG AGCGCCGCAG AGCCTGGCGC AGGCGCTCTT TCTGACCTAC AGTTATGAGA CGCATCCGCC GGTCGGGTTC CCGGTGCTGG GATGGGGTCC GGTGATGTGG CTGGGATTTG TGCTGGGGCG GAATCTGAGC CAGCCGATGT TGCGTCAACC ACGCACGTGG ATCGTGATCG GGGGGGGACT GTTGCTGATC TGGGCGGCAT TGCGGCTGAT CGGCGGCTAC GGTGATCTGG GAGCATACCG GGCTGGCGAA CCGATCCAGT ATGTTCTGGT GATGAGCAAG GCGCCGCCGA GCCTGAGCTA TCTGGCGTTC AAACTGGGCA TCGCGGCGCT GATCTTTGCG GCGCTGGTCG CCTTCCCAAC CCTGATTGAC GCCGGTCTGT TGCGCATATT GACGCTTATC GGGCAGACCT CGTTGTTCTT CTACGTGATG CACATCGTTA TCTATCACGC CCTGGCGCAG TGCTTCTTTC TGTTCGATCC GCCCGAATTG CCGGGCATTG TGTACGGATA TGCGGTGTGG GCGCTGGGAA TGATGGCGCT GGTTCCACTC TGCGAGCGCT ACCGCGCGCT GCGGAAGCGA TATCCGGAGA GTGTGCTCAG ATATTTGTAG
|
Protein sequence | MALHSTAGVL TQTEVTMQVG AGSRITAIDA LRGVALALMA LTHAAFFIGV GMQAESYGGQ RVYLQSPPYW ISGLLTTLAS PIFFCLAGVS LALLEQSRVR KSASPWAVSR FILARAGVIM ALDLTICAWL WLGKMPYIHV LTAMGLGMII LSGLRLLPTR AILAVAIATL LVHQGMIEVL RPQLEAGAPQ SLAQALFLTY SYETHPPVGF PVLGWGPVMW LGFVLGRNLS QPMLRQPRTW IVIGGGLLLI WAALRLIGGY GDLGAYRAGE PIQYVLVMSK APPSLSYLAF KLGIAALIFA ALVAFPTLID AGLLRILTLI GQTSLFFYVM HIVIYHALAQ CFFLFDPPEL PGIVYGYAVW ALGMMALVPL CERYRALRKR YPESVLRYL
|
| |