Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | RoseRS_3987 |
Symbol | |
ID | 5210970 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Roseiflexus sp. RS-1 |
Kingdom | Bacteria |
Replicon accession | NC_009523 |
Strand | - |
Start bp | 4988555 |
End bp | 4989715 |
Gene Length | 1161 bp |
Protein Length | 386 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 640597578 |
Product | hypothetical protein |
Protein accession | YP_001278284 |
Protein GI | 148658079 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 11 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCATCCG TCAAAATACC CGTCGCCTTA CTGATCTTGA TCGGCGGCAG GCAGACGCCA AATGTGCTCA GCGCCCAGTT CCTGCGCCCC GACATTATTG CGCCGATTGC TTCACGCGAG GCGATGCGTC CAGGCGAAGC ATGGGAGAAG GTCAGGCGCG TCCTTGAGCA ACTCAGTCCA CGGGTGCTTG ACCCGCACAC CGTCGATGCG TTCGATCTGA ACGATATTCG CGAGCAATGC GCTGCGGCGA TGGATCGCTT TCCCGATGTC CGTTGGGTGT GCAATATCAC CTGCGCCACC ACGATCATGA GCATTGGCGC ATATGAGGTG GGACGGATGC GCAACGCCAG CGTCTGGTAT TTCGACACAG CCGGAAGGCG CGTTGTGACG CTGGCCGGTC AACCGCCGGA CGGCGATCCA TACCGGCTTT CGGTGGAGAA CTATCTCCAG ATATACAATC GTGCGGCTCA ACCGACGCCA CCTCCACCGG TGTCGTGGGT GGCACTTGCA CGACAGATGG CGCAGGCGCC TGATGACGCG ATTGAATTTC GTGAGATACT ACGCCGTGCG AACGCCGACG CCAGATTCAC ACAACCGCGC CGTCTTGCAG TGCTCTCCCT GACGCCGACA ATGGTTCAGT GGTGCGAACA GGCGCAGGCT GCCGGTTTCA TCTCCGCCAT ACACCAGCAC TCGAACCATC ACGAGATACT TCAGGCAGAT GGCGCATTCT GGGATTTCGT CAACGGCGCA TGGCTGGAGA TCTACGCCTG GGATGCAGCG CAACGCGCCG GTTGCTTCGA TGACTGTTGT CCTGGCATCG AAATACCTGC GCAGGGCGGG CTTTCGCCGA TGAATCAGAT CGATCTCGCG GCCACCCATG CCGCCTCGCT CCTGATCGCC GAATGCAAGA CAGAGGCGCG ACCGTTTCGC ACCGAGCATC TCGATCAACT GCGCGCGATC ACCAGCATGA TTGGTGGCTC ATTCGTGGGC GCATTGTTCA TCACAGCGCG CAGCCAGCAC AAAGCTGATG CACAGGCGCT CGCTGCCTTC CGTGCGCAGG CACAGGCGCG CCAGATTGTG GTGGTCACAG GCGATCAACT GAATCAGTTG CCGGATATTT TGACGCGCGA GGCGACCAGG CCGACATTTC CGCGAGGTTA A
|
Protein sequence | MASVKIPVAL LILIGGRQTP NVLSAQFLRP DIIAPIASRE AMRPGEAWEK VRRVLEQLSP RVLDPHTVDA FDLNDIREQC AAAMDRFPDV RWVCNITCAT TIMSIGAYEV GRMRNASVWY FDTAGRRVVT LAGQPPDGDP YRLSVENYLQ IYNRAAQPTP PPPVSWVALA RQMAQAPDDA IEFREILRRA NADARFTQPR RLAVLSLTPT MVQWCEQAQA AGFISAIHQH SNHHEILQAD GAFWDFVNGA WLEIYAWDAA QRAGCFDDCC PGIEIPAQGG LSPMNQIDLA ATHAASLLIA ECKTEARPFR TEHLDQLRAI TSMIGGSFVG ALFITARSQH KADAQALAAF RAQAQARQIV VVTGDQLNQL PDILTREATR PTFPRG
|
| |