Gene Information |
Locus tag | RoseRS_1037 |
Symbol | |
ID | 5207983 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Roseiflexus sp. RS-1 |
Kingdom | Bacteria |
Replicon accession | NC_009523 |
Strand | + |
Start bp | 1274236 |
End bp | 1276044 |
Gene Length | 1809 bp |
Protein Length | 602 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 640594651 |
Product | hypothetical protein |
Protein accession | YP_001275396 |
Protein GI | 148655191 |
COG category | |
COG ID | |
TIGRFAM ID | |
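The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive coordinates and that Gene Length counts the stop codon; the function names are illustrative and not part of the source database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Amino-acid count for an unspliced CDS whose length includes the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminating stop codon.
    return gene_len_bp // 3 - 1


if __name__ == "__main__":
    length = gene_length(1274236, 1276044)
    print(length, protein_length(length))
```

With the values from this record, the two functions reproduce the listed Gene Length (1809 bp) and Protein Length (602 aa).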
| |
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAATGCCT CGTCGCCGCT CATCGCATTG GTTGGGTTAT TTCTGGCGCT GTGTGTGGTG TACAACGCCG TGACGCCGCT CGGTGAGGGA CCGGACGAGC CGGGGCATGC GGCGTATGTG TTCTTTCTGG CGCGCGAGGG GCGTCTGCCG GTGCAATGTT CGCCGCCCTG CGCCAGTGAT GTGCCAGGAT CGGGGCATCA TCCGCCGCTT GCATATCTGC TGGCGATGCC TGCGGCGCTC TGGTTGCCGC CAACGCTGCG TACCTTCGAT CTGCCCGGTA ATCCACGCTT TACCTGGGCA GGCGGCGATC AGGTGAACGC GGTGGCGCAC GGGAGCCGCG AACAGTGGCC CTGGGACGCA CAGGTGTGGT CCTGGCGATT GGCGCGCCTG GCGTCGAGTC TGGCCGGCGC GGCGACGATC ATCTTTACCT GTCTGGCGGC GGATGCGCTG CATCGTCGCC TGAACAACGA TCCGGCGCGT GGGAACAGCG ACCAGAAGGC GCTGCTGGCA GCCGCACTGG TTGCGTTCAA CCCGCAGTTC ATCTTCACAT CGTCGCTGGT GACGAATGAT GCGCTGCTGG CGGCGCTCGG CGCGGCGCTT CTCTGGCTGT TCATCAGCAG TCCGCGGTCG CTGCCGCATA CGGCGCTGAT CGGAACGATC CTGGGAATGG CGCTGATCAC CAAACAGAGT GCGCTGCTCT TTCTACCGCT TGCGCTTGCC TGGTGCGCAA CCGGCGGCGT CCGGCAACGA TGCGCAGCCG CGATCCCGGT TCCTGCTGCG CTGCTGGTGG TTACTGGCGT CGCAGGACTG GTTGGCGGTT GGTGGTATGT GCGCAACTGG ATTCTGTACG GCGATCCGCT CGGTTTGCAG GCGTTTCGGG TGGAGTTCAC AACGCAGGCG TTCGAGGTGA CCAACCCGGC AGCCTGGACA GGTGCGCTGA CGACCCTGCA TGAATCGTTC TGGGCGCGCT TCGGCTGGAT GAACCTGCCA GCGCCAGCAT GGACGATGCT GGTCTATACG CTGGTGATCG CGGCAGCGGC TGCCGGGTGG ATACGACGTT CGATCCGACC GCACCAGATA ACGTGCGCAT GGAACTGGCA GCTTGCGGCG TTGCCGCTGC TGGCGTTCCT GTGGGTGGTG AGTTTCGCTC TGACCGCCGG GCTGGTTGCC TGGCAGGGAC GGATGCTCTT TCCTGCGCTG GCAGCGATTG CGATCCTGAT TGCGTGTGGT CTGGGAGTGT GGATCCGTGG CCGTGTACTG ACGCTCACGA TCATTATCGG AATGGCGGGA CTGGCCGCCT GGCTGCCGTT CGGTGTCATT CAACCGGCGT ATCCACGGCA GACCCTTGCG GCAGAAGCGG CGCGCAACTG GGAAGGTATC GACACCTATG CGCGGTTTGC CCGTTCGACC GAACCCGGTG CCGTCATTCG CCGCTGGCGG ATCGACGGAA CGCCGCGCCC TGGCGCAACC ATCGAAGTGG CGCTGCTGTG GCACGCACGC TCACGCCAGG ATCGCGACTG GTGGACATTC GTTCACCTGG TCGATGACAA CCGACGGATT GTCGCCGAGG ATAACCGCGA ACCACGCGAC GGCGCGTACC CGATGTCGCA ATGGGTTGCT GGCGATTGGG TGGAGGCGCG CTACACGCTC GCCATTCCCG CCGATCTGCC GCCCGGATCA TATGCGCTCT GGGTTGGGTT GTGGGACCCG GCGACCGGAC GGCGCGCTGC GTTCTTCGAT GATGATAATG TCTACGACCC TGATAGTGAT CATGTCGTGC TCACCACACT GGTTATCACC 
GGACAATAA
|
Protein sequence | MNASSPLIAL VGLFLALCVV YNAVTPLGEG PDEPGHAAYV FFLAREGRLP VQCSPPCASD VPGSGHHPPL AYLLAMPAAL WLPPTLRTFD LPGNPRFTWA GGDQVNAVAH GSREQWPWDA QVWSWRLARL ASSLAGAATI IFTCLAADAL HRRLNNDPAR GNSDQKALLA AALVAFNPQF IFTSSLVTND ALLAALGAAL LWLFISSPRS LPHTALIGTI LGMALITKQS ALLFLPLALA WCATGGVRQR CAAAIPVPAA LLVVTGVAGL VGGWWYVRNW ILYGDPLGLQ AFRVEFTTQA FEVTNPAAWT GALTTLHESF WARFGWMNLP APAWTMLVYT LVIAAAAAGW IRRSIRPHQI TCAWNWQLAA LPLLAFLWVV SFALTAGLVA WQGRMLFPAL AAIAILIACG LGVWIRGRVL TLTIIIGMAG LAAWLPFGVI QPAYPRQTLA AEAARNWEGI DTYARFARST EPGAVIRRWR IDGTPRPGAT IEVALLWHAR SRQDRDWWTF VHLVDDNRRI VAEDNREPRD GAYPMSQWVA GDWVEARYTL AIPADLPPGS YALWVGLWDP ATGRRAAFFD DDNVYDPDSD HVVLTTLVIT GQ
|
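As a rough check that the protein sequence corresponds to the gene sequence, the snippet below translates DNA codon by codon and computes a GC fraction. It is a minimal sketch, not the pipeline used to produce this record: it hard-codes the standard codon assignments (translation table 11 shares these for elongation; it differs in its permitted start codons, which this sketch ignores), and all helper names are hypothetical.

```python
from itertools import product

# Standard codon assignments in NCBI order (bases T, C, A, G; third base varies fastest).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def clean(seq: str) -> str:
    """Remove the whitespace that separates the 10-base groups and upper-case the result."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate from the first base, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE.get(dna[i:i + 3], "X")  # X marks an unrecognized codon
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in the sequence."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna) if dna else 0.0


if __name__ == "__main__":
    # First three 10-base groups of the gene sequence in this record.
    print(translate("ATGAATGCCT CGTCGCCGCT CATCGCATTG"))
```

Applied to the first three 10-base groups of the gene sequence, `translate` returns MNASSPLIAL, which matches the start of the listed protein sequence. Passing the full gene sequence to `gc_fraction` gives a value that can be compared with the GC content field.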