Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | RoseRS_3821 |
Symbol | |
ID | 5210803 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Roseiflexus sp. RS-1 |
Kingdom | Bacteria |
Replicon accession | NC_009523 |
Strand | + |
Start bp | 4781405 |
End bp | 4784266 |
Gene Length | 2862 bp |
Protein Length | 953 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 640597417 |
Product | hypothetical protein |
Protein accession | YP_001278125 |
Protein GI | 148657920 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 2 |
Fosmid unclonability p-value | 0.00157336 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGCTGATGC GTCGCGCGTT TGCCAGAGTC TTGCAGTTTA TTCTGAGCCT AATGCTGCTC CTCGCGCCTG TGCTGGCAGT GCAATCGGCG CAGGCGCAGA CCCCGCTCCC GCCGCTGGTG TTTGTGGCGC GTAGTCGTCT GGCGACCGGC GATTACCTCT TCCCCCGCGA TGTCGGTCCG GCGGGGCACA TGATTACCGG TATCACAAAA TTCGCTCCCG GCTCGAAATT GCTCATCCGC GATCAGAGCG GGCAATTGCG CGTGCTCGTC GATACCGCGC GACCGGCTGG CGATCCGCTG AACCCGCTGG GACTGCGTGA TGTTCAGTCG CCCGATGTGT CGTTCGATGC GCGCCGCGTC GTCTTCGCGG GAACGTTCGG ACCAGAAACG TTCCGCAATC AGCCCAATGG TCGTCCCGCC TATTCGTGGC GATTGTTCGA GATCGGCGTC GATGGTCGCG GTCTGCGCCA GTTGACTCGC TCCGACCGCG AGATCACCAT CCCTGATGGT CCGGGGAACG CCGAGGCGTA TGCGTTCTAC GATGATCTTT TCCCGGCATA TCTGGCAGAC GGCAGGATCG TGTTCAGTTC GTCGCGCTAT CCGGCGCGTT CCCCCTACGA CGGTCGACGG GCGTTCAATC TGTACATCAT CGACGGGGAT GGCGGCAATA TGCGGCGTCT CACGACCGAA CGCACCGCTG CATTCCATCC GGCGCCGCTC CCCGATGGCA GGATCGTGTT CAGTCGCTGG TGGGTCAACT TCAACCAGCC CAGCGAACGC GGCATCTATA ACCGGATCGA CAATCGCGCC GGCAATGAAA TCGCCCGGGA TCAGAGCGGG CGTCCGATTG TTGTCGAACG GCGCATCCAG GTCACTGCAA CAGCGCAACC GGCACAACCG CCGCAACCTG CTGCACCCCC GCCGACACCA ACCATGCGTT TGCCTTCCTT TGTCGAAAAG ATCGACCCGG CGACCGGCAG CATCGTTCGC CTGACCAAAA CGCCGGCACC GCCAACGCCA ACGCCGCGCG TCCGCCCGAC CGCGACGCCC GTTCCTGCGT CGCAGGGTGG CGCAAGTCGC ACGATTATCG TTGAGCAACC GGTTACCGGG TATCGCCTGC CGGATGGCAC GCTGGTCTAT TCGAACACGA ATGCGACCTT CAACCCGGCG CGTGGGCGTC TGGCGGATGG CTTCCCCATC CGCGACGCGC CCAACACCTG GCATCTGATG GCGGTTGAAG CCGATGGCAG CGGCATGAGC CGTTTTGCCT GGACGCCGCG CTACCCATCG GCGCTGACGA ATGATGGCGG TCTTGATACG TACAATGCAG TGCAACCGGC GGTGGTGCAG TTCGGTGGTG AATTGCTGGT GGCGTACACC ACGCAGCGCG ATCAGACGAT GGCGCATTCG ACGTTGTACA CCGGCATCCG TGTCGCACGC CCCGGTATCG AGAACATGGC GCTGAACACG ACCGAGTCGA TCGCCGGGTA TCGCTGGGAT GACGGAACCA GTTTTCGCCC GCCCTATGCG CTTGCGCCCG CCGGATTGCC CGACGGTCGG ATCATCTTCT CGCAGACGGC GGCAACGACG GCGCCTGCAC GCACCGGCGT CTACACAGAG ACTCGCAATG GTCGCACCAT CACCCTGCGG TTGCAATCGT CGTCACTGCG CTACGAATTG CGCACCATCT ACCCCAACGG CGCACAGAAC GAGAGCGTGC CGCTCCCCGG ACTATCCGAT GAGTATGACG CCATGGAAGC CAGACCAATT GTTGTGCGTC CGGTGGGAGA CGGACCCGGC ATGTGGCGAC TACCGCGTGG TACACCGCCG CCGGTCAGCG ACGATCCGTT GCAGAGTAAT GTGCCGTTTG GGTTGCTCGA CACGTCTGGC AACCCGGCGT ACACCTGGAG CCGCCGCAGC ATCCAGAGCG TGGAACTGGT TGCGGTGCGC AATGCGAATG TCTACGCCAA TCCACCGCTT GAATTCCCGT TTATCAATAA TTCGCCGCCG CCGGGAAGCG TGGCGTTCGC CGATATCTAT ATCGATGCCA ATCAGTTCGG CGGGGCGACG TCGCGCGCGC CGAACCCGGA CGACCAGGCG CGCGCGGTCA AATGGTTGAC GGTGCCGGTG AACCCGGACG GGTCGTTTAT CGCCTCTGCG CCCGCCGATG TGCCCACCTT CATTGTGCTG CGCGACCGGA ACGGGCGCAT TGTGCGCGGC GGCAACCGCC ATACCCTCAG CATCGCCCAG GGCAACTCAG CCGGGCGTCC CGGACAGCCG ATGTTCTGTG TCGGCTGTCA CATGGGGCAC GCCAGCGGCT CGATTGTCAA CCGGCAACTC GCGGAACTGG GCTGGACGAA CATTGCTCCG GCGGCATCCA TTGCCGCCTC TTCATCAGTG GAGAATGGCA GTCCGACACG GATCAACGAC CGTCGCGGGT ATGTTGCCGC GCCGAATGGA ACCTTGATCG ACCGGACGCC GCCGTGGACG GCGAAGGGCG GGGCGGGGCA GTGGATCCGG TTAGAATGGC AATTCCCGAT GGCGATCCTC GAAGTTCGTC TGGTGGGAGC GGAACCGGGT CAGGAAGGAC GCAGCGATGA CTACCAGGTC AGCGGCGAAC TGCGTTTCTA CCTGCGCGGG CAGGAAGTCG CCGTCGCCGC CAGAACCGTC GAGGCGGTTG CGCCCCTCTC GCGTGGCGGA ACGCTTATGC GTCTGGCGCA ACCCATCGCT GCCGACCGGG TCGAGTTCAC CGTCACTGCG GTGCGCGGGA CGCAGCGTGG CGCACCGGCG TCGGCGGCGC TCAGCGAAAT CGAAGTGATC GGGCAGGGCG CGACGCCGGA TGCGTTGGGG GTTGGGCGTT GA
|
Protein sequence | MLMRRAFARV LQFILSLMLL LAPVLAVQSA QAQTPLPPLV FVARSRLATG DYLFPRDVGP AGHMITGITK FAPGSKLLIR DQSGQLRVLV DTARPAGDPL NPLGLRDVQS PDVSFDARRV VFAGTFGPET FRNQPNGRPA YSWRLFEIGV DGRGLRQLTR SDREITIPDG PGNAEAYAFY DDLFPAYLAD GRIVFSSSRY PARSPYDGRR AFNLYIIDGD GGNMRRLTTE RTAAFHPAPL PDGRIVFSRW WVNFNQPSER GIYNRIDNRA GNEIARDQSG RPIVVERRIQ VTATAQPAQP PQPAAPPPTP TMRLPSFVEK IDPATGSIVR LTKTPAPPTP TPRVRPTATP VPASQGGASR TIIVEQPVTG YRLPDGTLVY SNTNATFNPA RGRLADGFPI RDAPNTWHLM AVEADGSGMS RFAWTPRYPS ALTNDGGLDT YNAVQPAVVQ FGGELLVAYT TQRDQTMAHS TLYTGIRVAR PGIENMALNT TESIAGYRWD DGTSFRPPYA LAPAGLPDGR IIFSQTAATT APARTGVYTE TRNGRTITLR LQSSSLRYEL RTIYPNGAQN ESVPLPGLSD EYDAMEARPI VVRPVGDGPG MWRLPRGTPP PVSDDPLQSN VPFGLLDTSG NPAYTWSRRS IQSVELVAVR NANVYANPPL EFPFINNSPP PGSVAFADIY IDANQFGGAT SRAPNPDDQA RAVKWLTVPV NPDGSFIASA PADVPTFIVL RDRNGRIVRG GNRHTLSIAQ GNSAGRPGQP MFCVGCHMGH ASGSIVNRQL AELGWTNIAP AASIAASSSV ENGSPTRIND RRGYVAAPNG TLIDRTPPWT AKGGAGQWIR LEWQFPMAIL EVRLVGAEPG QEGRSDDYQV SGELRFYLRG QEVAVAARTV EAVAPLSRGG TLMRLAQPIA ADRVEFTVTA VRGTQRGAPA SAALSEIEVI GQGATPDALG VGR
|
| |