Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rcas_1976 |
Symbol | |
ID | 5539454 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Roseiflexus castenholzii DSM 13941 |
Kingdom | Bacteria |
Replicon accession | NC_009767 |
Strand | - |
Start bp | 2529902 |
End bp | 2531317 |
Gene Length | 1416 bp |
Protein Length | 471 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 640894111 |
Product | hypothetical protein |
Protein accession | YP_001432082 |
Protein GI | 156741953 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 19 |
Fosmid unclonability p-value | 0.72567 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGCATGC TGGCGCGTTT GCAGGCGGCG ATGATTGTGT TTCTGGCGCT AGTGGCATAT CTAATGCCGG CGCCGGTTCA GGCGCAGACG ACGCTGCCGC AGTGGTACGA ACTCCGCACG CAAAAATTTG CAATTCTGTA TGCCGACGGC GATCTGGCGC GCGCCGAGGA GTATGCAACC TTTGTTGACG AGGTGTATGA CGAGATTACA TCGATCTTCA GCCATGCCGC GCCGACGCCG GTAACGCTGC GCCTCTATCC GACCCGCCGC GTCTACGACG CCGCCAATCC GTTGGCTGCT CCGATTCAGG GGATCATTGC GCATGCTGAT TTTCGCCGCA ATGAGGTGGT GGTCATTCTC GACCAAACCA CCGCTCAGTC TCCTGAAGAG ATCAAGAATA ACGTGCGCCA CGAACTGACC CACATCGTGC TGGCGGAACT CTCGTCGAAC CGCTTGAACG TCGGGTTCCA CGAAGGCATT GCACAGTATG TCGAGCGCCC GACCCCCGAC CTGGAGCGTA AGGCGACGGC GCTGCGGCAG GCGCTCGAAC GCGATGCGCT CCTGCCCTGG AGCGCCCTCG ATGACCGCGA TCAGATTTAT GGCAGTCCAC AGATCGGGTA TCCGCAGACC CTCTCGATTG TCGCGTTTCT GGTCGAACGT TTTTCGTTCG TCAAACTGCG CGAGTTCGTC ACGGTCAGCG CGCGGAGCAG CGGGTATCGT TCAGCGCTCG AACGGACCTA TGGCATGCCT TCGACCGACC TCGAGCGCAT GTGGCGCGAA TGGCTGCCTT CGTATCTCGA TGGCGGCTTT CGCCACAATG CGCTGACGGA GTATGACCTG ACCCCCATTG AAACGCTGAT CGCCGATGGT CGCTACGCAG AAGCCAAACG CGAACTCGAA CTGGCAATTC CCTGGTTGCG CAATACGCAG CAACATGATG TGCTGGCGCG CGCGCAGGAC TTGCTGGCGC AGAGCGAAGC CGGTCTGTAC GCCGAAGACC TGGCGCAGCA GACGCGCGCG GCGCTCGAGG CGCACGACTA TGCGACTGCG GAGAACCTGG CGAAGCGCGC GCTCGATGCC TATATGACGC TCGAGAACCA GAGCCGTATC GAAACGCTGA CCGTCTATGC AACCATCGCC AGGCGCGGGT TGCGCGCAAC GGAGCTGCTC GAACAGGCAA CCGCGCTCGC CGGCGACTGG CGAACCTTTG CCGATGCCCG CATCATTGCC GATCAGGCTG CCGCTGAGTT TCTTTCGCTT GGCAATCAGG AGAACGCAGC GCGCGCGTTG ACGTTGCGCG CCGAGATTGA TCGCGTGCAG AGTCTTGCCG GCATCGCCTT GCTTATCATC GGGTTGGCGG GTATTGCGGT TGGGTTTACC CGCCGCCTGA TCGTTCGTGA AGCGGAGGTG TGGTGA
|
Protein sequence | MRMLARLQAA MIVFLALVAY LMPAPVQAQT TLPQWYELRT QKFAILYADG DLARAEEYAT FVDEVYDEIT SIFSHAAPTP VTLRLYPTRR VYDAANPLAA PIQGIIAHAD FRRNEVVVIL DQTTAQSPEE IKNNVRHELT HIVLAELSSN RLNVGFHEGI AQYVERPTPD LERKATALRQ ALERDALLPW SALDDRDQIY GSPQIGYPQT LSIVAFLVER FSFVKLREFV TVSARSSGYR SALERTYGMP STDLERMWRE WLPSYLDGGF RHNALTEYDL TPIETLIADG RYAEAKRELE LAIPWLRNTQ QHDVLARAQD LLAQSEAGLY AEDLAQQTRA ALEAHDYATA ENLAKRALDA YMTLENQSRI ETLTVYATIA RRGLRATELL EQATALAGDW RTFADARIIA DQAAAEFLSL GNQENAARAL TLRAEIDRVQ SLAGIALLII GLAGIAVGFT RRLIVREAEV W
|
| |