Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rcas_0751 |
Symbol | |
ID | 5538217 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Roseiflexus castenholzii DSM 13941 |
Kingdom | Bacteria |
Replicon accession | NC_009767 |
Strand | + |
Start bp | 981311 |
End bp | 982459 |
Gene Length | 1149 bp |
Protein Length | 382 aa |
Translation table | 11 |
GC content | 58% |
IMG OID | 640892907 |
Product | hypothetical protein |
Protein accession | YP_001430890 |
Protein GI | 156740761 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 2 |
Plasmid unclonability p-value | 0.00031079 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 0.110422 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAGCGCA ATCGTGAAGT GGGCGGGCGA TTGCCTTATA GTGAGCCGCT TTTGCATACG CTTCTCCGTC TGCTCATTGG CGGAGGCGTT CTCTTGTGTG GCTTGCTTCT GGCGCAGCGC CTGCCGAACC CTGATCTATT CGGCGATTAT GTCGCTGCAT GCGCCTGGTG GCAACGTCTT CCCGAACACC TCTGGCTGGC GGGTCTAAGC AGGTGCGATG CAAACGTGGA TTACAATGCG TTTGGTCCTG GTGCGCATCC TCCCTTCTCA ACGGTCTTTT TCCTTCCGCT TGGCTTCCTC GCCTGGACCG ATGCTCGCTT AGCATGGCTT ATCATCAGTG GCGCGTGTCT GATAGGTGTC TGGCACTACT ACCGCGTTCC TGTCAGTGTC TGCGCTGCAA CGGCGCTTTT TGGCGTCTTC GGGTTGTATC GCGGAACGAT GGAGCCTTTT CTTTTCGCGC TGATGATGGT CGCGCTGTCA CAGGAAGAAG AGCGTCCGCT CTTCTCTGCC GCGCTGATCG GTCTGGCTGC GGCGATCAAG GTCTATCCGG TCCTGATGCT GGCGGCGCTG GTCATTGCGC GTCGTCTCAA TGCCTTGATC GCAGGCATTG TTACCGGCGG ACTTGCGACG GCCGCCGGGG ATTTGGTGCT GGGAATCGGG AAAACCGGCG CCTGGATGGG GCATATGACT CCTAATGCCC TGGCATGGCG GATTAATCCG GACAATCTTT CGCTGGTCCG CATTGCAGGG GACTTCGTTC CGCAACTCTC GCCGTTGGTG GTGGCAGTCG CTCTCTTTGG TGCGGCGGTG GCGCTGCTCA TCAACGCACC GCATGGACAG GTGCGGATGC ACACTCTCGT ACCGACCACT CTGCTTGTGA CGCCATTGGT ATGGAGCCAC TATATTGTTC ATACAGGTCT GCTTCAGTTG ACGCGCCTTG AGCAGGTGTT ACTATTCGCG GGCAGTGGAT TGATCTTTTT GGGTATACTG GGCATCTTCC CGTTCCAGAG CGCTGCCATT GCATACGGAC CGGTGCTCGC AGCACTGGTG TTGATCTGGC ATCGCGCATG GCGATCTGGC ACCGAACTTT CTGTTCGGAA GAAACTGTCA GGCGCCGCCC CTTCCGAATC CTCCATACCC AACTGTTGA
|
Protein sequence | MKRNREVGGR LPYSEPLLHT LLRLLIGGGV LLCGLLLAQR LPNPDLFGDY VAACAWWQRL PEHLWLAGLS RCDANVDYNA FGPGAHPPFS TVFFLPLGFL AWTDARLAWL IISGACLIGV WHYYRVPVSV CAATALFGVF GLYRGTMEPF LFALMMVALS QEEERPLFSA ALIGLAAAIK VYPVLMLAAL VIARRLNALI AGIVTGGLAT AAGDLVLGIG KTGAWMGHMT PNALAWRINP DNLSLVRIAG DFVPQLSPLV VAVALFGAAV ALLINAPHGQ VRMHTLVPTT LLVTPLVWSH YIVHTGLLQL TRLEQVLLFA GSGLIFLGIL GIFPFQSAAI AYGPVLAALV LIWHRAWRSG TELSVRKKLS GAAPSESSIP NC
|
| |