Gene Information |
Locus tag | Rcas_2151 |
Symbol | |
ID | 5539631 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Roseiflexus castenholzii DSM 13941 |
Kingdom | Bacteria |
Replicon accession | NC_009767 |
Strand | - |
Start bp | 2761604 |
End bp | 2762695 |
Gene Length | 1092 bp |
Protein Length | 363 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 640894284 |
Product | hypothetical protein |
Protein accession | YP_001432253 |
Protein GI | 156742124 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
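The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive (the record does not state this, but the numbers are consistent with it) and that the gene length includes the stop codon. The function names are illustrative and not part of any database API.

```python
# Values copied from the Gene Information table above.
start_bp = 2761604
end_bp = 2762695
gene_length_bp = 1092
protein_length_aa = 363


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming one trailing stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


print(span_length(start_bp, end_bp))            # 1092
print(expected_protein_length(gene_length_bp))  # 363
```

Both results agree with the Gene Length and Protein Length rows, which is what one would expect for an unspliced CDS.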
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.00896041 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | 10 |
Fosmid unclonability p-value | 0.00536287 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGATCACAA CCCTCCCTGA ACAAATCGCC GCTCTGCCGC CGACCAATCG TGCGCGGTGC GAACGTCTCT TCTTTGTACA GCGCGTCGAG GGTCAGGCGG TCATTCCCCG CGAAATGGAA GCATGGGTTG CCAGCAGTTT CGGCGATATT GCTGGAGTTC GTCGACAAAC GATTATCCGC GTCGTCAATC GCTTCACCCT CGAAGGAACC CTGTTCAATC CGCTGCGCGC GCTGCGACCC GCAGGAAAGA CAACCAGCGA CGCCGACCTG CACGCCTGGA TCGAATCGGA ACTGCGTGAT CGCGACTTCT TCGCCCAGCC GCTCCAGGCG ACCAGCGCCG ACACCTTCGG GCGCATCCGC GGTCGCTTTT GCGTGACTGC ATCGAACATT GCCAAATACG ACGGATGGCA TGGTCTGGTG GTGTTCGATG AACCGCACCC ACTCCACTTC AACCGCGACC AGTTTGCCGA TTACCTGGAT GTGGCGCTGC GCTGGCTCGA CGCAGCGCAC CGGTGCGACC CACAGGCGAT TTACCCGATG ATTACCTGGA ACTGCCTGCC CCGCAGCGGC GCCACTATCG CCCATGGGCA CATGCAGATG TCGCTGGCGC GTGCCATGCA CTATACCAGA CCTGAACTCT GGCGTCGCGC CGCACTCCAG TACGGTGACA TTCCGCGCTA CGTCTCCGAT CTCATTGCGG TGCATGCCGA TCTGAATCTG CTGATTGCCG ATACGCCTGC GGGTCATGTA TTTGCCCATT TGACGCCGCT GCGCAACCGC GAGATTGTCG CTCTGCTGTC ACATCGGGCG GATGCAGCAC TGCTCGCCGA CCGCTTGACC GACCTGATCT ATCCCGTCCT GCGTGCTCTG ATCGATCATC ACCACGTGCG CGCCTTCAAT GTTGGCATTG CGCTGCCGCC ATTCACCGAC AAAACCGGCG CCTGGAGCGG CATGCCGGTG ATCGCGCGTG TCGCCGACCG CGGACCGGCG TTGAGTATCC ACAACGACTG GGGTGCAATG GAACTGTTCG CCACCGGGTG CGTCACGGTT GATCCGTTCG AGGTGGCGAC GGCGCTGAAG CGAGCAGGGT GA
|
Protein sequence | MITTLPEQIA ALPPTNRARC ERLFFVQRVE GQAVIPREME AWVASSFGDI AGVRRQTIIR VVNRFTLEGT LFNPLRALRP AGKTTSDADL HAWIESELRD RDFFAQPLQA TSADTFGRIR GRFCVTASNI AKYDGWHGLV VFDEPHPLHF NRDQFADYLD VALRWLDAAH RCDPQAIYPM ITWNCLPRSG ATIAHGHMQM SLARAMHYTR PELWRRAALQ YGDIPRYVSD LIAVHADLNL LIADTPAGHV FAHLTPLRNR EIVALLSHRA DAALLADRLT DLIYPVLRAL IDHHHVRAFN VGIALPPFTD KTGAWSGMPV IARVADRGPA LSIHNDWGAM ELFATGCVTV DPFEVATALK RAG
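The relationship between the two sequence rows can be illustrated with a short translation sketch. It assumes the gene sequence is already given in the coding orientation (it starts with ATG and ends with TGA, so no reverse complement is applied even though the strand is "-"), and it uses the codon assignments shared by the standard code and translation table 11. It does not handle the alternative start codons that table 11 allows, nor ambiguous bases. The helper names `clean` and `translate` are hypothetical.

```python
from itertools import product

# Codon-to-amino-acid assignments in TCAG order (same for tables 1 and 11).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces between 10-character blocks and upper-case the result."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence shown above.
print(translate("ATGATCACAA CCCTCCCTGA ACAAATCGCC"))  # MITTLPEQIA
```

Passing the full gene sequence to `translate` and comparing the result with `clean` applied to the protein sequence row is one way to check that the two rows are consistent.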