Gene Information |
Locus tag | Rcas_2158 |
Symbol | |
ID | 5539638 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Roseiflexus castenholzii DSM 13941 |
Kingdom | Bacteria |
Replicon accession | NC_009767 |
Strand | + |
Start bp | 2771944 |
End bp | 2773869 |
Gene Length | 1926 bp |
Protein Length | 641 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 640894291 |
Product | hypothetical protein |
Protein accession | YP_001432260 |
Protein GI | 156742131 |
COG category | |
COG ID | |
TIGRFAM ID | |
| |
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 7 |
Fosmid unclonability p-value | 0.000475979 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGCCATCAA AACATCTGTC TCGCCGCGCG GCTCTGTCTG CCGCTGCGAC ATTCGCCGCG CTTGCGTCGA TCCCTTCCGC GTTTGCGCAG GAAGGCGTAG CGCTTGTTGC CCGTCGCGCG GTGGTTGCGC CGCTCCAACC GGTCGAGGTT GACGTGCGCA TTCCCGGATA CACCGGTCCT GCTGCCGTTA TCCTGTTCGA CAGCCGCAAA CGGTTCGCCG GCGTTGCCGA AGGACACGTC GAGAACGGCA TTGGAACCCT GATCGCCGTG CCGCGCGGCG CGCCCGGTGC GCAGTGGGTG GCGTTGTTCG CCTCTGGTCG ATTCGTCGCC GCCAATCCGG CGCTGTTCTC ACTCGATGCG CGCACCGAAA TCTGGACCGG GCAGGAACGC TTCGACCGGT TTGTGCCGAA TGCGGCTGCC ATATTTGCTG CTGCCACGCT GTCGTACACC TTCAATGGCG CATTCTTTCA CGGTTACCGT TCGCCCGACA GCCCGCTGAT CTGGTTGCGC GACCACGTGT ACGCACAACG CGGCGCGCGC TACTTCGACG CCGATCTCAA AACCGCCTTT GACGACTTTC GCCGCTACCA GCAACCGGAT GGCAGTTTCC CCGATTTCCT GCCGCGCCCT CCATGGACTG ATCGCGCGCT CCGGGTGCCG GTCGAAGCCG ACGTTGAGTA CCTCTACGTC CAGGGAGTGT ACGAAGCCTG GCAGGCGACT GGCGACGATG CCTGGATGCG TAGCCACCTG GAACCGATGC GGCGCGCCGT GACGTACTCG TTGCAGCACC CGCTGCGCTG GGACGCCGAA CGTGGTCTGA TCAAGCGCCC TTTCACCATT GACACCTGGG ATTTCGAGTA CGGTTCGACG ACGACCGACC CGGAAACGGG CAAGCCTGCG CCGCGCCACT GGATCGACGA CAAGACGATC TGGGGCGTTT TTCACGGCGA CAACACCGGC ATGGCACAGG CGCTGACGAT GCTGGCGCGG ATGGAAGAGC GCGTCGGCGA TGCAACCCTG GCGCGTGTCT GGCGTGATGT TGCCGCCGGT CTGATACGCA ACCTGAATGC GCTCAGTTGG AACGGGCGCT TCTTCCGCCA TCATGTCCCC TTTCAGTCCT TCGACATCCC CGGCGTTGAT CGGGAGCGGC AGTTGAGCCT CTCGAACGCC TATGCGCTCA ACCGTGGCGT GCTCACCGTT CAGCAGGGGC AGGCGATCAT CGACGAATAT ATCGAACGCT CGAAGACTAT GCGCGCATTC GCGGAATGGT TTAGCATCGA TCCGCCCTTT CCACCGGGAA GTTTCGGGCT TGCGGGGCGC AGCGGCGAAC TCCCCGGCGC GTATGTCAAC GGCGGCATCA TGCCGATCAC CGGCGGCGAA CTGGCGCGCG GCGCATTTCG CTACGGCAAC GAAACCTATG GCTTCGCCAT TCTCGAACAC TACTGGCTGC GCATGCTCAG TCGCGGGCGC ACCTTTCTCT GGTACCATCC CGACGGCGCA GAAGGGGTCG GCTCCGATGA CACCATTCCG ACCGATGCGT GGGGGACGGC TGCGATGTTT ACTGCGCTGA TCGAGGGCGC TGCCGGCATC GAGGATCAGG GCATCGCCAT GCGCGATGTA ATCGTCAGCC CTCGCTGGGG CGCCGCTGGT CTGACCTCGG CGTATGTCTC GGCGCGCTAC CCGGCGAGCG ACGGGTATCT GGCGTATGCC TGGCGTCAGC ATCCGCGCCG TATCGACCTC GACCTGAGCG GGGTCTTTGA TCGCGCGCGA GTGAGGGTGC TGCTGCCGCA GGACACGCCG 
GGATCGGTCG AAGCGCTGGT CAACGGTGTG CCCGTGCCGC ATACCATCGA AACCCTGCGC GCCAGCCGGT ATGTCATCAT CGACGTAGCG GATATGGCAG TTGTTCAGGT ACAGGTGCGC TGGTAG
|
Protein sequence | MPSKHLSRRA ALSAAATFAA LASIPSAFAQ EGVALVARRA VVAPLQPVEV DVRIPGYTGP AAVILFDSRK RFAGVAEGHV ENGIGTLIAV PRGAPGAQWV ALFASGRFVA ANPALFSLDA RTEIWTGQER FDRFVPNAAA IFAAATLSYT FNGAFFHGYR SPDSPLIWLR DHVYAQRGAR YFDADLKTAF DDFRRYQQPD GSFPDFLPRP PWTDRALRVP VEADVEYLYV QGVYEAWQAT GDDAWMRSHL EPMRRAVTYS LQHPLRWDAE RGLIKRPFTI DTWDFEYGST TTDPETGKPA PRHWIDDKTI WGVFHGDNTG MAQALTMLAR MEERVGDATL ARVWRDVAAG LIRNLNALSW NGRFFRHHVP FQSFDIPGVD RERQLSLSNA YALNRGVLTV QQGQAIIDEY IERSKTMRAF AEWFSIDPPF PPGSFGLAGR SGELPGAYVN GGIMPITGGE LARGAFRYGN ETYGFAILEH YWLRMLSRGR TFLWYHPDGA EGVGSDDTIP TDAWGTAAMF TALIEGAAGI EDQGIAMRDV IVSPRWGAAG LTSAYVSARY PASDGYLAYA WRQHPRRIDL DLSGVFDRAR VRVLLPQDTP GSVEALVNGV PVPHTIETLR ASRYVIIDVA DMAVVQVQVR W
|