Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rcas_2817 |
Symbol | |
ID | 5540304 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Roseiflexus castenholzii DSM 13941 |
Kingdom | Bacteria |
Replicon accession | NC_009767 |
Strand | - |
Start bp | 3652855 |
End bp | 3654636 |
Gene Length | 1782 bp |
Protein Length | 593 aa |
Translation table | 11 |
GC content | 57% |
IMG OID | 640894944 |
Product | Rhs element Vgr protein |
Protein accession | YP_001432906 |
Protein GI | 156742777 |
COG category | [S] Function unknown |
COG ID | [COG3501] Uncharacterized protein conserved in bacteria |
TIGRFAM ID | [TIGR01646] Rhs element Vgr protein |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 44 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGTAACA ATAACAATGA TCGGCATGTT TCCGATTTTT ACCTGAAACT CGACGGCGCC GACGCGCCGC TGGAACTGGT GCGGGACATC CTGGACATCA CGATCGAGAA CAGTCTGCAC CTGCCGGATG TCGCTACGCT CGTCATCAAC GACCGCCGTT TGACGTGGGT TGATGACAGC CGCCTGGCGC CGGGCAAGAC GCTTGTGGTG AATGTCCGGC GCAATCGCGC AACCGAGACG ATCTTCGATG GCGAGATTGT CGAAGTTGAA CCGCACTTCG ATGTAGAAGG AGCGCGGGTT GTGGTTCGCG CCTTTGATCG TCTTCACCGC CTGGCGCGCG GTCGTTACGC CCGCACCTTT CTGAATGTGA CCGATAGCGA CGTGATCCAA CAGATTGCCG GTGAAGTGGG GTTGCAGGCG CAGGTGGATG CGACGAGCCG GGTTCACGAG TATCTTGTGC AGTGGAACCA GACAAACCTG GAATTTCTGC GCGAACGGGC GGCGGCGCTC GGCTATCTGT TGTATGTCGA TGGACGGAAA CTCTTCTGTG TCAAGCCCCC TTCCGCCAGA GCACCGGTCG AGTTGAAATG GGGTGAGGAT CTGAGCGCCT TTCACCCGCG CTTGAGCACG ATCGATCAGG TGAATGAGGT GAATGTGCGC GGTTGGGACC CACAAAAAAA AGAAGTGGTG ATCGGTCAGG GAACGACTGG CGCGGTGACG CCTGCGATTG GCGTCCCCGA CAAAGGCGGC GCGCTGGCGA AGAAAGCGTT CAGCATCACG GCGAAAGATT TGCTGACCGA GTCCGCCGTG CGCTCGCAAT CGGAAGCGGA ACAGATCGCA AAAGCGGTGC TCAATCAGCA TGAAAGCCGG TGTATCGAAG CGGATGGCAC GGCTGCCGGC AACCCGAAGA TCAAGGCGGG AGCAGCGGTC AAAATAACCA ACATTGGCAT TCGCTTCAGC GGCGCCTATG TTGTCACCAG CGCGACGCAT CGCTATAACG CTGCTGGTTA TGTCACCGAC TTTTCGGTGT CGGGGGTCAA CCCGGATGCT CTGCTGCACC TGCTGCAACC GGAAACGCCA CGCCTGAGGA TTGAAGGACT GGTTATCGGG ATTGTGACCG ACAACAATGA TCCGGACAAC CTGGGGCGTG TCAAGGTGAA GTTTCCGACG CTTTCGGATC AACAGAGCCA TTGGGCGCGA GTGGTCAGTG TCGGCGCAGG CGCCAACCGT GGGATCGAGT TTCTGCCGGA AGTGAACGAT GAAGTGCTGG TGGGATTCGA GGCTGGCGAT ATGCGCGCCG TGTATGTGAT CGGTGGTTTG TGGAACGGCA AGGATGCTCC GCCGAAGAAG ACCGGCGAGA TCGTCAAAGG CGGGAAAGTT GAGCAACGTG TGGTCAAATC GCGCTCCGGT CATGTGATTA CACTGGATGA TAGCGATAGT GCGCCGTCGA TTACCATCGA AGACAAGAGT GGCAATATGA TCAAACTCGA CAGCAAGAAG AACGAGTTGA CGATCAAAGT CAAAGGGAAT GGCACGATCA GCGCCGATGG CAACCTGACG ATCCAGGCAA AGGGCAAGAT TGACATCAAA TCGCAGCAGG CAATGGCTAT CGAAGGCGCC ACCGGTCTCG ATCTGAAGTC GAATGCAAAT GCCTCCTTGC AGGCGAATGC CAGCCTCGAT CTGAAATCCA ATGCTACGGC CTCCTTGCAG GCGAATGCGA CGCTCGATGT GAAATCGTCG GCGATTCTGA CGATCCAGGG AACGCTGGTC AAAATCAACT GA
|
Protein sequence | MSNNNNDRHV SDFYLKLDGA DAPLELVRDI LDITIENSLH LPDVATLVIN DRRLTWVDDS RLAPGKTLVV NVRRNRATET IFDGEIVEVE PHFDVEGARV VVRAFDRLHR LARGRYARTF LNVTDSDVIQ QIAGEVGLQA QVDATSRVHE YLVQWNQTNL EFLRERAAAL GYLLYVDGRK LFCVKPPSAR APVELKWGED LSAFHPRLST IDQVNEVNVR GWDPQKKEVV IGQGTTGAVT PAIGVPDKGG ALAKKAFSIT AKDLLTESAV RSQSEAEQIA KAVLNQHESR CIEADGTAAG NPKIKAGAAV KITNIGIRFS GAYVVTSATH RYNAAGYVTD FSVSGVNPDA LLHLLQPETP RLRIEGLVIG IVTDNNDPDN LGRVKVKFPT LSDQQSHWAR VVSVGAGANR GIEFLPEVND EVLVGFEAGD MRAVYVIGGL WNGKDAPPKK TGEIVKGGKV EQRVVKSRSG HVITLDDSDS APSITIEDKS GNMIKLDSKK NELTIKVKGN GTISADGNLT IQAKGKIDIK SQQAMAIEGA TGLDLKSNAN ASLQANASLD LKSNATASLQ ANATLDVKSS AILTIQGTLV KIN
|
| |