Gene Rcas_2817 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRcas_2817 
Symbol 
ID5540304 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRoseiflexus castenholzii DSM 13941 
KingdomBacteria 
Replicon accessionNC_009767 
Strand
Start bp3652855 
End bp3654636 
Gene Length1782 bp 
Protein Length593 aa 
Translation table11 
GC content57% 
IMG OID640894944 
ProductRhs element Vgr protein 
Protein accessionYP_001432906 
Protein GI156742777 
COG category[S] Function unknown 
COG ID[COG3501] Uncharacterized protein conserved in bacteria 
TIGRFAM ID[TIGR01646] Rhs element Vgr protein 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones44 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGTAACA ATAACAATGA TCGGCATGTT TCCGATTTTT ACCTGAAACT CGACGGCGCC 
GACGCGCCGC TGGAACTGGT GCGGGACATC CTGGACATCA CGATCGAGAA CAGTCTGCAC
CTGCCGGATG TCGCTACGCT CGTCATCAAC GACCGCCGTT TGACGTGGGT TGATGACAGC
CGCCTGGCGC CGGGCAAGAC GCTTGTGGTG AATGTCCGGC GCAATCGCGC AACCGAGACG
ATCTTCGATG GCGAGATTGT CGAAGTTGAA CCGCACTTCG ATGTAGAAGG AGCGCGGGTT
GTGGTTCGCG CCTTTGATCG TCTTCACCGC CTGGCGCGCG GTCGTTACGC CCGCACCTTT
CTGAATGTGA CCGATAGCGA CGTGATCCAA CAGATTGCCG GTGAAGTGGG GTTGCAGGCG
CAGGTGGATG CGACGAGCCG GGTTCACGAG TATCTTGTGC AGTGGAACCA GACAAACCTG
GAATTTCTGC GCGAACGGGC GGCGGCGCTC GGCTATCTGT TGTATGTCGA TGGACGGAAA
CTCTTCTGTG TCAAGCCCCC TTCCGCCAGA GCACCGGTCG AGTTGAAATG GGGTGAGGAT
CTGAGCGCCT TTCACCCGCG CTTGAGCACG ATCGATCAGG TGAATGAGGT GAATGTGCGC
GGTTGGGACC CACAAAAAAA AGAAGTGGTG ATCGGTCAGG GAACGACTGG CGCGGTGACG
CCTGCGATTG GCGTCCCCGA CAAAGGCGGC GCGCTGGCGA AGAAAGCGTT CAGCATCACG
GCGAAAGATT TGCTGACCGA GTCCGCCGTG CGCTCGCAAT CGGAAGCGGA ACAGATCGCA
AAAGCGGTGC TCAATCAGCA TGAAAGCCGG TGTATCGAAG CGGATGGCAC GGCTGCCGGC
AACCCGAAGA TCAAGGCGGG AGCAGCGGTC AAAATAACCA ACATTGGCAT TCGCTTCAGC
GGCGCCTATG TTGTCACCAG CGCGACGCAT CGCTATAACG CTGCTGGTTA TGTCACCGAC
TTTTCGGTGT CGGGGGTCAA CCCGGATGCT CTGCTGCACC TGCTGCAACC GGAAACGCCA
CGCCTGAGGA TTGAAGGACT GGTTATCGGG ATTGTGACCG ACAACAATGA TCCGGACAAC
CTGGGGCGTG TCAAGGTGAA GTTTCCGACG CTTTCGGATC AACAGAGCCA TTGGGCGCGA
GTGGTCAGTG TCGGCGCAGG CGCCAACCGT GGGATCGAGT TTCTGCCGGA AGTGAACGAT
GAAGTGCTGG TGGGATTCGA GGCTGGCGAT ATGCGCGCCG TGTATGTGAT CGGTGGTTTG
TGGAACGGCA AGGATGCTCC GCCGAAGAAG ACCGGCGAGA TCGTCAAAGG CGGGAAAGTT
GAGCAACGTG TGGTCAAATC GCGCTCCGGT CATGTGATTA CACTGGATGA TAGCGATAGT
GCGCCGTCGA TTACCATCGA AGACAAGAGT GGCAATATGA TCAAACTCGA CAGCAAGAAG
AACGAGTTGA CGATCAAAGT CAAAGGGAAT GGCACGATCA GCGCCGATGG CAACCTGACG
ATCCAGGCAA AGGGCAAGAT TGACATCAAA TCGCAGCAGG CAATGGCTAT CGAAGGCGCC
ACCGGTCTCG ATCTGAAGTC GAATGCAAAT GCCTCCTTGC AGGCGAATGC CAGCCTCGAT
CTGAAATCCA ATGCTACGGC CTCCTTGCAG GCGAATGCGA CGCTCGATGT GAAATCGTCG
GCGATTCTGA CGATCCAGGG AACGCTGGTC AAAATCAACT GA
 
Protein sequence
MSNNNNDRHV SDFYLKLDGA DAPLELVRDI LDITIENSLH LPDVATLVIN DRRLTWVDDS 
RLAPGKTLVV NVRRNRATET IFDGEIVEVE PHFDVEGARV VVRAFDRLHR LARGRYARTF
LNVTDSDVIQ QIAGEVGLQA QVDATSRVHE YLVQWNQTNL EFLRERAAAL GYLLYVDGRK
LFCVKPPSAR APVELKWGED LSAFHPRLST IDQVNEVNVR GWDPQKKEVV IGQGTTGAVT
PAIGVPDKGG ALAKKAFSIT AKDLLTESAV RSQSEAEQIA KAVLNQHESR CIEADGTAAG
NPKIKAGAAV KITNIGIRFS GAYVVTSATH RYNAAGYVTD FSVSGVNPDA LLHLLQPETP
RLRIEGLVIG IVTDNNDPDN LGRVKVKFPT LSDQQSHWAR VVSVGAGANR GIEFLPEVND
EVLVGFEAGD MRAVYVIGGL WNGKDAPPKK TGEIVKGGKV EQRVVKSRSG HVITLDDSDS
APSITIEDKS GNMIKLDSKK NELTIKVKGN GTISADGNLT IQAKGKIDIK SQQAMAIEGA
TGLDLKSNAN ASLQANASLD LKSNATASLQ ANATLDVKSS AILTIQGTLV KIN