Gene RoseRS_3108 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRoseRS_3108 
Symbol 
ID5210076 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRoseiflexus sp. RS-1 
KingdomBacteria 
Replicon accessionNC_009523 
Strand
Start bp3900788 
End bp3901876 
Gene Length1089 bp 
Protein Length362 aa 
Translation table11 
GC content61% 
IMG OID640596699 
Productcobalamin synthesis protein, P47K 
Protein accessionYP_001277421 
Protein GI148657216 
COG category[R] General function prediction only 
COG ID[COG0523] Putative GTPases (G3E family) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.190819 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.352092 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCTATAG AAGCGCCGCT CCGCCCTGTG CCGGTCACTA TTCTGACCGG TTTCCTCGGC 
GCCGGTAAAA CAACGCTGCT CAACCGGATA CTCCATGCCG ACCACGGATT GAGAGTTGCT
GTGCTGGTAA ACGATTTCGG CAGCATCAAC ATCGATACGC AACTCGTTGT CGGCGTCGAG
GGCGAGACGA TCTCGCTCGC CAACGGGTGC ATCTGCTGTT CGATCCGCGG CGACCTGCTC
AAAACGGCGC TGCTCCTGCT CGACCGCGAG GAACCGCCGG AGTATCTGAT CATCGAGGCA
AGCGGCGTGA GCGACCCGTG GGCGGTGGCG GAAACGTTCG ACCTTCCCGA ACTGCGCCCT
TTCTTCCATC TTGATTCGGT CATCACCGTC GTCGACGTGG AATATGTTCG CAAGCAACAG
TACTACGAGG ACCTGATCAT CCATCAGATC AGCGCAGCCG ACGTGGTTGT ATTGAGCAAA
GTCGATCTGG TGAGCGCGGA GCAGCGCGCC GACGTGGAGC AGTGGGTGCG CCAGATCGTG
CCCCGCGCCC GTATTCTCCC CGCCATCCAC GGCGATGTGC CGATGAGCCT GCTGCTGGGC
GTCGGGCGCT ACAAACTGGA CGTTCAACCG CCGGTCGTGT CCCGCCTCCA TGCCCACGAC
CACGACCACG AGCACGGTCC GCACTGCGAC CACGACCACG ACCACGACCA CGAGCACCAC
CACGACCATA CAACTGAGTT CAGCTCGTGG AGTTATGTCA ATCATCGCCC ATTTGCGCTG
AAAAAGTTGC GTACCGTTCT GCAAGACCTG CCCGAAACCA TCTTTCGCGC CAAGGGGATC
ATCTATCTTG CAGAAACACC GGGCCGCCGC GCCATTCTCC AGATCGTCGG CGTGCGGATT
ACGGTCTCGG TTGGAGAACC GTGGGGCGAC ACGCCTCCCG GCACGCAGAT GGTCTTTCTC
GGAATACCCG GCGGATTGAA CGGCGATGCG CTTCAGAAGG CCTTCGACTC CTGTCTGACC
GACGTGCCGG CACAGGAGCA GGTAGTGGGG CAACAATCGT GGTTCCGCCG CGTGTTCGGC
GGAGCGTGA
 
Protein sequence
MSIEAPLRPV PVTILTGFLG AGKTTLLNRI LHADHGLRVA VLVNDFGSIN IDTQLVVGVE 
GETISLANGC ICCSIRGDLL KTALLLLDRE EPPEYLIIEA SGVSDPWAVA ETFDLPELRP
FFHLDSVITV VDVEYVRKQQ YYEDLIIHQI SAADVVVLSK VDLVSAEQRA DVEQWVRQIV
PRARILPAIH GDVPMSLLLG VGRYKLDVQP PVVSRLHAHD HDHEHGPHCD HDHDHDHEHH
HDHTTEFSSW SYVNHRPFAL KKLRTVLQDL PETIFRAKGI IYLAETPGRR AILQIVGVRI
TVSVGEPWGD TPPGTQMVFL GIPGGLNGDA LQKAFDSCLT DVPAQEQVVG QQSWFRRVFG
GA