Gene Rcas_2945 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRcas_2945 
Symbol 
ID5540435 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRoseiflexus castenholzii DSM 13941 
KingdomBacteria 
Replicon accessionNC_009767 
Strand
Start bp3819276 
End bp3820496 
Gene Length1221 bp 
Protein Length406 aa 
Translation table11 
GC content61% 
IMG OID640895065 
Productcobalamin synthesis protein P47K 
Protein accessionYP_001433024 
Protein GI156742895 
COG category[R] General function prediction only 
COG ID[COG0523] Putative GTPases (G3E family) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.879984 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.0065134 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGGCGCAAC ATACTGCGCA ACCGTTGCCG GTAACGGTTC TGTCGGGGTT TCTGGGCAGC 
GGTAAAACGA CCCTGCTCAA CCATGTGCTT GCCAATCGCG AAGGTCTGCG CGTGGCAGTC
ATCGTCAACG ACATGAGCGA GGTCAACATC GACGCCCGGC TGGTGCGCAG CGGCGGGGCG
GCCCTCAGTC GCACCGAAGA GCGCCTGATC GAGATGACCA ATGGGTGCAT CTGCTGCACG
CTGCGCGAGG ATTTGTTGGT TGAGGCGGCG CGCCTGGCGC GTGAAGGGCG CTTCGATTAT
CTGCTCATCG AGTCGACCGG CATCTCCGAG CCGCTGCCGG TGGCGGAGAC GTTCACGTTC
GCGGATGAAA CCGGCGTCAG CCTGGCGGAA CTGGCGCGGC TGGATACGAT GGTGACGGTC
GTTGATGCGT TCAATTTTCC GCAGGATTTG TGCTCGACCG ACGACCTGCG TGATCGGAAC
ATGGCTGCCG ACGACGATGA TGAACGGTCG GTTGTTGATT TGTTGATCGA TCAGGTTGAG
TTCGCCGATG TTCTGGTGCT GAACAAGATC GATCTGGTCG ATCCCGATGT GGTGGATCAA
CTGGAAGCGC TTCTGCGCAA ACTGAACCCC GATGCCCGCA TTGTGCGCGC GTCGTTTGGG
CGTGTGCCGC TGCGCGAGAT ATTGAATACC GGTCGCTTCA ATTTTGAGCG CGCGGCGCAG
GCGCTTGGCT GGCTTAAGGA ACTTCGCGGC GAACATACGC CGGAAACCGA GGAGTATGGC
ATTTCGAGTT TTGTCTATCG CGCTCGACGA CCGTTTCATC CTCAACGTTT CTGGGACCTC
ATTCACGATG AGTGGCCCGG TGTGTTGCGT TCTAAGGGGC TGATCTGGCT GGCGACGCGC
ATGAGTATCA GCGGTCTCTG GTCGCAGGCC GGGAGTGCGT GTCGGGTCGA GCCAGGCGGC
TTGTGGTGGG CGGCGCTGCC GGATGATGAA TTGCCAGATG ATCCTGAAGA TGAAGCGCAT
CTGGCGCAGG TATGGCACAG TCGGTGGGGC GATCGGCGGC AGGAACTGGT GCTGATCGGG
CAGGATATGG ACGAGGCGGC GCTGCGCGCT CGCCTTGATG CCTGCCTGTT GACCGACGAC
GAGATGGCGT TGGGTCCCGA AGGGTGGGCG CAGTTTGACG ATCCTTTCGG GACATGGTCG
GTGTGGGTGT CCGAGGATTG A
 
Protein sequence
MAQHTAQPLP VTVLSGFLGS GKTTLLNHVL ANREGLRVAV IVNDMSEVNI DARLVRSGGA 
ALSRTEERLI EMTNGCICCT LREDLLVEAA RLAREGRFDY LLIESTGISE PLPVAETFTF
ADETGVSLAE LARLDTMVTV VDAFNFPQDL CSTDDLRDRN MAADDDDERS VVDLLIDQVE
FADVLVLNKI DLVDPDVVDQ LEALLRKLNP DARIVRASFG RVPLREILNT GRFNFERAAQ
ALGWLKELRG EHTPETEEYG ISSFVYRARR PFHPQRFWDL IHDEWPGVLR SKGLIWLATR
MSISGLWSQA GSACRVEPGG LWWAALPDDE LPDDPEDEAH LAQVWHSRWG DRRQELVLIG
QDMDEAALRA RLDACLLTDD EMALGPEGWA QFDDPFGTWS VWVSED