Gene Rcas_2334 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRcas_2334 
Symbol 
ID5539815 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRoseiflexus castenholzii DSM 13941 
KingdomBacteria 
Replicon accessionNC_009767 
Strand
Start bp3010976 
End bp3012634 
Gene Length1659 bp 
Protein Length552 aa 
Translation table11 
GC content63% 
IMG OID640894467 
Productcobalamin B12-binding domain-containing protein 
Protein accessionYP_001432435 
Protein GI156742306 
COG category[R] General function prediction only 
COG ID[COG5012] Predicted cobalamin binding protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones19 
Fosmid unclonability p-value0.769724 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCGAAA AACAAAAAAC CGTTGTCGCC GCTGCGCTCG GCGAATGTGT GCATGTTGCC 
GGCGTGATGA ACTTTCTGCG CCTGGCGGAA GAAGCCGGCT GGCGCACGGT CTTTCTGGGT
CCGGCCACGC CGATTGAGCG GGTGCTCGAA GCCGCGCGAC AGGAGCAGGC CGATCTGGTT
GGCGTGTCCT ACCGCCTGAC GCCGGAAACC GGCGCGCATC TCCTGGGGCG TTTCGCCGAA
GCGGCAGACG ATCTGCATGC CGCAGGGGTG CGTTTCGCCT TCGCCGGAAC GCCGCCGCTG
GCGGAGAAAG CCGCCACGCT CGGTTTCTTC GAGCAGGTGT TCGATGGCAG CGAACCGGCC
GATCAGGTTC TGGCATATCT CAGGGGGCAA AATCCGGCAC ACGCAACCGA GGCGGATTTT
CCGCAACGCA CCGTTGACCG CATTCGCTGG AAAGCGCCAT TTCCCCTGAT CCGTCATCAC
TTTGGTCTGC CGACGATGCA GGCGACCATC GATGGCATTG CGCGCATCGC CGAAGCGCGC
TGTCTCGATG TGATTTCGCT CGGCACCGAT CAGGACGCCC AGGAGAACTT TTTCCGCCCC
GAACGTCAGG ACCCGGCGCG TACCGGCGCC GGCGGCGTGC CGGTGCGTAG CGCCGACGAC
TATCGCGCCC TCTACGCTGC CAGTCGACGT GGCAACTACC CGCTTATGCG CACCTACTCC
GGCACTGATG ATTTCGTGCG CCTGGCAGAG TTGTATGTCG AAACGATCAA CATCGCCTGG
TGCGCCATCC CGCTCTTTTG GTTCAACCGC ATGGACGGGC GCGGACCGTG GGACCTGGAA
GGGTCGATCC GCGAACATCA GACGATTATG CGCTGGTATG GCGAGCGGGA CATTCCGGTT
GAACTCAATG AGCCGCACCA TTGGGGCATG CGCGATGCGC CCGATACGGT CTTTGTTGCC
AGCGCCTATC TCTCGGCGTA CAATGCGCGC GCGTTCGGCG TGCGCGACTA TATCGCGCAA
CTGATGTTCA ACAGCCCGCC CGGTTTGTCC GATGCCATGG ACCTGGCAAA GATGCTGGCG
GTGATCGAGA TTACTGCGCC GCTGGCAGGA CCGGATTTCC GCATCTGGAA ACAGACCCGC
ACCGGGTTGC TGAGTTACCC GGTTGATCCG GCTGCGTCGC GGGCGCATCT TTCGGCAAGC
ATTTACCTGC AAATGGCGCT GCGTCCGCAC ATCATTCACG TCGTTGGGCA TACCGAAGCG
CACCATGCTG CCACCGCCGA TGATGTGATC GAAGCATGCG GCATAGCTCG CCGCGCTATT
GAAAACGCGC TGCGCGGGCA ACCCGACATG ACCGCCGATC CGTCCGTGCG CGCGCGAGCG
GCGCAACTGG TCGAAGAAAC CCATCTCCTC CTCAATGCGA TGGCGCAACT TGCGCCTCCC
GGCGTGACCG ACCCGCTGAC CGACCCCGCC ACACTGACGA AAGCCGTGGA GATTGGACTG
CTCGATGCTC CGCAATTGCG CAATAACCCG TTCGCACCGG GTCGCGTCGC AACCCGCTTC
ATCAACGGCA TGTGCCTGGC CGTCGATGCG CAGGGACGCC CGCTCGACGA GAAAGAACGC
ATCCGACTGG CGCTGGATCA TGCAACAATG TCAGCCTGA
 
Protein sequence
MTEKQKTVVA AALGECVHVA GVMNFLRLAE EAGWRTVFLG PATPIERVLE AARQEQADLV 
GVSYRLTPET GAHLLGRFAE AADDLHAAGV RFAFAGTPPL AEKAATLGFF EQVFDGSEPA
DQVLAYLRGQ NPAHATEADF PQRTVDRIRW KAPFPLIRHH FGLPTMQATI DGIARIAEAR
CLDVISLGTD QDAQENFFRP ERQDPARTGA GGVPVRSADD YRALYAASRR GNYPLMRTYS
GTDDFVRLAE LYVETINIAW CAIPLFWFNR MDGRGPWDLE GSIREHQTIM RWYGERDIPV
ELNEPHHWGM RDAPDTVFVA SAYLSAYNAR AFGVRDYIAQ LMFNSPPGLS DAMDLAKMLA
VIEITAPLAG PDFRIWKQTR TGLLSYPVDP AASRAHLSAS IYLQMALRPH IIHVVGHTEA
HHAATADDVI EACGIARRAI ENALRGQPDM TADPSVRARA AQLVEETHLL LNAMAQLAPP
GVTDPLTDPA TLTKAVEIGL LDAPQLRNNP FAPGRVATRF INGMCLAVDA QGRPLDEKER
IRLALDHATM SA