Gene Rcas_0644 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRcas_0644 
Symbol 
ID5538107 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRoseiflexus castenholzii DSM 13941 
KingdomBacteria 
Replicon accessionNC_009767 
Strand
Start bp845840 
End bp847258 
Gene Length1419 bp 
Protein Length472 aa 
Translation table11 
GC content62% 
IMG OID640892801 
Productcobalamin (vitamin B12) biosynthesis CbiX protein 
Protein accessionYP_001430787 
Protein GI156740658 
COG category[S] Function unknown 
COG ID[COG2138] Uncharacterized conserved protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones30 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACACACC CATCAAACCA ACCGGCACTG CTGCTGATAG GTCACGGCAC CGACGATCCC 
GCCGGGCTGG AAGAGTATCA TCGGATGGCG ACACTGGTCG GTGAGCGATT GGGCATTGTT
GTGCAACCCT GTTTTCTCGA ACTGGCAGAC CCGCCGATCA GTCAGGCGAT TGACGACTGT
GTGCGCGCCG GATTCCGGCA GATCGTCGCG CTGCCGCTGC TCCTTGGCGC CGCCGGTCAT
CAGAAGAATG ATATTCCGGT GGCGCTTAAT CAGGCGCGTA TGCGTCATCC CAATCTCGAC
ATTCGCTACG GATCGCCGCT CGGTGTACAA TACGCTCTGG TGCGCGCTAT GGCGGAACGC
ATCGAAACCA CATATGCTGC CACGTCCGCG CGCATTCCCC GCAATAGAAC CGCGCTCGCG
CTCATCGGGC GCGGCAGCAG CGATCCCGAC AGCAACGCCG ATGTGGCGCG AATGGCGCGT
CTGCTCTGGG AAGGGCGCGG GTTCGGGTGG GTGGAGTATG GCTTCTTCAG CATTACCCGT
CCCGATGTTG CCGCTATCAT TCGTCACTGT ATCGCCCTTG GCGCGGAGCA GATCATCGTT
GCGCCGTACC TGCTCTTTAC CGGGCGCATT CTTCAGCGGA TGACGTCGCA GGTGGAGAGC
GCCCGAAAGG AACACCCTGC GCTACCCATC CTGATGGCGG AACATCTGGG CTTGCACGAA
GGCGTGCTTG CCGCCATTCT TCAGCGGTAC GACGAAGCAT TGCACGGCGT TGCCGCCGTT
AACTGCGACC TGTGCAAATA TCGGCAAGTA ATGCCGGGAT TCGAGGATGA CCATGGTCGT
CTCCAGAAGA GTGACCACCA TCATGGTTTG CGCGGCGTCC ATCATCACGA TGCTCCCGCG
CTCGACACCA TCCTGCCGCC GCGCTACCGC AACGGCAAAC CGGTCAGCGT AGCGCCGATG
AGTGCCGCGC CACTCGTCTA CGACGATGAA GGACGGGTGG CGTGGGATCG GGTCTGGGGC
GGGGACGATC CGAACAACCC GTTTTGTGAA CTGGCGCTGG CAGGCGGACC GCCACACCGG
GGGACGCTGC TCGAACCGGT GTCGCCGGAA GCCGTTGCCG CCGATCCAGA AGGATACGCG
CGCGTGGTCG CCGAACTGGC GCGTGGTCTG CGCATGGTCA CCGGCTTGCC GGTTGTAACC
GGGAACACGC CCGGATGGGT CGGTCTTGTG TGCGAAAGTG AAGCCATGGC ACTCTGGTTG
CTGCGGGCGA TTGTGGTCGA GAACGTCAGT GTGCGCCGTG AGGGATGCAC CCTCTTCCTC
CCTGCCGGAC CCGACTTTCG CCTGGACGGG GAGATCAAAA ATGTTGTGAC AGCAGTCGCA
AAGACGTATC ACTACTGGAA GGAGCACGTG CAGGGGTAA
 
Protein sequence
MTHPSNQPAL LLIGHGTDDP AGLEEYHRMA TLVGERLGIV VQPCFLELAD PPISQAIDDC 
VRAGFRQIVA LPLLLGAAGH QKNDIPVALN QARMRHPNLD IRYGSPLGVQ YALVRAMAER
IETTYAATSA RIPRNRTALA LIGRGSSDPD SNADVARMAR LLWEGRGFGW VEYGFFSITR
PDVAAIIRHC IALGAEQIIV APYLLFTGRI LQRMTSQVES ARKEHPALPI LMAEHLGLHE
GVLAAILQRY DEALHGVAAV NCDLCKYRQV MPGFEDDHGR LQKSDHHHGL RGVHHHDAPA
LDTILPPRYR NGKPVSVAPM SAAPLVYDDE GRVAWDRVWG GDDPNNPFCE LALAGGPPHR
GTLLEPVSPE AVAADPEGYA RVVAELARGL RMVTGLPVVT GNTPGWVGLV CESEAMALWL
LRAIVVENVS VRREGCTLFL PAGPDFRLDG EIKNVVTAVA KTYHYWKEHV QG