Gene Rcas_1861 details


Gene Information

Locus tag: Rcas_1861
Symbol:
ID: 5539339
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Roseiflexus castenholzii DSM 13941
Kingdom: Bacteria
Replicon accession: NC_009767
Strand:
Start bp: 2378160
End bp: 2380103
Gene Length: 1944 bp
Protein Length: 647 aa
Translation table: 11
GC content: 65%
IMG OID: 640893999
Product: putative molybdopterin biosynthesis protein MoeA/LysR substrate binding-domain-containing protein
Protein accession: YP_001431970
Protein GI: 156741841
COG category: [H] Coenzyme transport and metabolism; [P] Inorganic ion transport and metabolism
COG ID: [COG0303] Molybdopterin biosynthesis enzyme; [COG1910] Periplasmic molybdate-binding protein/domain
TIGRFAM ID: [TIGR00177] molybdenum cofactor synthesis domain
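
The coordinates and lengths in this record are mutually consistent, which can be checked with a little arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the unspliced CDS ends in a single stop codon that is not counted in the protein length. The function names `gene_length` and `protein_length` are hypothetical helpers, not part of any database API.

```python
# Values copied from the record above.
START_BP = 2378160
END_BP = 2380103


def gene_length(start: int, end: int) -> int:
    """Length of a 1-based, inclusive coordinate interval (assumed convention)."""
    return end - start + 1


def protein_length(gene_len: int) -> int:
    """Residues encoded by an unspliced CDS, assuming one terminal stop codon."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 1944
print(protein_length(length_bp))  # 647
```

Both results match the Gene Length and Protein Length fields reported in the record.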


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 19
Fosmid unclonability p-value: 0.72567
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTCACGAC GACGCTACTA TCTTGAGGAT CGCGCCCTGG ATGATGCGGT AGCCCGCTTT 
GAGGCGGCGA TTGAGCGCGT TGGCGGACTG CATCCGCTGG ACGGCGAGAC TGTTCCTCTG
GCTGAGGCGC GGGACCGCGT CACAGCCGCT CCGGTGTGGG CAGCCCGTTC TGTGCCTCAC
TACCACGCAG CAGCCATGGA TGGCATCGCC GTGCGCGCCG CCACGACCGC CGGCGCGACC
GAGTCGTCGC CGCTGACCCT CGCGCTTGGC GAGCAGGCAG TCTGGGTCGA TACCGGCGAC
CCGATGCCGC CCGGCGCTGA TGCTGTGGTG ATGGCCGAGC ACGTTCAGGT GCTCGACGAT
ACGACTGTGG CGATCACCGC TGCGGTAGCG CCTTGGCAGC ATGTGCGACC GATGGGTGAG
GACATCGTTG CGACCGAACT GGTCGTCCCT GAAGGTGTGC GCTTGCGCCC GGTCGATCTG
GGAGCGATCG CTGCCGCCGG TCACGCAACT GTCAGCGTGC GGCGACGTCC GCGTGTGGCG
ATTATCCCCA CCGGCACCGA GTTAGTCACG CCCGAAGCGG CTGCCGAGCG CGAGGCGATA
GGTCATCCGG TGCGCGCCGG TGAGATCATC GAGTTTAACT CACTGATCCT CTCCGGTATG
GTCGAGGAGT GGGGCGGCCT GCCTACGCGC CTGCCCCCTG TGCCTGACCG GCAGGATTTG
TTGCGTGCTG TCATTGTGAG CGCTATCGAC CACCACGATG TCATTGTGGT CAATGCCGGA
TCGTCAGCCG GCGCCGAGGA CTACACGGCG ACCGTGCTCG CTGAACTTGG CGAGGTGGCC
GTCCATGGAG TGGCTATTCG TCCCGGACAC CCCGTGATCC TCGGTGTGGC GGGCGGGAAG
CCGGCTCTGG GACTGCCCGG CTATCCCGTA TCGGCAGCGC TCACCGCCGA ACTGTTCCTC
CGTCCACTGC TGTACCGGCT CCTGGGTCTC ACCCCGCCGC CCCGCCCTGA GGTGACGGCG
ACGATCAGTC GCAAACTGCT CTCGCCGCTG GGCGAGGATG AGTTCGTGCG CGTCACGCTG
GGTCGGGTGG ATGGGCGACT CATCGCTACG CCGCTGGCGC GGGGTGCGGG TGTGGTGATG
TCCCTTGTTC GTGCCGACGG ACGCGCGCGC ATTCCCCGTT TCTCCGAGGG TCTCCATGCA
GGCGCCGAGG TTACTGTCGA ACTCCTGCGC GATCAGGCTG AGATTGAGTC AACCATCGTT
GTCATCGGCA GCCACGATCT GGCGCTCGAC CTGCTGGCAA GTCATGTGCG ACGCGCCGGT
CGGCGCCTCA GTTCAGCCAA TGTGGGCAGC CTGGGCGGTC TGATGGCGCT TAAGCGTCGC
GATGCCCACC TTGCTGGCGT GCACCTCCTT GATGAGGAGA CCGGCGAGTA TAACGCCTCG
TATATACGGC GTCTATTGCC GGACGAAGAG ATTGTGCTGG TTCATCTGGC GTACCGCGAG
CAGGGCTTTC TCGTGGCGCC GGGTAACCCG CTTGGGCTGA GCAGGCTGCG TGATCTGGCG
CGTCCCGGCG TGCGCTTCGT CAACCGGCAG CGTGGATCGG GGACACGTAT GCTGCTCGAT
TATCAATTGC GTCTGGAAGG GATAGACCCC AGCGCCATTA CCGGCTATCA GCGCGAGGAG
TTCACGCACA TGGCGGTTGC TGCGGCAGTG CAGAGCGGCG CTGCGGATGT GGGTCTTGGT
ATCAGCGCCG CTGCGCGCGC TCTTGGTCTT GCCTTTATCC CCCTCTTCAG CGAGCGCTAT
GATCTGGCCG TTCCGCGTCG TCACTGGGAG AGCGAGTTGT TGGCGCCACT GCGGCAGATA
CTTTTCGAAT CGGCGTATCG CAGCGCCGTC GAATCGCTGG GTGGCTACAA CGTGGATCGG
ATGGGTGAAG AGGTGCGGGT CTGA
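
The gene sequence above is displayed in space-separated blocks of 10 bases, 60 per line, so the whitespace has to be removed before the sequence is used. A minimal sketch of that step, together with a simple GC-fraction calculation, is shown here; `join_blocks` and `gc_fraction` are hypothetical helper names, and only the first line of the gene sequence is used as sample input.

```python
def join_blocks(text: str) -> str:
    """Remove all whitespace from a blocked sequence display and uppercase it."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First line of the gene sequence from the record above.
first_line = "ATGTCACGAC GACGCTACTA TCTTGAGGAT CGCGCCCTGG ATGATGCGGT AGCCCGCTTT"

seq = join_blocks(first_line)
print(len(seq))  # 60
print(f"GC fraction of this line: {gc_fraction(seq):.2f}")
```

Applying the same two functions to the full gene sequence gives a value to compare with the GC content field in the Gene Information section.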
 
Protein sequence
MSRRRYYLED RALDDAVARF EAAIERVGGL HPLDGETVPL AEARDRVTAA PVWAARSVPH 
YHAAAMDGIA VRAATTAGAT ESSPLTLALG EQAVWVDTGD PMPPGADAVV MAEHVQVLDD
TTVAITAAVA PWQHVRPMGE DIVATELVVP EGVRLRPVDL GAIAAAGHAT VSVRRRPRVA
IIPTGTELVT PEAAAEREAI GHPVRAGEII EFNSLILSGM VEEWGGLPTR LPPVPDRQDL
LRAVIVSAID HHDVIVVNAG SSAGAEDYTA TVLAELGEVA VHGVAIRPGH PVILGVAGGK
PALGLPGYPV SAALTAELFL RPLLYRLLGL TPPPRPEVTA TISRKLLSPL GEDEFVRVTL
GRVDGRLIAT PLARGAGVVM SLVRADGRAR IPRFSEGLHA GAEVTVELLR DQAEIESTIV
VIGSHDLALD LLASHVRRAG RRLSSANVGS LGGLMALKRR DAHLAGVHLL DEETGEYNAS
YIRRLLPDEE IVLVHLAYRE QGFLVAPGNP LGLSRLRDLA RPGVRFVNRQ RGSGTRMLLD
YQLRLEGIDP SAITGYQREE FTHMAVAAAV QSGAADVGLG ISAAARALGL AFIPLFSERY
DLAVPRRHWE SELLAPLRQI LFESAYRSAV ESLGGYNVDR MGEEVRV