Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rcas_1759 |
Symbol | |
ID | 5539237 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Roseiflexus castenholzii DSM 13941 |
Kingdom | Bacteria |
Replicon accession | NC_009767 |
Strand | - |
Start bp | 2261687 |
End bp | 2262757 |
Gene Length | 1071 bp |
Protein Length | 356 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 640893898 |
Product | molybdenum cofactor biosynthesis protein A |
Protein accession | YP_001431869 |
Protein GI | 156741740 |
COG category | [H] Coenzyme transport and metabolism |
COG ID | [COG2896] Molybdenum cofactor biosynthesis enzyme |
TIGRFAM ID | [TIGR02666] molybdenum cofactor biosynthesis protein A, bacterial |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.103881 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 23 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCAGTACG ATGATGTACG CCGGATGACC ATTCCGCTAT ACACGGCCGA ACAAGCACGG CAGGCGCGAC CGTTCTACCC GACCGATACG CCAGCGCACG ATTCGTATGG ACGCCGGATC GACTACCTGC GCATCTCGCT GACCGACCGT TGCAACATGC GGTGTGTTTA CTGCATGCCG GAGATCGGCA TGCAATTTAT GCCGCGCCCG GAGTTGCTGA CGACGGACGA ACTGTTGCTG GTGGTGCGCG CGGCTGCCAG AGCCGGATTT CGCAAAATCC GATTGACCGG CGGCGAACCG ACGTTACGCC CCGATATTGT CGAGATTGTG CGGGAGATCA AGCGCATCCC CGGCATTACT CACCTTGCCA TGACGACCAA TGCGCTGCGG CTGGAAAAAC TCGCCGAACC GCTGAAAGCC GCCGGTCTCG ACCGGGTGAA TATCAGCATC GACACGCTCG ATCCGGAGAA GTTTCGGAAT ATGACACGCG GCGGATCGTT CGAGAAAGTC TGGGCAGGAA TCGAAGCCGC CGACCGTGTG GGGTTGCACC CGCTCAAACT CAATTCGGTC GTTGTGCGCG GGATGAACGA CGACGAGGTT CCACGCCTGG CTGCGTTGAC GCTCCGCTAC CCGTGGGAGA TGCGCTTCAT CGAGGTGATG CCGCTGACCG GCGTGGCAGA CTTGGCGCAG AGCAGTGTGG TGACGAGCGC CGAACTGATT GCGCGCCTGG AGTCAGTCTA TGGTCCGCTC GAAGACCTCG GACTGGCGCC GGCCGACTCG GCGCGACGCT ACCGCATCCC CGGAGCGCCG GGCAAACTGG GGTTCATCAG TTCGGTGAGC GAACCGTTCT GCGCCACGTG CAACCGGATG CGGCTGACGT CCGACGGGCG CCTGCACCTC TGTTTGCTGC GCGATCATGA GGTTGATCTA CGCGCCGCCA TCCGTAGTGG CGCCACACTC GATGAGATCG AGCAGATTAT TCGCTACGCC GTGGCGCTCA AACCCTGGGG TCATGGCCTG CCCGACGGCG TGCTGCCGAC ACTGCGCGGC ATGTCGGAAC TCGGAGGATA G
|
Protein sequence | MQYDDVRRMT IPLYTAEQAR QARPFYPTDT PAHDSYGRRI DYLRISLTDR CNMRCVYCMP EIGMQFMPRP ELLTTDELLL VVRAAARAGF RKIRLTGGEP TLRPDIVEIV REIKRIPGIT HLAMTTNALR LEKLAEPLKA AGLDRVNISI DTLDPEKFRN MTRGGSFEKV WAGIEAADRV GLHPLKLNSV VVRGMNDDEV PRLAALTLRY PWEMRFIEVM PLTGVADLAQ SSVVTSAELI ARLESVYGPL EDLGLAPADS ARRYRIPGAP GKLGFISSVS EPFCATCNRM RLTSDGRLHL CLLRDHEVDL RAAIRSGATL DEIEQIIRYA VALKPWGHGL PDGVLPTLRG MSELGG
|
| |