Gene Information |
Locus tag | Cphamn1_1863 |
Symbol | |
ID | 6375554 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chlorobium phaeobacteroides BS1 |
Kingdom | Bacteria |
Replicon accession | NC_010831 |
Strand | + |
Start bp | 2018936 |
End bp | 2021734 |
Gene Length | 2799 bp |
Protein Length | 932 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 642684359 |
Product | molybdopterin oxidoreductase |
Protein accession | YP_001960261 |
Protein GI | 189500791 |
COG category | [C] Energy production and conversion |
COG ID | [COG0243] Anaerobic dehydrogenases, typically selenocysteine-containing |
TIGRFAM ID | |
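
The gene and protein lengths listed above follow from the start and end coordinates. A minimal sketch of that arithmetic, assuming the coordinates are 1-based and inclusive and that the final codon is a stop codon (the variable names are illustrative, not database fields):

```python
# Values copied from the Gene Information table.
start_bp = 2018936
end_bp = 2021734

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not counted as a residue.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)  # 2799 932
```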
| |
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 5 |
Fosmid unclonability p-value | 0.0514904 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAGAAAA AAATATCACG GCGCCAGTTT CTCAAGGCAG CAGGCGTACT GGGAGGAATG GCTCTGCTTC GGCCAGTATG GGATTCAGGC AGGGAGGAAC GTGCTGTTCA GGCACAATCA TCTTCCGGGA ATGTCATCTG GGTGCCAAGC ATTTGTAATT TCTGTTCTTC TTTCTGCGAT ATCAACGTCG CGGTAAAGAC AGTTGACGGT CAAAAACGTG CAGTTAAGAT AGAGGGAAAC CGGAACAGTC CGCTTAATCG CGGCAAAATC TGTGCACGTG GTCAGGCAGG GCTTCGCCAG CTATATGATC CCGACCGCAT CAAGGAACCG CTTATCAGGG TTGAAGGCAG CAAACGCGGT GAATGGAATT TCCGTGTCGC CACATGGGAT GAGGCATACG ACTATATCGC CTCACGCTTT AAAAAAGTCA ATCCCTGGGA AATCGCCATG ATCGGCGGCT GGACAGCCTG CGTTTCCTAC ATGCACTTCA GTCTTCCTTT TGTCCGTACA CTTGAAATCC CGAATATCAT TGCGTCGCCG ATGCAGCATT GCGTCACAGC AGGGCATCTG GGGACCGATC TGGTCACCGG GAATTTCAAT GTTCATGACG AAGTTCTCGC CGATTTTGAA AACGCGCGAT ACATTCTCTT CAGCCAGAAC AACGCCTCCG TTGCAGCGAT TTCCACGGCA CGTGCCGTCC GGTTCAGCGA AGCGAGAAAA AAAGGAGCAA AAGTCGTCTG CCTCGATCCG AGACAGAGCG AGCTTGCCTC CAAAGCCGAC GAGTGGATCG CGATCAGGCC GGGAACCGAC CATGCCTTTT TTCTCGCAAT GATCCATACC ATGCTCCGCG AGGAGCTGTA TAATAAAGCG TTCCTGGCAA ATCACACCAA CGCGCCGTTC CTGACGTTCA AAGAGAGCAG CGGTACAAGA GAACTTGCTG CCGGCATGGG AAGCGACGGG AAACCGGACG CATATTATAT TTATGATGAA ATCAGCGGCA CAGTGCGCCC TGTTGCCGCA TACACCAACC GGAACGACCG CACCAAAAAC GGCGAACCTG TTGTCCCTGC CCTGAGAGTT CCGGCAGGCA CAGTATGGAA CAGCAAACCC GTCACCACTG TCTTTGATCT CTTTATTGAA AACACAAGCT CGTTCACGCC GGAATGGGCA TCTGCCATCA CCGACATCCC AGCCGAAACC ATCAGGCGCA TCGCACTTGA GTTCGGTCAG GCAAGACCCG CTCTGGTTGA TCCCGGCTGG ATGGGTTCCC GTTACCACCA TGTTATCGCC CAGCGCCGAA TGCAGGCGAT CATTCAAACG CTTGTAGGAG GAATTGACGT TACGGGCGGC TGGCTGATGA ACGGTGAATA CCGCCATAAA GCTGAAAACG CCTGGCACAT GAAGCGCCAC GGGACGGATC AAGCTGAGGA ACCGAAGGAA ATTCCACCCG TCATGCTTCC GGGAATGGCT TTCGCGAACG GCCTTATTGA TATTTTCGCG AATCCCGCGT CCTGGTCCCA CGGCAAACCG GCTCTTTCCT TTGCCTGGGC ACAGGAACAG CAGAAACAGG GCAAGCCCTC TGTATTTCTT CCCGCGATGG CAGACACCGG CCTCCTCGAA GCGGTACGAG GCGACATGGT GTACAATGGA GAGCCTTACA ACATCAAGGC GTTCTTCATG AACGCCGCCA ATCCGATCAG ACACTACTTT CCTGCCGAAC GCTGGGAGGA AATCCTTTCC AGCAGCACTG TAGATCTGGT CGTCACCATT GATGTCCTGC CGTCTGATAC CACAGCGTTT 
GCCGATGTCA TTCTGCCGAA CCATACCTAT CTGGAACGGA ATGAACCGCT TCTTTATCCG CTCGGCCCGG ACACCAACCT TGGCTTTGCC ACCCGGCTTC GAGCTGTAGA ACCGCTCTAC AATACCCGGG ACGCCGCTGA CATTTTCTGC GAAATCACCG ACCGGATGGG CAAGCTGGAT AACTACCTTG CTGGAATCGC TGAGTATGCC GGGCTCGATG AAGCACTGCT GAAACATGAG ATAAAGAGTG CCCGGGATGC CGGAAAGCCC TATAACGAAG CATTCCTGAA AGCCTCATTC GAAGCACTCG GTCACCTGTC TGAGCATGTC ACCGGCGAAA AACTATCCGG TAGCGAGGTT GAAAAAACAA TTCGGGAAAA AGGGGTGATC ATACTGAAAA ACGCTGAGGA ACTTCTGGCG GAATCGGCTA TGCCTGCAAA GATACCCGTT CCTACCATGT CAGGAAGGCT TGAACTGTTC AGTCCGATTC TTGCTTCATT TACCCGAGCC GCAGGCCCCA ACCCGCTCTG GGACCCGGTA CTGGGCTATG TTCCGCTTTC CCTCTCGGAC GACCGGAACA AAAACAGTCT GGATCGGGAC GAATTCTACT TTACCTATGG AAAGGTTCCC GTTGTCTCTC ACGCTTCGAC CAACAATAAT AATGCACTTC TTTCCTCGCT GACAACACCC AAGAAAGGTG CATTTACCGG CCTCTGGTTA AACAAATCGA GAGCCGAAGA GCTTGGTTTT GAACAGGGGC AGATCATTGA GATAGAGAAT CTGAGGTATA ACAAAATAGT CAGGGCGACG TTGTTCGTAA CGGAAATGAT CCGGCCAGAC ACGGTCTTTC TGCCCTCTGC CTACGGCAGT AAAAACCCGA AACTCGGCAT TGCCGGAGGA AAAGGCACAG CACTGAACGA ACTGATGCCT TACAGTATTG AACCGCTCGC CGCGTCATTT ATGTCGCAGG AGTTTACCGT CCGGATAAGA CCGACATAA
|
Protein sequence | MKKKISRRQF LKAAGVLGGM ALLRPVWDSG REERAVQAQS SSGNVIWVPS ICNFCSSFCD INVAVKTVDG QKRAVKIEGN RNSPLNRGKI CARGQAGLRQ LYDPDRIKEP LIRVEGSKRG EWNFRVATWD EAYDYIASRF KKVNPWEIAM IGGWTACVSY MHFSLPFVRT LEIPNIIASP MQHCVTAGHL GTDLVTGNFN VHDEVLADFE NARYILFSQN NASVAAISTA RAVRFSEARK KGAKVVCLDP RQSELASKAD EWIAIRPGTD HAFFLAMIHT MLREELYNKA FLANHTNAPF LTFKESSGTR ELAAGMGSDG KPDAYYIYDE ISGTVRPVAA YTNRNDRTKN GEPVVPALRV PAGTVWNSKP VTTVFDLFIE NTSSFTPEWA SAITDIPAET IRRIALEFGQ ARPALVDPGW MGSRYHHVIA QRRMQAIIQT LVGGIDVTGG WLMNGEYRHK AENAWHMKRH GTDQAEEPKE IPPVMLPGMA FANGLIDIFA NPASWSHGKP ALSFAWAQEQ QKQGKPSVFL PAMADTGLLE AVRGDMVYNG EPYNIKAFFM NAANPIRHYF PAERWEEILS SSTVDLVVTI DVLPSDTTAF ADVILPNHTY LERNEPLLYP LGPDTNLGFA TRLRAVEPLY NTRDAADIFC EITDRMGKLD NYLAGIAEYA GLDEALLKHE IKSARDAGKP YNEAFLKASF EALGHLSEHV TGEKLSGSEV EKTIREKGVI ILKNAEELLA ESAMPAKIPV PTMSGRLELF SPILASFTRA AGPNPLWDPV LGYVPLSLSD DRNKNSLDRD EFYFTYGKVP VVSHASTNNN NALLSSLTTP KKGAFTGLWL NKSRAEELGF EQGQIIEIEN LRYNKIVRAT LFVTEMIRPD TVFLPSAYGS KNPKLGIAGG KGTALNELMP YSIEPLAASF MSQEFTVRIR PT
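
The protein sequence can be spot-checked against the gene sequence by translating a few codons. The sketch below builds the codon table by hand instead of relying on a library; translation table 11 assigns the same amino acids to codons as the standard code, so the standard assignments are used. The names `CODON_TABLE`, `translate`, `first_block`, and `last_codons` are illustrative, and only the first 30 and last 9 nucleotides of the gene are translated here.

```python
# Codon table in NCBI order (bases T, C, A, G); '*' marks a stop codon.
# Assumption: table 11 shares these amino acid assignments with the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate a DNA string codon by codon, ignoring whitespace."""
    dna = "".join(dna.split()).upper()
    usable = len(dna) - len(dna) % 3  # drop any trailing partial codon
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


# First three 10-base blocks and the last nine bases of the gene sequence.
first_block = "ATGAAGAAAA AAATATCACG GCGCCAGTTT"
last_codons = "CCGACATAA"

print(translate(first_block))  # MKKKISRRQF
print(translate(last_codons))  # PT*
```

The first ten residues match the start of the listed protein sequence, and the final codons give the terminal `PT` followed by a stop.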