Gene Information |
Locus tag | Cphamn1_2184 |
Symbol | |
ID | 6375878 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chlorobium phaeobacteroides BS1 |
Kingdom | Bacteria |
Replicon accession | NC_010831 |
Strand | - |
Start bp | 2361749 |
End bp | 2362789 |
Gene Length | 1041 bp |
Protein Length | 346 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 642684671 |
Product | TIM-barrel protein, nifR3 family |
Protein accession | YP_001960570 |
Protein GI | 189501100 |
COG category | [J] Translation, ribosomal structure and biogenesis |
COG ID | [COG0042] tRNA-dihydrouridine synthase |
TIGRFAM ID | [TIGR00737] putative TIM-barrel protein, nifR3 family |
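
The coordinate and length fields above can be cross-checked with a short sketch. It assumes that Start bp and End bp are 1-based inclusive coordinates and that Protein Length excludes the stop codon; both are assumptions, although they agree with the listed values. The function names are illustrative and not part of the database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length = gene_length(2361749, 2362789)
print(length, protein_length(length))  # 1041 346
```

The strand does not affect this calculation; a feature on the minus strand spans the same number of bases.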
| |
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.067096 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 9 |
Fosmid unclonability p-value | 0.397805 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGAATAG GAAGTCTGGA CATAGAGCGC CCGGTGATTC TCGCTCCCAT GGAAGATGTT ACCGACAGAT CTTTCAGAAA GATCTGCAAA CGGTTTGGTG CAGATATTGT CTATACTGAG TTTGTCAGCG CGGAGGCTCT CCGCAGAGGT GTCGGGAAAT CTATTCAAAA GATGCTTTTC GAGGAGAGCG AACGTCCAGC AGTCATTCAG ATCTTCGGAA ATTCCGGGGA GGCAATGGCC GAGGCGGCGG TCATCGCCGC ATCGGCCAAA CCGGACTACC TTGATATCAA TTTCGGATGT CCTGCCAAGA AGGTCGCAGG TAAAGGGGCG GGAGCCGCTT TGCTCAGGGA ACCCGAAAAA ATGGCGGCTA TTACAGCTGC GGTGGTCAAG GCGGTATCGA TTCCCGTGAC GGTGAAAACC CGGATCGGCT GGGACAGGGA TTCAGTCAAT ATTCTTGATA TTATCCCCAG GCTTGAAGAT GCGGGTATTG CCGCGATTGC CGTGCATGGT CGAACACGAA GCGAAATGTA CAAAGGAAGA GCTGACTGGG ACTGGATTGC CAGAGTGAAG GAGCATGCAA AGATTCCAGT TATCGCCAAC GGTGATATCT GGTCGCCTCA GGATGCTCTT GCCATGTTCA GCCACACCGG AGCGGACGGT ATCATGATAG GGCGTGGCGC AATCGGTAAT CCGTTTATAT TCAGGCAGGT GAAGGAACTG CTGCAAGGCG GCGCCGTGAC GACGTTGCCG GATTTCAGGG ACAGGATTTC GGTAGCTATA GAGCATCTCT CGCTTTCCGT GGAGCACAAG GGAGAAAAGT ACGGAACTCT TGAAATGAGA AGACATTATT CCACCTACCT TAAAGGGCTC CCGAGGGTTT CCAGAGTCAG GGACAAGCTT GTCAGGGAGG AAAAGTGGCA GCAGGTTATC GAGATACTAA GGGCGTACGA GGTTGAGTGT GAGGGTTACG AACGTGAGGG AAAAATCCGG GAGTATGCTG AATTTCTCAA TGATCATTCG AAGCGTCTGG TGCTTAAGTG A
|
Protein sequence | MRIGSLDIER PVILAPMEDV TDRSFRKICK RFGADIVYTE FVSAEALRRG VGKSIQKMLF EESERPAVIQ IFGNSGEAMA EAAVIAASAK PDYLDINFGC PAKKVAGKGA GAALLREPEK MAAITAAVVK AVSIPVTVKT RIGWDRDSVN ILDIIPRLED AGIAAIAVHG RTRSEMYKGR ADWDWIARVK EHAKIPVIAN GDIWSPQDAL AMFSHTGADG IMIGRGAIGN PFIFRQVKEL LQGGAVTTLP DFRDRISVAI EHLSLSVEHK GEKYGTLEMR RHYSTYLKGL PRVSRVRDKL VREEKWQQVI EILRAYEVEC EGYEREGKIR EYAEFLNDHS KRLVLK
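
The gene sequence is listed in coding orientation (it begins with ATG even though the gene lies on the minus strand), so the protein sequence can be reproduced by direct translation. The following is a minimal sketch, not the database's own pipeline. It relies on the fact that translation table 11 uses the same codon-to-amino-acid assignments as the standard code, and it deliberately ignores the alternative start codons that table 11 permits. The names `CODON_TABLE` and `translate` are illustrative.

```python
from itertools import product

# Codons enumerated in T, C, A, G order, paired with the amino-acid string
# for NCBI translation table 11 (identical assignments to the standard code).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding-orientation DNA string, stopping at the first stop codon.

    Whitespace (such as the 10-base grouping used in the record) is ignored.
    Alternative start codons are not converted to M in this sketch.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence, copied from the record.
print(translate("ATGAGAATAG GAAGTCTGGA CATAGAGCGC"))  # MRIGSLDIER
```

Passing the full gene sequence to `translate` should yield the 346-residue protein sequence listed in the record, under the assumptions stated here.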