Gene Information |
Locus tag | Rru_A2008 |
Symbol | |
ID | 3835433 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodospirillum rubrum ATCC 11170 |
Kingdom | Bacteria |
Replicon accession | NC_007643 |
Strand | + |
Start bp | 2320063 |
End bp | 2321928 |
Gene Length | 1866 bp |
Protein Length | 621 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 637826108 |
Product | thiamine biosynthesis protein ThiC |
Protein accession | YP_427095 |
Protein GI | 83593343 |
COG category | [H] Coenzyme transport and metabolism |
COG ID | [COG0422] Thiamine biosynthesis protein ThiC |
TIGRFAM ID | [TIGR00190] thiamine biosynthesis protein ThiC |
| |
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.578808 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACTGCGC CGTTTTTATC CTCCCTATCC CCCACCAGTC CGCTGGCCAG CGCCACGGCG CCCTTTCCCG GCTCGCGCAA GGTTTATGCC CGCCCGGCCG ACGCCCCCCA TTTGCGCGTG CCCTTTCGCG AGATCATCCT GTCCGACCCG GGCGAAGCCC CGGTGCGGGT GGCCGACCCT TCGGGCCCCT ATAGCGATCC CGAGGCGACG ATCGATCTGC GCCAGGGATT GGCCCGCCAT CGGGCGTCTT GGGCCAGCGC CCGGGGCAAT TCCACGGTCA CGGCCGGCCG CCCCGCCCCG AGTGAAGGCG ATTTCGAGGC CTTTCCCCTG ACCTACGCCC CCTTGCGCCG CCGCGATGAG ACCCCCTTCA CCCAACTGGA ATACGCCCGC GCCGGAGTGA TCACCGACGA GATGATCTAT GTGGCGACGC GGGAGAACCT GGGCCGGGAC AGCGCCGTGG CCGGGGCCTG CGCGCGGTTG GCGGGCGGCG AGGCCTTTGG CGCCGCCCTG CCCGCCCATG TCACGCCCGA GTTCGTCCGC GCCGAGATCG CCGCCGGGCG GGCGATCATC CCGGCCAACA TCAACCATCC CGAGCTTGAA CCGACGATCA TCGGCCGCAA CTTCCTGGTC AAGGTCAACG CCAATATCGG CAATTCGGCC CTGGGATCGT CGATCGAGGA CGAGGTGGCA AAGCTGGTCT GGGCCATCCG CTGGGGGGCC GACACGGTGA TGGATCTGTC GACGGGCAAG GCCATCCACG CCACCCGCGA ATGGATCTTG CGCAACAGTC CGGTGCCCAT CGGCACCGTT CCCCTGTATC AGGCTTTGGA AAAGGTGGGC GGCGACGCCA CCCGCCTTGA CTGGGCGGTG TTCGAAGACA CCCTGATCGA ACAATGCGAA CAGGGCGTTG ATTATTTCAC CATCCATGCC GGGGTGCGGC TGGCCCATAT TCCGCTGACC GCGTCGCGCA CCACCGGCAT CGTCAGTCGC GGCGGTTCGA TCCTGGCCAA ATGGTGCTTG TCCCACCACC GCGAGAATTT CCTTTATGAG CGCTTCGCCG ATATCTGCGC CATCCTGCGC CGCTATGACG TGGCCTTTTC GCTAGGCGAC GGCCTGCGCC CGGGATCGGT GGCCGATGCC AATGACGCCG CCCAATTCGC CGAACTCGAC ACTTTGGGCG CCTTGACGGC GGTGGCCTGG GAGCACGGCT GTCAGGTGAT GGTCGAAGGC CCGGGCCATG TGCCCATGCA CAAGATCAAG GCCAATATGG ACCGCCAACT CGCCACCTGC GGCGAGGCGC CGTTCTATAC CCTGGGGCCC TTGACCACCG ATATCGCGCC CGGCCACGAC CACATCACCT CGGCGATCGG CGCGGCGATG ATCGGCTGGT TCGGCACCGC CATGCTGTGC TACGTCACCC CCAAGGAACA CCTGGGGCTG CCCGATCGCG CCGACGTCAA GGCCGGGGTG ATCGCCTATA AGCTGGCCGC CCATGCCGCC GATATCGCCA AGGGGCATCC CGCCGCCCAG CTGCGCGACG ACGCCATCAG CAGGGCGCGC TTCGATTTCC GCTGGAGCGA CCAGTTCAAC CTCGGCCTCG ATCCCGAAGG CGCGCGGGCC TTCCACGACG AAACCCTGCC CCATGCCGCC CATAAGACGG CGCATTTCTG TTCGATGTGC GGGCCGAAGT TCTGTTCGAT GAAGATCAGC CATGATATCC GCGACGGCGC CCTCGAGGGG GCGGACGCCC TGACCCAGGC CGGACTGGAC CAGATGAGTG CGACCTTCCG CGCCAGCGGC 
GGCGAGGTCC ACCTTGATGC GCAAGCTCTC GACGCCCTCG CCTGGGAGGG GAAACCCGCG CGATAA
|
Protein sequence | MTAPFLSSLS PTSPLASATA PFPGSRKVYA RPADAPHLRV PFREIILSDP GEAPVRVADP SGPYSDPEAT IDLRQGLARH RASWASARGN STVTAGRPAP SEGDFEAFPL TYAPLRRRDE TPFTQLEYAR AGVITDEMIY VATRENLGRD SAVAGACARL AGGEAFGAAL PAHVTPEFVR AEIAAGRAII PANINHPELE PTIIGRNFLV KVNANIGNSA LGSSIEDEVA KLVWAIRWGA DTVMDLSTGK AIHATREWIL RNSPVPIGTV PLYQALEKVG GDATRLDWAV FEDTLIEQCE QGVDYFTIHA GVRLAHIPLT ASRTTGIVSR GGSILAKWCL SHHRENFLYE RFADICAILR RYDVAFSLGD GLRPGSVADA NDAAQFAELD TLGALTAVAW EHGCQVMVEG PGHVPMHKIK ANMDRQLATC GEAPFYTLGP LTTDIAPGHD HITSAIGAAM IGWFGTAMLC YVTPKEHLGL PDRADVKAGV IAYKLAAHAA DIAKGHPAAQ LRDDAISRAR FDFRWSDQFN LGLDPEGARA FHDETLPHAA HKTAHFCSMC GPKFCSMKIS HDIRDGALEG ADALTQAGLD QMSATFRASG GEVHLDAQAL DALAWEGKPA R
|
| |
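
The length fields in the Gene Information table can be cross-checked against the coordinates. The following is a minimal sketch, not part of the original record. It assumes that Start bp and End bp are 1-based and inclusive, and that the final codon is a stop codon that is not counted in the protein length. The variable names are illustrative only.

```python
# Values copied from the Gene Information table.
start_bp = 2320063
end_bp = 2321928
gene_length_bp = 1866
protein_length_aa = 621

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span = end_bp - start_bp + 1
assert span == gene_length_bp

# Assumption: the CDS is a whole number of codons and the last codon is a stop
# codon, which does not contribute an amino acid to the protein.
codons, remainder = divmod(gene_length_bp, 3)
assert remainder == 0
assert codons - 1 == protein_length_aa

print(span, codons, codons - 1)
```

Under these assumptions the record is internally consistent: the coordinate span matches the stated gene length, and the codon count minus the stop codon matches the stated protein length.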