Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tgr7_0029 |
Symbol | |
ID | 7316679 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thioalkalivibrio sp. HL-EbGR7 |
Kingdom | Bacteria |
Replicon accession | NC_011901 |
Strand | - |
Start bp | 27881 |
End bp | 28831 |
Gene Length | 951 bp |
Protein Length | 316 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | 643614919 |
Product | Hydroxymethylbilane synthase |
Protein accession | YP_002512120 |
Protein GI | 220933221 |
COG category | [H] Coenzyme transport and metabolism |
COG ID | [COG0181] Porphobilinogen deaminase |
TIGRFAM ID | [TIGR00212] porphobilinogen deaminase |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.943678 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCCCCTGG ACACCCTTCG CATCGCCACC CGCAAGAGCC CGCTGGCCGT CTGGCAGGCG GAACACGTCG CCGCCCTGGT GAAGGCCCGT CACCCCGGGG TGCGGGTCGA GCTGGTGGGC ATGACCACCC AGGGTGACCG CATCCTGGAC ACCCCGCTCG CGAAGGTGGG CGGCAAGGGC CTGTTCGTGA AGGAGCTGGA GACCGGCCTG CTGGAGGGTC GCGCCGATAT CGCCGTGCAC TCCATGAAGG ACGTGCCCAT GGAACTGCCC GAAGGCCTGT GCCTGCCGGT GATCCTGGAC CGGGAGGATC CCCGGGACGC GTTCGTGTCC AACACCTTCA AGAGCCTGGA CGAGCTGCCC CGGGACGCCC GGGTGGGCAC CTCCAGCCTG CGCCGCCAGT GCCAGCTGCG CCACGACCAT CCCCATTTCC AGATCCTGGA CCTGCGCGGC AACGTCAACA CCCGCCTGGC GAAGCTCGAC GCCGGCGAGT TCGACGCCAT CATCCTGGCC GCCGCGGGCC TCAAGCGCCT GGGTTTCGAG GTGCGCATCG CCTCGGAGAT CACCCCGGAG CAGAGCCTGC CCGCCATCGG CCAGGGGGCC ATCGGCATCG AGTGCCGGGA GAACGACCCG GAGGTCATGG CCCTGATCGG CAGTCTGGAC GACCCGGACA CCCACGTGCG CGTGGCCGCC GAGCGGGCCA TGAACGCACG CCTCAACGGC GGCTGCCAGG TGCCCATCGC CGGCTACGCG GAACTGACCG ATGCCGACAC CCTGCGCCTG CGCGGCCTGG TGGGCGAACC GGACGGCAGC CTGATCCTGC GCGCCGAGCT GTCCGGCCCC CGCGCCGAGG CCGAGGCCCT GGGCCGGGCC GTGGCGGATC TCCTGCTCCA CGAGGGCGCC GGCCCGATCC TGGCCGAACT GGGCCTGGGC CCGGACGCGG ACCGGTCATG A
|
Protein sequence | MPLDTLRIAT RKSPLAVWQA EHVAALVKAR HPGVRVELVG MTTQGDRILD TPLAKVGGKG LFVKELETGL LEGRADIAVH SMKDVPMELP EGLCLPVILD REDPRDAFVS NTFKSLDELP RDARVGTSSL RRQCQLRHDH PHFQILDLRG NVNTRLAKLD AGEFDAIILA AAGLKRLGFE VRIASEITPE QSLPAIGQGA IGIECRENDP EVMALIGSLD DPDTHVRVAA ERAMNARLNG GCQVPIAGYA ELTDADTLRL RGLVGEPDGS LILRAELSGP RAEAEALGRA VADLLLHEGA GPILAELGLG PDADRS
|
| |