Gene Information |
Locus tag | Tgr7_3110 |
Symbol | |
ID | 7317340 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thioalkalivibrio sp. HL-EbGR7 |
Kingdom | Bacteria |
Replicon accession | NC_011901 |
Strand | - |
Start bp | 3258095 |
End bp | 3259321 |
Gene Length | 1227 bp |
Protein Length | 408 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 643618009 |
Product | Formamidase |
Protein accession | YP_002515166 |
Protein GI | 220936267 |
COG category | [C] Energy production and conversion |
COG ID | [COG2421] Predicted acetamidase/formamidase |
TIGRFAM ID | |
|
|
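The coordinate and length fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming that Start bp and End bp are 1-based inclusive coordinates and that the reported gene length includes a single terminal stop codon; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information table.
start_bp = 3258095
end_bp = 3259321

# Assumption: coordinates are 1-based and inclusive, hence the +1.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS ends with one stop codon that is not counted
# in the protein length.
codons, remainder = divmod(gene_length, 3)
protein_length = codons - 1

print(gene_length, remainder, protein_length)  # 1227 0 408
```

Under these assumptions the result agrees with the listed Gene Length (1227 bp) and Protein Length (408 aa).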
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.441852 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCCCGAAA CACTGATCAG TTACGATTTC AGCAAGCCAC CCGAGGAGCA GGACATCAAG CCACACAACC GGTGGCATCC TGACATTCCC ATGGCAGTCC GGGTCAAGCC CGGCGCCGAG TTCCGCCTGC AGTGCTTCGA CTGGACCGGC GGCCAGATCG AGAACAACGA CAGCGCCAAC GACATTCGCG ACGTGGACCT GACCAAGGTG CACTACCTGT CCGGCCCCGT GGGTATCGAG GGTGCCGAGC CCGGCGACCT GCTGGTGGTG GACATCCTCG ACGCCGGCCC GCTGCCGGAT CAGCTGTGGG GTTTCAACGG CATCTTCGCC CGGGAAAACG GCGGCGGCTT CCTGGTCGAT CATTTCCCCG AGGCACGCAA GTCCATATGG GACTTCCACG GTATCTACAC CAGCTCCCGG CACATCCCCA AGGTGCGCTT CCCGGGCATC ATCCACCCCG GCCTGATCGG TACCCTGCCT TCCAAGGAAC TGCTCAAGCA GTGGAACGAC CGGGAAAAGG CCCTGGTGGA CACCGATCCG AATCGTGTGC CGCCCCTGGC GACCCTGCCC TACGAAGAGA CCGCCCTGAT GGGCAGCATG AAGGGCGAGG AGGCCAAGGC CGCCGCCCGT GAAGCGGCCC GTACCGTGCC GCCCCGCGAA CACGGCGGCA ACTGCGACAT CAAGAACCTG TCCCGCGGTT CCCGGGTGTA CTTCCCCGTG TATGTGAAGG AGGCCGGCCT GTCCATGGGC GACCTGCACT TCTCCCAGGG CGACGGCGAG ATCACCTTCT GCGGCGCCAT CGAGATGGCG GGCTTCATCG AGCTGCGTGT CAACCTCATC AAGGGCGGCA TGAAGAAGTA CAACATCAGC AGCCCGATCT TCCAGCCCAG CAAGATCGAT CCCCAGTTCA AGGACTACCT GATCTTCGAA GGCATCTCCG TGGACGAGGA CGGCAAGCAG CACTACCTGG ATGCGCACAT TGCCTATCGT CGCGCCTGTC TGGCCGCCAT CGACTACCTG AAGAACTTCG GCTACTCCGG TGAGCAGGCC TACGCCATTC TGGGCACGGC GCCGGTGGAG GGACATATCA GCGGTATCGT GGACATCCCC AACGTCTGCG CCACCCTGTG GCTGCCCACC GAGGTGTTCG AGTTCGACAT CCATCCCACG GATGCGGGCC CGGCCATCGA GATCCCCTCG GGTATCGATA CCTCGCGCAC CAGCTGA
|
Protein sequence | MPETLISYDF SKPPEEQDIK PHNRWHPDIP MAVRVKPGAE FRLQCFDWTG GQIENNDSAN DIRDVDLTKV HYLSGPVGIE GAEPGDLLVV DILDAGPLPD QLWGFNGIFA RENGGGFLVD HFPEARKSIW DFHGIYTSSR HIPKVRFPGI IHPGLIGTLP SKELLKQWND REKALVDTDP NRVPPLATLP YEETALMGSM KGEEAKAAAR EAARTVPPRE HGGNCDIKNL SRGSRVYFPV YVKEAGLSMG DLHFSQGDGE ITFCGAIEMA GFIELRVNLI KGGMKKYNIS SPIFQPSKID PQFKDYLIFE GISVDEDGKQ HYLDAHIAYR RACLAAIDYL KNFGYSGEQA YAILGTAPVE GHISGIVDIP NVCATLWLPT EVFEFDIHPT DAGPAIEIPS GIDTSRTS
|
| |
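The relationship between the gene sequence and the protein sequence can be illustrated with a short translation sketch. It assumes the gene sequence is listed 5' to 3' in coding orientation (it begins with ATG even though the gene lies on the minus strand) and uses the amino acid assignments of translation table 11, which match the standard genetic code; table 11 differs mainly in which start codons it permits. The helper names (`clean`, `translate`, `gc_fraction`) are hypothetical, and only the first 30 and last 15 nucleotides of the record are used to keep the example short.

```python
from itertools import product

# Amino acid assignments for translation table 11 (same as the standard
# code), listed in TCAG order with the third codon base varying fastest.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spacing used in the record and upper-case the sequence."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate codon by codon from position 1, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


# First 30 nt and last 15 nt of the gene sequence in the record.
gene_start = "ATGCCCGAAA CACTGATCAG TTACGATTTC"
gene_end = "TCGCGCACCAGCTGA"

print(translate(gene_start))  # MPETLISYDF
print(translate(gene_end))    # SRTS
```

The two outputs correspond to the first ten and last four residues of the listed protein sequence. Passing the full gene sequence to `gc_fraction` would give a value to compare against the reported GC content; the snippet above does not compute it, since it only includes fragments of the gene.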