Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_0263 |
Symbol | |
ID | 5732158 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | + |
Start bp | 308571 |
End bp | 310058 |
Gene Length | 1488 bp |
Protein Length | 495 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 641277387 |
Product | glutamate formiminotransferase |
Protein accession | YP_001543043 |
Protein GI | 159896796 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG3404] Methenyl tetrahydrofolate cyclohydrolase [COG3643] Glutamate formiminotransferase |
TIGRFAM ID | [TIGR02024] glutamate formiminotransferase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.0569087 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGGCTTGG TCGAAAGTAT TATGAATTTC AGCGAAGGTC GGCGCACCGA AGTTGTGCAC GCCATTCGTG ATGCAATTAC GGCTGTTGCC GGCGTGCAAT TGCTCGATGT TCAATCCGAC GCTGATCATA ATCGCACGGT GATTAGTTTT GCGGGCGAGG CTGAAGCAGT TGGCGAAGCT GCTTTCCAAG CAACCCGCAC CGCCCAAGGC TTAATTAATT TGGATACCCA TCGCGGCGAA CACCCACGCA TCGGCGCGAC CGATGTCTTG CCATTTGTGC CACTTGGCCA AACCACGATG AAACAGTGTG TGGCCTTGGC TCGCAAAGTT GGCAAGCGCA TTGGTGATGA ATTGGGGATT GCGGTTTATT TGTATGAAGA GGCTGCGACC CGCCCCGAAC GTCAAAATTT GGCCGATGTG CGTAAGGGCG AATATGAGGC TTGGCGCAAA GCCATTGGGG TTGATCCGGC GCGGGAGCCA GATTTTGGCC CAGCCGTGGC GACACCTGCA GGCGCAACCG TGGTTGGGGC ACGCCAGCCA TTGATTGCCT ACAACATCTA TTTAAATACC ACCGATGTAG AAATTGCCAA AAAAATCGCT AAATCGATTC GCTATCTTGG CGGTGGCTTG CGCTATGTCA AAGCTTTGGG CTTGTTGGTC GATGGTCGCG CTCAAATCTC GATGAACTTG GTTAATTTCC GTGGAACGCC AATTCATCGA GTGCAGGAGT TAGTACGCGC CGAGGCCATG CGCTATGGCG TGACGATTAC TGAGGGCGAA GTTATTGGGC TTGTGCCGCA AGATGCGCTG GTTGATGCTG CTGAGCATTA TCTGCAACTC AATCGTTTTC GCCGCGACCA AGTGCTTGAA TCGAAGTTGG CCGCGCCAAG TGCTGGCGAT GACTGGTTGC CAACCAACAC GTTCCAAGCC TTTGCAGCTG GGACACCAAC GCCTGGTGGT GGTTCGGCGG CGGCCTTAGC TGGGGCTTTG GCTGGCTCGT TGGGCCAAAT GGTGGCTAAT TTAACCGTCA GCCGCAAAAA ATATGCAGCG GTCAAGCCCA GCATGCAAGC GGCCTTGGAG CGCTTGAGCG AAGCAACCAC CAGCTTGGGC AAATTGGCTT TGTCCGATAG TGCCGCATTT AACGCCATCA GCGTCACGCG TAAATTGCCT GAAGAGCAGG CTGACCGAGC GCAACAATTG GCGGCGGCGA TTGTGCATGC CTGTGAAGTT CCCTTGCAAG TGGCCCAACA AGCTGCCAGT TTGTTTGATG ATTTATATCT GTTGGCGACC CAAGGCAACG TCAATGCCCG CACCGATGCC CAAGTCGGCG GCTATTTGGC CTATGCCGCC GTCAATGGGG CTGGCTTAAA TGTGTTGGTC AATCTTGGCG ATTTAAGCGA TGCTCAATTG CGTGAACAAT TCAGCGCGGC GGTTGCCAAG CTGCGCCAAC AAGCTGAGCA AGGCTTGCAA AAACTGACGA CACTCTAG
|
Protein sequence | MGLVESIMNF SEGRRTEVVH AIRDAITAVA GVQLLDVQSD ADHNRTVISF AGEAEAVGEA AFQATRTAQG LINLDTHRGE HPRIGATDVL PFVPLGQTTM KQCVALARKV GKRIGDELGI AVYLYEEAAT RPERQNLADV RKGEYEAWRK AIGVDPAREP DFGPAVATPA GATVVGARQP LIAYNIYLNT TDVEIAKKIA KSIRYLGGGL RYVKALGLLV DGRAQISMNL VNFRGTPIHR VQELVRAEAM RYGVTITEGE VIGLVPQDAL VDAAEHYLQL NRFRRDQVLE SKLAAPSAGD DWLPTNTFQA FAAGTPTPGG GSAAALAGAL AGSLGQMVAN LTVSRKKYAA VKPSMQAALE RLSEATTSLG KLALSDSAAF NAISVTRKLP EEQADRAQQL AAAIVHACEV PLQVAQQAAS LFDDLYLLAT QGNVNARTDA QVGGYLAYAA VNGAGLNVLV NLGDLSDAQL REQFSAAVAK LRQQAEQGLQ KLTTL
|
| |