Gene Haur_1499 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_1499 
Symbol 
ID5733384 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009972 
Strand
Start bp1746642 
End bp1748354 
Gene Length1713 bp 
Protein Length570 aa 
Translation table11 
GC content52% 
IMG OID641278637 
Productformate--tetrahydrofolate ligase 
Protein accessionYP_001544271 
Protein GI159898024 
COG category[F] Nucleotide transport and metabolism 
COG ID[COG2759] Formyltetrahydrofolate synthetase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.441254 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAAACTA GTTTACAAAT CGCTGCCGAG GCTACGCCAC GCCCCATCAC CCAAATTGCC 
GAAGAATTAG CGATTGCTGA GCAATTTGTC GAACCGTATG GCCGCTACCG TGCCAAAATT
AACCTTGATC TGCTTGATGC GAGCCATGAT CGGCCTCGCG GCAAGCAGAT TTTAGTGACC
GCCATGACTC CAACACCACT TGGCGAGGGC AAAACTGCCA CGACGATCGG CCTTGGAATG
GCCTTAAGTC GCTTGGGCAA ACGCGCCATC TGCACGCTGC GCCAAAGCTC GCTTGGCCCA
GTTTTTGGGA TTAAAGGTGG TGGCTCAGGT GGCGGCTATT CGCAAGTTAT CCCCTTAGAA
GATAGCTTGA TGCACTTAAC TGGTGATATT CACGCCGTGA CCCAAGCCCA CAACCAAATC
GCCGCCATGA CCGACAATAG TTGGTATCAA AAAAATCGGC TGGGCATCGA CCCTGAGCAA
ATTCAGATTC GGCGAGTGCT AGATGTCAAT GATCGCTTTT TGCGCTCGAT CACAATCGGC
CAAGGCGGTT CGCAACATGG CATTCCACGC CAAACGGGCT TCGATATTAC TGCTGCTAGC
GAATTAATGG CTATTTTAGC CTTGGTCAGT GGCGAAAACC ATGCCGATGT GATGCGCGAT
CTCCGCCAAC GCATCGGGCG CATGGTGGTG GCGTTCACTC GTCAAGGCCA ACCAATTACT
GCCGATGATA TTCAGGCGGC GGGTGCAGCC ACGGTGATTA TGCGCAATGC CATTCATCCA
ACCTTGATGC AGACAATTGA AAATACGCCT GTGTTGATGC ATGGCGGGCC ATTTGCCAAT
ATCGCTCACG GCAACGCCAG CGTCGTCGCC GATCAAGTTG GCCTGCGGAT CGCCGATTAT
GTGGTGACCG AGGCTGGTTT TGCCATGGAT ATGGGCGGCG AGAAGTTTTT CGATATCAAA
TGTCGCGCCT TTGATGCCAA ACCTGCGGTC GTGGTGTTGG TCGCTACAAT TCGTGCGCTC
AAAGCTCACA GCGGGCGCTG GAATATCAAA CCAGGTCGCG ATTTGCCCAC CGATTTGTTG
CAAGAAAATC CTGATGCGGT TTATGCAGGC GGGGCCAATC TGCAAAAGCA TATTCGCAAT
GCCCAATTAT TTGGCCTGCC AGTTGTCGTT GCGCTTAATT CGTTCCCTGA TGATCATCCC
TCGGAAATCG AGGCGGTACG CGAAATCGCG ATGAGTGCCG GAGCCTTTGA TGTAGCGGTG
AGCAAGGTAT TTAGCCAAGG CGGGGTTGGC GGCGAAGAAT TGGCAGAAAA AGTGCTAGCC
GCAATTGACC AAGCAGGCCA AGCCCAATTT TTATACGAAC TTGAGCAGCC GTTGACCGCT
AAGATTGCCA CAATTGCCAC CAAAATCTAC GGAGCAGCGG AAGTTAGTTA TAGCGAAGCA
GCCAGCGAAC AATTAGCCAA ACTTGAGGCC AATGGCTTTG GCAATTTGCC GATTTGTATG
GCCAAAACTC ACTTGAGCAT CAGCCATGAT CCGGCGCTCA AAGGTGCGCC AACGGGCTAT
AGCTTCCCAA TTCGTGAAGT GCGGGCCAGC ATTGGAGCAG GCTTTATCTA CCCCATTGCT
GGCGATATGA TGACCATGCC AGGGCTTAGC GCTAACCCTG CTGCCCAACA AATTGATATC
GATGAACATG GCAATACAGT TGGCTTATTC TAG
 
Protein sequence
MKTSLQIAAE ATPRPITQIA EELAIAEQFV EPYGRYRAKI NLDLLDASHD RPRGKQILVT 
AMTPTPLGEG KTATTIGLGM ALSRLGKRAI CTLRQSSLGP VFGIKGGGSG GGYSQVIPLE
DSLMHLTGDI HAVTQAHNQI AAMTDNSWYQ KNRLGIDPEQ IQIRRVLDVN DRFLRSITIG
QGGSQHGIPR QTGFDITAAS ELMAILALVS GENHADVMRD LRQRIGRMVV AFTRQGQPIT
ADDIQAAGAA TVIMRNAIHP TLMQTIENTP VLMHGGPFAN IAHGNASVVA DQVGLRIADY
VVTEAGFAMD MGGEKFFDIK CRAFDAKPAV VVLVATIRAL KAHSGRWNIK PGRDLPTDLL
QENPDAVYAG GANLQKHIRN AQLFGLPVVV ALNSFPDDHP SEIEAVREIA MSAGAFDVAV
SKVFSQGGVG GEELAEKVLA AIDQAGQAQF LYELEQPLTA KIATIATKIY GAAEVSYSEA
ASEQLAKLEA NGFGNLPICM AKTHLSISHD PALKGAPTGY SFPIREVRAS IGAGFIYPIA
GDMMTMPGLS ANPAAQQIDI DEHGNTVGLF