Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | VC0395_0556 |
Symbol | fhs |
ID | 5134660 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Vibrio cholerae O395 |
Kingdom | Bacteria |
Replicon accession | NC_009456 |
Strand | + |
Start bp | 608702 |
End bp | 610450 |
Gene Length | 1749 bp |
Protein Length | 582 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 640530878 |
Product | formate--tetrahydrofolate ligase |
Protein accession | YP_001215395 |
Protein GI | 229259761 |
COG category | [F] Nucleotide transport and metabolism |
COG ID | [COG2759] Formyltetrahydrofolate synthetase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 24 |
Plasmid unclonability p-value | 0.124626 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCTGCCAG ATATTGAAAT TTGCCGCGCT ACACCATTAG CGCCTATCGA CACCATTGCT CAAAAAGCGG GATTGCACGC GAATGAGTAC GAAAGCCACG GCCAGCATAA AGCCAAAGTG TCACTGCATT GTCTAGAGCG ATTGGCCAAC AAGCCCAAAG GTAAATTCAT TCTGGTCACT GCGATTACCC CAACACCACT GGGTGAAGGT AAAACCGTTA CCACGATTGG TTTAGCACAA GGGTTGGCCA AACTTAATCA CTCGGTCATG GCGTGCATTC GTCAGCCTTC GATGGGGCCG ATTTTTGGGG TAAAAGGGGG CGCTGCGGGT GGTGGTTATT CACAAGTTGC ACCTATGGAA GAGCTCAATC TGCATTTAAC CGGTGATATT CATGCCGTAA CGGCGGCGCA CAACCTTGCG GCAGCTGCGA TTGATGCGCG AATTTATCAC GAGCAGCGCC TCGGCTATGC CGATTTTGAG CGCCGCACCG GCATGCCAGC GCTGCGCATT GACTCCAAAC AGGTCATATG GAAACGCGTG ATGGATCATA ACGATCGCGC GCTGCGCATG GTGACGGTCG GCCGCAATGA ACCGGGAAAA AATATTAATG GTTATGAGCG CGAAGATGGT TTCGATATCT CTGCCGCCTC CGAATTGATG GCGATTCTGG CTCTCGCCTC GGATCTACGT GATTTGCGTC GCCGCATCGG TAATGTGGTG TTGGCTTATG ATTTGGACGG TAATCCGGTA ACTACAGAAG ATCTGAAAGT AGCTGGCGCA ATGGCAGTCA GCATGAAAGA AGCGATTGAG CCGACCTTGA TGCAAACTTT AGAAGGCGTC CCAACACTGA TCCACGCCGG CCCATTTGCC AATATCGCGC ACGGTAACTC CTCGATCATT GCCGATGAAA TTGCCACCCG TTTGGCCGAC TACACCGTGA CCGAAGGCGG TTTTGGCTCC GATATGGGGT TTGAGAAAGC GTGCAACATC AAAGCCAAAG CATCCGGTAA AACACCAGAT TGTGCGGTGA TTGTCGCCAC CTTACGCGGC TTAAAAGCCA ACTCAGGCCT GTATGATTTA CGCCCCGGCC AAGCGGTACC GGATGCCCTA TTCGCGCCAG ACAGCGCCGC TTTGCAAGCC GGTTTTGCAA ACTTGAAATG GCATATTGAT AACGTTAACC AGTATGGTGT GCCTGCCGTG GTAGCGATTA ACCGCTTCCC ACAAGATTGT GCCGAAGAAC TGGAACAACT GGTTAAGCTG ATAGAAGCCC TGCCCAACCG TGTATCGGTA GCCATTTCAG AAGGCTTTGC CAAAGGCGGT GAAGGCACCC AACTCCTTGC CGAAAAAGTG GTTGAGCAGT GTCAACATCC AACGAAATTC ACTCCGCTCT ACCATTCAGG CATACCATTG GATGAAAAAC TCAAAGCGGT CGCGGTAAAA GGTTATGGCG CTGCCGAGAT TGCACTGAAT GATAAAGCCG CACAGCAATT AGCCACACTG CAAGCCCAAG GCTTTGATCA TCTTGCGGTT TGCTTGGCGA AAACACCGCT GTCGATTTCT ACCGATCCCG CAATCAAAGG CGCGCCACGT GATTTTATCG TACCGATCCG CGAGCTGCGT TTATGTGCAG GCGCCGAATT TGTCTACGCC TTGTGTGGCA GTGTAATGAC CATGCCCGGC TTACCGGAAA AACCTTCCTT TATGGCGCTC GATATCGATC AGCACGGCAA CATCGTCGGC TTAAGTTAA
|
Protein sequence | MLPDIEICRA TPLAPIDTIA QKAGLHANEY ESHGQHKAKV SLHCLERLAN KPKGKFILVT AITPTPLGEG KTVTTIGLAQ GLAKLNHSVM ACIRQPSMGP IFGVKGGAAG GGYSQVAPME ELNLHLTGDI HAVTAAHNLA AAAIDARIYH EQRLGYADFE RRTGMPALRI DSKQVIWKRV MDHNDRALRM VTVGRNEPGK NINGYEREDG FDISAASELM AILALASDLR DLRRRIGNVV LAYDLDGNPV TTEDLKVAGA MAVSMKEAIE PTLMQTLEGV PTLIHAGPFA NIAHGNSSII ADEIATRLAD YTVTEGGFGS DMGFEKACNI KAKASGKTPD CAVIVATLRG LKANSGLYDL RPGQAVPDAL FAPDSAALQA GFANLKWHID NVNQYGVPAV VAINRFPQDC AEELEQLVKL IEALPNRVSV AISEGFAKGG EGTQLLAEKV VEQCQHPTKF TPLYHSGIPL DEKLKAVAVK GYGAAEIALN DKAAQQLATL QAQGFDHLAV CLAKTPLSIS TDPAIKGAPR DFIVPIRELR LCAGAEFVYA LCGSVMTMPG LPEKPSFMAL DIDQHGNIVG LS
|
| |