Gene Information |
Locus tag | Spea_3720 |
Symbol | |
ID | 5664105 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Shewanella pealeana ATCC 700345 |
Kingdom | Bacteria |
Replicon accession | NC_009901 |
Strand | - |
Start bp | 4525807 |
End bp | 4527519 |
Gene Length | 1713 bp |
Protein Length | 570 aa |
Translation table | 11 |
GC content | 50% |
IMG OID | 641238380 |
Product | formate--tetrahydrofolate ligase |
Protein accession | YP_001503566 |
Protein GI | 157963532 |
COG category | [F] Nucleotide transport and metabolism |
COG ID | [COG2759] Formyltetrahydrofolate synthetase |
TIGRFAM ID | |
|
|
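The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the gene length includes the stop codon; the function names are hypothetical and not part of the database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose length includes one stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # subtract the stop codon


length = gene_length(4525807, 4527519)
print(length)                  # 1713
print(protein_length(length))  # 570
```

Under these assumptions the computed values agree with the Gene Length (1713 bp) and Protein Length (570 aa) fields of the record.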
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.000981333 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCATTCAG ATATTGAAAT TTCGCGCCAA CACACCTGTT TACCTATCAC TGATATCGCT AATAATTTAG GCCTTGAGCC TGATGAACTC TCTCTTTGTG GTGCCAATAA GGCCAAGATT TCGCTGTCGG TCGCTAAACG ACTCGCAGAC AGAGAAAATG CCAAACTGGT GATTGTCACT GCGGTGACGC CAACGCCATT CGGCGAAGGA AAAACCGTAA CAACTATCGG CTTAACCCAA GCCATGAACC TCAACGGCAT TAAAGCCTGC GCCTGTATTC GTCAGCCTAG TATGGGGCCT GTATTTGGCA CTAAGGGCGG CGCCGCGGGT GGCGGTTATG CGCAAGTCGT GCCGATGGAA GAGCTGAACC TACACTTGAC CGGTGATATT CACGCGGTTA GCAGTGCCCA TAATTTAGCG GCAGCCGCAA TCGACGCACG TCTATTCCAT GAAACACGTT TAGGCGCCGA AGCTTTTACT CAAGAATCAG GTTTAACTGC GCTTAATATC GATGCTGAAA ATATCCTTTG GCGCCGAGTG GTTGACCATA ACGAACGCAG CCTAAGACAG ATTAAAGTCG GCTTTGGTGC GGTTAATGGC CCTGTACATG AGTCGGGTTT CGACATCACT GCAGCGTCTG AATTGATGGC GATTCTAGCG CTGAGCCAAG ACTTAAAAGA TCTTAGACAG CGTATTGGTC GCCTCGTATT GGCGCTCAAT AATCAAGGTG AGCCGATCAC AGCCGAAACG CTAGGCGTTG CAGGTGCGAT GACGGTGATT ATGGCCGATG CGATTGAGCC AACCTTGATG CAGACTCTGT CTGGTGATCC ATGCTTTATT CATGCTGGCC CCTTTGCCAA TATAGCGCAT GGTAACTCTT CGATTATTGC CGATACCATC GCGGCTAAGC TTGCCGACGT TGTCGTGACA GAAGCGGGAT TTGGCTCAGA TATGGGCTTT GAAAAGTTCA GTAACATCAA GGTGAGAGAA TCGGGCTACG CTCCGAGCGC ATCTGTAGTG GTAGTGACAC TGAAGGCTCT GAAAGCCAAT AGTGGCATCG AATCTGACCA AGATATTAAC CAGCCAGATA CGCAGCGTCT AAAGGTCGGC TTTGCTAACC TTGAATGGCA TATCAACAAC GTCAGCCAAT ATGGCGTGCC AGTTGTGGTC GCAATTAACC GTTTCCCGAC CGATACCGAC GAGGAACTTG AGTGGCTGAA GCAGGCTATT GAGCAAACTA AGGCCTTCGG CAGTGAGATA AGCGAAGCCT TTGGTAAAGG CGCAGAAGGA GCGACTAAGC TTGCGGATAT GGTTTACCGT GCGACGCAGA CGCCATCTGA CTTTAACTTG CTGTACGAGA GCAATACTAG CCTTGAGTCT AAGCTGATGA CGCTTGCAGA GGTGGGTTAT GGCGCATCGA GCGTCACTCT ATCTGATAAA GCTAAGTCAC AATTAAACTG GCTGGCTAAG CATAACTATG GCGAGCTACC TATCTGTGTC GCTAAGACGC CGATGTCGAT AAGCCACGAT CCAGATATTA AAGGGATCCC GACAGGTTTT GAACTGCCTA TTACTGAGCT AAGGCTTAAT GCTGGGGCAG GCTTTGTCAC GGCGCTTGTG GGCAAAGTGA TGACTATGCC AGGGCTGGGG ATTAAGCCTG GCTATCTCAA TGTCGATATC AACGATGATG GCGAGATTGT TGGATTAGCC TAA
|
Protein sequence | MHSDIEISRQ HTCLPITDIA NNLGLEPDEL SLCGANKAKI SLSVAKRLAD RENAKLVIVT AVTPTPFGEG KTVTTIGLTQ AMNLNGIKAC ACIRQPSMGP VFGTKGGAAG GGYAQVVPME ELNLHLTGDI HAVSSAHNLA AAAIDARLFH ETRLGAEAFT QESGLTALNI DAENILWRRV VDHNERSLRQ IKVGFGAVNG PVHESGFDIT AASELMAILA LSQDLKDLRQ RIGRLVLALN NQGEPITAET LGVAGAMTVI MADAIEPTLM QTLSGDPCFI HAGPFANIAH GNSSIIADTI AAKLADVVVT EAGFGSDMGF EKFSNIKVRE SGYAPSASVV VVTLKALKAN SGIESDQDIN QPDTQRLKVG FANLEWHINN VSQYGVPVVV AINRFPTDTD EELEWLKQAI EQTKAFGSEI SEAFGKGAEG ATKLADMVYR ATQTPSDFNL LYESNTSLES KLMTLAEVGY GASSVTLSDK AKSQLNWLAK HNYGELPICV AKTPMSISHD PDIKGIPTGF ELPITELRLN AGAGFVTALV GKVMTMPGLG IKPGYLNVDI NDDGEIVGLA
|
| |