Gene Sbal223_3748 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSbal223_3748 
Symbol 
ID7087412 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameShewanella baltica OS223 
KingdomBacteria 
Replicon accessionNC_011663 
Strand
Start bp4451274 
End bp4453016 
Gene Length1743 bp 
Protein Length580 aa 
Translation table11 
GC content51% 
IMG OID643462628 
Productformate--tetrahydrofolate ligase 
Protein accessionYP_002359649 
Protein GI217974898 
COG category[F] Nucleotide transport and metabolism 
COG ID[COG2759] Formyltetrahydrofolate synthetase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones28 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCTGACAG ATATGGATAT TTCGAGTCGC GCAAGCCTTA AAAACATCAC CGAACTGGGC 
GCCGACTTAG GCCTGTTACC CGAAGAGATG ATGCTATTTG GTCACACCAA AGCCAAGGTG
GAGCTTAGCG TGTTGCAGCG TTTAGCGGGG CAACGCAAAG GTAAATTGAT TATCGTCACC
GCAGTTACGC CTACGCCCCA CGGCGAAGGT AAAACGGTGA CGAGCATTGG CTTAACGCAA
TCGTTAAATG CCATAGGCCA AAAGGCCTGT GCTTGTATTC GTCAGCCCAG CATGGGACCT
GTGTTTGGGG TGAAAGGCGG TGCTGCGGGT GGCGGTTATG CGCAAGTGGT GCCCATGCAG
GAGATGAACT TACATCTCAC GGGTGACATT CATGCCGTCA GTAGTGCCCA CAATTTAGGG
GCCGCGGCGA TTGCGGCGCG GCTATTCCAC GAGGCGCGCT TAGGTAAAAC AGAATTTGAA
GCGCAATCTG GACAAGCGTT TTTGGATATA GCTCCCAACG AGATCCGCTG GCACCGAGTG
GTCGATCATA ATGATCGCTG CCTTAGGCAA ATTCATGTGG GCTTAGGCGA TAATAACGGC
CCGGAGTACG GATCTAGTTT TGATATTACT GCCGCCTCTG AGTTGATGGC GATTTTAGCC
TTAAGCCATG ATTTGGCGGA TATGCGGGCG CGGATCGGTC GTTTAGTGCT GGCCTTAAAT
ACCCAAGGGC AAGTGATTAC CGCCGAAGAC TTAGGTGTGG CTGGGGCGAT GACGGCCATC
ATGGCCGATG CGATAAAACC GACGTTAATG CAGACCTTAA ATGGCTCGCC TTGTCTGATC
CATTCGGGAC CCTTTGCCAA TATTGCCCAC GGTAACTCGT CGATCATTGC CGATGATATC
GCCCTCAGGC TGGCGGATTT TGTGGTGACT GAAGGTGGTT TTGGCTCGGA TATGGGCTTT
GAAAAGTTCT GTAATATCAA AGTACGTCAA TCGGGTCAGG CGCCAGCTGC CGCTGTGTTG
GTGACGACTT TAAAAGCGCT GAAGGCCAAT AGTGGCTTGG CAACTGAGGT GGATAGCAAT
GCACCAAATA TCCATGTGCC AAACATCAAT GCTCCGGATC AGGCTCGACT CGAAGCGGGC
TTTGCAAATT TAAACTGGCA TATCAACAAT GTGGCGCGTT ATGGCATTCC TGTGGTGGTG
GGCATTAACC GTTTCGCCAC TGATTCCGAT GCTGAGTTGC AATGGCTAAT GGAAGCTGTC
AACGCGAGTG CCGCCTTTGG CTGTGAAATT AGCGATGCTT TCAGCCAAGG GGAAGCGGGC
GCTATCGCCT TAGCACGAAC TGTGGTTCGC GCCGCTGAAA CTGAAAGTCA GTTCAAACTG
TTATATCCAG ATGAAGCGTC ATTAGAGGCA AAGTTATCCA CCTTAGCCGA AGTGGGCTAT
GGCGCTTCAG GCGTGAGTCT TTCTATCGAG GCGAAACAGC AGGCGCAGCA ACTGACCGCC
CTTGGCTATG GGCATTTGCC TTTGTGTATG GCCAAAACGC CGTTATCCAT TAGCCACGAT
CCTAGCTTAA AGGGTGTTCC TAAGGATTTT GTGGTGCCAG TGCGAGAGTT AGTGTTACAT
GCGGGAGCGG GATTTATAAC TGCCTTAGTG GGGAATGTGA TGACTATGCC AGGCCTTGGG
CTTAAGCCGG GTTACTTGAA AATCGATATT GATGCCAAAG GTGAGATAGT CGGTTTAGGC
TAA
 
Protein sequence
MLTDMDISSR ASLKNITELG ADLGLLPEEM MLFGHTKAKV ELSVLQRLAG QRKGKLIIVT 
AVTPTPHGEG KTVTSIGLTQ SLNAIGQKAC ACIRQPSMGP VFGVKGGAAG GGYAQVVPMQ
EMNLHLTGDI HAVSSAHNLG AAAIAARLFH EARLGKTEFE AQSGQAFLDI APNEIRWHRV
VDHNDRCLRQ IHVGLGDNNG PEYGSSFDIT AASELMAILA LSHDLADMRA RIGRLVLALN
TQGQVITAED LGVAGAMTAI MADAIKPTLM QTLNGSPCLI HSGPFANIAH GNSSIIADDI
ALRLADFVVT EGGFGSDMGF EKFCNIKVRQ SGQAPAAAVL VTTLKALKAN SGLATEVDSN
APNIHVPNIN APDQARLEAG FANLNWHINN VARYGIPVVV GINRFATDSD AELQWLMEAV
NASAAFGCEI SDAFSQGEAG AIALARTVVR AAETESQFKL LYPDEASLEA KLSTLAEVGY
GASGVSLSIE AKQQAQQLTA LGYGHLPLCM AKTPLSISHD PSLKGVPKDF VVPVRELVLH
AGAGFITALV GNVMTMPGLG LKPGYLKIDI DAKGEIVGLG