Gene SAG1055 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSAG1055 
Symbolfhs 
ID1013859 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameStreptococcus agalactiae 2603V/R 
KingdomBacteria 
Replicon accessionNC_004116 
Strand
Start bp1065109 
End bp1066779 
Gene Length1671 bp 
Protein Length556 aa 
Translation table11 
GC content38% 
IMG OID637316237 
Productformate--tetrahydrofolate ligase 
Protein accessionNP_688064 
Protein GI22537213 
COG category[F] Nucleotide transport and metabolism 
COG ID[COG2759] Formyltetrahydrofolate synthetase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAAACAG ATATTGAAAT AGCGCAAAGC GTCGCCCTAA AGCCTATTGC TGAGATTGTT 
GAGCAAGTAG GAATTGGTTT TGATGATATA GAATTGTATG GTAAATACAA AGCAAAATTA
TCTTTTGATA AGATTGAAGC TGTAAAATCT CAAAAAGTAG GAAAGTTAAT TTTAGTAACT
GCTATTAATC CGACACCAGC AGGTGAGGGA AAATCCACTA TGAGCATTGG ATTAGCTGAT
GCTTTAAATA AAATTGGTAA AAAAACGATG ATTGCTCTTC GCGAACCTTC TTTAGGTCCT
GTAATGGGTA TTAAAGGAGG AGCTGCTGGA GGTGGATACG CACAAGTACT ACCAATGGAA
GATATTAATT TGCATTTTAC TGGTGATATG CACGCAATCA CAACTGCTAA TAATGCTTTA
TCAGCTCTAC TTGATAATCA TATTCATCAA GGCAATGAAT TAGATATTGA CCAACGTCGA
GTGATTTGGA AGCGTGTAGT TGATCTTAAT GATCGTGCTC TTCGTCAGGT GATCGTTGGC
TTAGGAAGCC CTGTCAATGG GATACCACGT GAAGATGGCT TTGATATTAC TGTTGCTTCT
GAAATAATGG CTATTCTATG TTTAGCTACT GACCTATCAG ATTTAAAGAA ACGCCTGTCA
AATATTGTTG TGGCTTACTC AAGGAACCGT AAACCAATCT ATGTTAAAGA TTTGAAGATT
GAAGGAGCTC TCACACTTAT CCTAAAAGAT ACTATCAAAC CTAATTTGGT TCAAACCATT
TATGGGACAC CGGCTCTTGT CCATGGGGGA CCATTTGCTA ATATTGCACA TGGATGTAAT
TCTGTCTTAG CGACTTCAAC AGCTCTACGC TTAGCAGATT ACGTTGTTAC TGAAGCGGGT
TTTGGAGCAG ACTTGGGCGC TGAAAAGTTT CTTGATATAA AAACGCCAAA CCTTCCGACT
TCTCCCGATG CGATTGTTAT TGTTGCTACA TTGCGTGCAT TGAAAATGCA TGGAGGTGTT
TCAAAAGAAG ATCTCTCACA AGAGAACGTC GAAGCAGTTA AAAGAGGTTT TACAAATCTC
GAGCGTCACG TTAACAATAT GCGTCAATAT GGTGTTCCGG TAGTTGTAGC TATCAATCAG
TTTACTGCAG ATACAGAAAG TGAAATTGCA ACTCTTAAAA CCTTATGTAG TAATATTGAT
GTGGCAGTTG AATTAGCAAG TGTGTGGGAA GATGGAGCAG ACGGTGGTCT TGAACTTGCA
CAGACTGTTG CTAATGTGAT TGAAACACAA TCATCCAATT ACAAACGCTT ATATAATGAT
GAAGACACTA TTGAAGAAAA AATTAAAAAA ATTGTTACTA AAATATATGG TGGTAATAAA
GTTCATTTTG GACCTAAGGC ACAAATACAA TTAAAAGAGT TTAGTGACAA TGGCTGGGAC
AAGATGCCTA TTTGTATGGC AAAGACACAA TATAGCTTTT CTGATAATCC AAATTTACTT
GGTGCTCCAA CTGACTTTGA TATAACTGTT CGTGAATTTG TTCCAAAAAC AGGAGCTGGT
TTTATAGTTG CTCTTACTGG AGACGTATTG ACAATGCCTG GTTTACCTAA AAAACCTGCA
GCTCTCAATA TGGATGTTTT AGAAGACGGT ACAGCCATTG GATTATTTTA A
 
Protein sequence
MKTDIEIAQS VALKPIAEIV EQVGIGFDDI ELYGKYKAKL SFDKIEAVKS QKVGKLILVT 
AINPTPAGEG KSTMSIGLAD ALNKIGKKTM IALREPSLGP VMGIKGGAAG GGYAQVLPME
DINLHFTGDM HAITTANNAL SALLDNHIHQ GNELDIDQRR VIWKRVVDLN DRALRQVIVG
LGSPVNGIPR EDGFDITVAS EIMAILCLAT DLSDLKKRLS NIVVAYSRNR KPIYVKDLKI
EGALTLILKD TIKPNLVQTI YGTPALVHGG PFANIAHGCN SVLATSTALR LADYVVTEAG
FGADLGAEKF LDIKTPNLPT SPDAIVIVAT LRALKMHGGV SKEDLSQENV EAVKRGFTNL
ERHVNNMRQY GVPVVVAINQ FTADTESEIA TLKTLCSNID VAVELASVWE DGADGGLELA
QTVANVIETQ SSNYKRLYND EDTIEEKIKK IVTKIYGGNK VHFGPKAQIQ LKEFSDNGWD
KMPICMAKTQ YSFSDNPNLL GAPTDFDITV REFVPKTGAG FIVALTGDVL TMPGLPKKPA
ALNMDVLEDG TAIGLF