Gene BURPS1106A_A2303 details


Gene Information

Locus tag: BURPS1106A_A2303
Symbol: folC
ID: 4905274
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Burkholderia pseudomallei 1106a
Kingdom: Bacteria
Replicon accession: NC_009078
Strand:
Start bp: 2281841
End bp: 2283208
Gene Length: 1368 bp
Protein Length: 455 aa
Translation table: 11
GC content: 68%
IMG OID: 640145408
Product: FolC bifunctional protein
Protein accession: YP_001076336
Protein GI: 126456914
COG category: [H] Coenzyme transport and metabolism
COG ID: [COG0285] Folylpolyglutamate synthase
TIGRFAM ID: [TIGR01499] folylpolyglutamate synthase/dihydrofolate synthase
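
The listed coordinates and lengths can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the protein length excludes the terminal stop codon. Neither convention is stated in the record, but both are consistent with the values shown.

```python
# Values copied from the record above.
start_bp = 2281841
end_bp = 2283208
listed_gene_length = 1368      # bp
listed_protein_length = 455    # aa

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not counted as a residue.
codons, remainder = divmod(gene_length, 3)
protein_length = codons - 1

print("gene length:", gene_length, "bp")
print("leftover bases after splitting into codons:", remainder)
print("protein length:", protein_length, "aa")
print("matches record:",
      gene_length == listed_gene_length
      and protein_length == listed_protein_length)
```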


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.966782
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
TTGTTTTTTG GCGTATTCGC CTCGATGCGC CGCTTTGATT CGAAGTTGGA TTCGATGAGC 
ACTTTTCCCA CTCTCGACGC GTGGCTTTCG CACCTCGAAA GCGCGCACCC CGTCGGCATC
GACATGGGCC TCGCCCGCAT CGGCCAGGTC AAGGACGCGC TGAAGCTCGC ATTCGCGTGC
CCGGTGATCA CGGTCGGCGG CACGAACGGC AAGGGCTCGA CCTGCGCGTT CATCGAGACG
ATCCTCGTAC GCGCGGGCTA CAAGGTCGGC TGCCACACGT CGCCGCACCT GCTGCGCTTT
AACGAGCGCG CGCGCCTCGA CGGGCGGATC GTCGACGACG AGGAACTGCT GCCGCACTTC
GAGGCGGTCG AGGCGGCGCG CACGAGCCTG CCGGAGGCGG TGTCGCTCAC GTACTTCGAG
TTCACGACGC TCGCGATCAT GCATTTGTTC GCATCGCGCG GGCTCGACGC GGTGATTCTC
GAAGTGGGGC TCGGCGGCCG GCTCGACGCG GTCAACGTGA TCGACGCCGA TTGCGCGATC
GTGACGAGCA TCGACGTCGA CCACATCGAA TATCTCGGCG ACACGCGCGA GAAGATCGCG
TTCGAGAAGG CGGGCATCTT TCGGCCGGGC AAGCCCGCGA TCTGCGGCGA CCCGGCGCCG
CCGCAGACGC TCGTCGACCA CGCGGGCGCG ATCGGCGCGG ATCTGTGGCT CGTCGGGCGC
GATTTCCGCT TCTCGACGCA GCCGGGCAGC GAGCGCCAGC AGTGGACGTA CGCCGGCCGC
GACAAGCGCT ATCCGGCGCT CGCGTATCCG GCGCTGCGCG GCGCGAACCA GTTGCTCAAC
GCGTCGGCGG CGCTCGCCGC GCTCGAGGCG CTGCGCGAGC GGCTGCCCGT GTCCGCGCAG
GACATCCGGC TCGGGCTCGC GAACGTCGAG CTGCCGGGGC GCTTCCAGGT GCTGCCCGGC
AAGCCGCTCG TGCTGCTCGA CGTCGCGCAT AACCCGCACG CGGCCGCGGT GCTCGCGCAG
AACCTCGATT CGATGGGCTA CTACCCGTAC ACGCACGCGG TGTTCGGCGC GATGGCCGAC
AAGGATCTCG CGGGAATCGT CGAGCGGCTG AAGGGCGCGA TCGATCACTG GCATCTGACC
GATTTGCCGC TGCCGCGCGC GGCAGCGGCC GACGTGCTCG AGCGCGTGCT GCGCGGCGCG
GGCGTCGAGC ACGGCGCGCA GCACAACATC ACGCGCCATG CGGGCCCGGC CGATGCATTC
CTCGATGCAC TAAAAAGCGC ATCCGACAAT GATAGAATCG TGGTTTTCGG TAGCTTCTAC
ACGGTAGCGG GCGTGATGCC CGTCGTGGAC CGCCGCCATG ACCACTGA
 
Protein sequence
MFFGVFASMR RFDSKLDSMS TFPTLDAWLS HLESAHPVGI DMGLARIGQV KDALKLAFAC 
PVITVGGTNG KGSTCAFIET ILVRAGYKVG CHTSPHLLRF NERARLDGRI VDDEELLPHF
EAVEAARTSL PEAVSLTYFE FTTLAIMHLF ASRGLDAVIL EVGLGGRLDA VNVIDADCAI
VTSIDVDHIE YLGDTREKIA FEKAGIFRPG KPAICGDPAP PQTLVDHAGA IGADLWLVGR
DFRFSTQPGS ERQQWTYAGR DKRYPALAYP ALRGANQLLN ASAALAALEA LRERLPVSAQ
DIRLGLANVE LPGRFQVLPG KPLVLLDVAH NPHAAAVLAQ NLDSMGYYPY THAVFGAMAD
KDLAGIVERL KGAIDHWHLT DLPLPRAAAA DVLERVLRGA GVEHGAQHNI TRHAGPADAF
LDALKSASDN DRIVVFGSFY TVAGVMPVVD RRHDH
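
The gene sequence begins with TTG while the protein begins with M. In NCBI translation table 11, TTG is one of the alternative initiation codons and is read as methionine when it starts a coding sequence. The sketch below is a minimal illustration, not the database's own method: `strip_blocks` and `translate` are hypothetical helper names, and the code assumes the input starts exactly at the start codon. It removes the 10-base block spacing and translates the first line of the gene sequence.

```python
# Codon table in the usual NCBI layout: bases ordered T, C, A, G.
# Table 11 uses the same amino acid assignments as the standard code;
# it differs in which codons may act as start codons.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"


def strip_blocks(text):
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def translate(dna):
    """Translate a CDS that starts at its start codon (assumption)."""
    protein = []
    usable = len(dna) - len(dna) % 3  # ignore a trailing partial codon
    for i in range(0, usable, 3):
        codon = dna[i:i + 3]
        index = (16 * BASES.index(codon[0])
                 + 4 * BASES.index(codon[1])
                 + BASES.index(codon[2]))
        residue = AMINO_ACIDS[index]
        if i == 0:
            residue = "M"   # the initiation codon is read as methionine
        elif residue == "*":
            break           # stop codon ends the protein
        protein.append(residue)
    return "".join(protein)


# First line of the gene sequence, copied from the record above.
first_line = ("TTGTTTTTTG GCGTATTCGC CTCGATGCGC "
              "CGCTTTGATT CGAAGTTGGA TTCGATGAGC")
first_residues = translate(strip_blocks(first_line))
print(first_residues)
```

The printed residues can be compared with the first two blocks of the protein sequence above. Passing the full gene sequence through the same two helpers would give the complete translation for comparison with the listed protein.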