Gene Sare_3871 details


Gene Information

Locus tag: Sare_3871
Symbol:
ID: 5707465
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Salinispora arenicola CNS-205
Kingdom: Bacteria
Replicon accession: NC_009953
Strand:
Start bp: 4406331
End bp: 4407665
Gene Length: 1335 bp
Protein Length: 444 aa
Translation table: 11
GC content: 69%
IMG OID: 641273292
Product: FolC bifunctional protein
Protein accession: YP_001538654
Protein GI: 159039401
COG category: [H] Coenzyme transport and metabolism
COG ID: [COG0285] Folylpolyglutamate synthase
TIGRFAM ID: [TIGR01499] folylpolyglutamate synthase/dihydrofolate synthase
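
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the Start bp and End bp coordinates are 1-based and inclusive and that the final codon is a stop codon that contributes no residue; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 4406331
end_bp = 4407665

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and adds no amino acid.
protein_length = gene_length // 3 - 1

print(gene_length, "bp")      # 1335 bp
print(protein_length, "aa")   # 444 aa
```

Under these assumptions the computed values agree with the Gene Length and Protein Length fields.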


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.000822129
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
GTGACCGGAC ACCCCGACTT CGCCGCCGTC GAGGCTGAGC TCGCCACACG CGGGTTCACC 
CGGATGGTCT TCGAACTGGA CCGAATCGAG ACGCTGCTGG ATCTGCTCGG GAGCCCGCAG
CGGGCGTACC CGTCGATCCA CCTCACCGGC ACCAACGGAA AGACCTCGAC GGCCCGCATG
ATCGATTCGC TGTTACGGGC GTTCGGGCTG CACACCGGGC GGTACACCAG TCCGCACCTG
GAGACTGTCC GGGAGCGGAT CAGCCTTGCC GGTGAACCGG TCGACGAGCA GCGCTTCGTC
GACACCTACC GCGAGGTGGC GCCGCTGGCC CGACTCGTCG ACGAGCGGTC CGCGGAGCCC
CTGACCTACT TCGACCTGAC CACGGCGCTG GCCTTCGCCA CGTTCGCCGA CGCCCCGGTC
GACGTCGCGG TCGTCGAGGT GGGTCTCGGC GGGGCGGAGG ACTCGACCAA CGTGCTCCAG
GCCGGTGTCG CGGTGTTGAC CCCGATCGGG CTCGACCACA CCGAGTGGCT CGGCGACAGG
GTCGAGGACA TCGCGCTGCA CAAGGCGGGC ATCATCCACA AGGGCGCCAC GGTGATCTCC
GCTGAGCAGC AGGAAGAGGC GGCGCGTCCG ATTCTCGAGC GCTGCGCCGA GGTCGGCGCG
ACGATCGCCC GGGAGGGTGG CGAGTTCGGG GTGTTGAGCC GAGCGGTCGC CGTCGGCGGT
CAGGTACTCA CCCTGCAGGG GCTCGGCGGT CGGTACGAGG AGATCTTCGT CCCGTTGCAC
GGTGCCCATC AGGCGCAGAA CGCCGCGGTG GCGCTTGCGG CCGTAGAGGC GTTCCTCGGC
GCGGGTACCC GCCGGCAGTT GGACGTCGAA ACGGTCCGGG AGGGGTTCGC GACAGTCACC
TCGCCGGGTC GGCTGGAGCG AGTCCGTGCC GCGCCGACCG TGTTGCTCGA TGGTGCGCAC
AACCCGCAGG GTATGGCCGC CACGGTCACC GCGTTGCAGG AGGAGTTTGC GTTCAGCAAG
CTGGTCGCCG TCCTCGGTGT GCTCGGTGAC AAGGATGTGA CCCGTCTGCT GGAACTGCTG
GAGCCGGTCA TCGATCAGTT GGTGGTCACC CGCAACAGTT CACCGCGGGC GATGGCGACC
CAGGAACTGG CGACACTCGC CGCCGAGGTG TTCGGACCGG ACCGGGTGGC GGTGGCAGAG
CAGATGCCGG ACGCCATCGA AGTGGCGGTG GCGTTGGCCG AGGAAGACGT CCCTGGTGAG
CTGGCCGGGG TCGGCGTACT CGTCACCGGT TCGGTGGTGA CCGTGGCCGA CGCCCGCCGG
CTGCTCAAGC GATGA
 
Protein sequence
MTGHPDFAAV EAELATRGFT RMVFELDRIE TLLDLLGSPQ RAYPSIHLTG TNGKTSTARM 
IDSLLRAFGL HTGRYTSPHL ETVRERISLA GEPVDEQRFV DTYREVAPLA RLVDERSAEP
LTYFDLTTAL AFATFADAPV DVAVVEVGLG GAEDSTNVLQ AGVAVLTPIG LDHTEWLGDR
VEDIALHKAG IIHKGATVIS AEQQEEAARP ILERCAEVGA TIAREGGEFG VLSRAVAVGG
QVLTLQGLGG RYEEIFVPLH GAHQAQNAAV ALAAVEAFLG AGTRRQLDVE TVREGFATVT
SPGRLERVRA APTVLLDGAH NPQGMAATVT ALQEEFAFSK LVAVLGVLGD KDVTRLLELL
EPVIDQLVVT RNSSPRAMAT QELATLAAEV FGPDRVAVAE QMPDAIEVAV ALAEEDVPGE
LAGVGVLVTG SVVTVADARR LLKR
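
The relationship between the gene sequence and the protein sequence can be illustrated with a short translation routine. This is a sketch, not the pipeline used to produce this record: the helper `translate` and the `START_CODONS` set are hypothetical names introduced here. It assumes the codon-to-amino-acid assignments of the standard genetic code (which translation table 11 shares) and treats a leading ATG, GTG, or TTG as an initiator read as methionine, which is only a subset of the start codons table 11 allows.

```python
from itertools import product

# Codons enumerated in T, C, A, G order with the third base varying fastest;
# the amino-acid string follows that same order (standard code assignments).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Assumption: a subset of the bacterial start codons, each read as Met
# when it is the first codon of the coding sequence.
START_CODONS = {"ATG", "GTG", "TTG"}


def translate(dna: str) -> str:
    """Translate a coding sequence, ignoring whitespace, up to the first stop."""
    dna = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        codon = dna[i:i + 3]
        if i == 0 and codon in START_CODONS:
            protein.append("M")  # initiator codon is read as methionine
            continue
        aa = CODON_TABLE[codon]
        if aa == "*":            # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence listed in this record.
print(translate("GTGACCGGAC ACCCCGACTT CGCCGCCGTC"))  # MTGHPDFAAV
```

The output matches the first ten residues of the protein sequence, including the methionine encoded by the GTG start codon.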