Gene Sala_1061 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSala_1061 
Symbol 
ID4082326 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSphingopyxis alaskensis RB2256 
KingdomBacteria 
Replicon accessionNC_008048 
Strand
Start bp1093797 
End bp1095128 
Gene Length1332 bp 
Protein Length443 aa 
Translation table11 
GC content71% 
IMG OID638009422 
ProductFolC bifunctional protein 
Protein accessionYP_616111 
Protein GI103486550 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG0285] Folylpolyglutamate synthase 
TIGRFAM ID[TIGR01499] folylpolyglutamate synthase/dihydrofolate synthase 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCCGACC ACGCGCACAG CACCGACCCC GCCGTCCAGG CCCAGCTCGA CCGCCTCGCC 
GCGCTGTCGC CGGGGCGCGA CATCCTCGGC CTCGAACGCA TCACCGAGCT TTGCCACCGC
CTCGGCGATC CGCAGCACCG GCTGCCGCCC GTCTTCCACG TCGCGGGAAC CAACGGCAAG
GGTTCGACCT GCGCCTTCCT GCGCGCCGCA ATCGAGGCGC AGGGCCTGAC CGCACATGTT
TACTCCAGCC CGCACCTCGT CCGCTTCAAC GAGCGCATCC GTCTTGCCGG AGCGCTGATC
GACGACGCGC TGCTCGCCGC GCTGCTCGCC GAAGTGCTCG ACGTCGCAGA AGACCTGCAA
GCGAGCTTTT TCGAGGTCAC GACCGCCTCC GCCTTCCTCG CCTTTGCGCG CACGCCCGCC
GACGCCTGCA TCGTCGAGGT CGGGCTGGGC GGGCGGCTCG ACGCGACGAA CATCATCGCC
GCACCGACCG TCTGCGGCAT CGCCTCGCTC GGCATCGACC ATGAGGCGTT CCTGCTCGCG
CCCGAGGCCG ATGTGCCCGC GCCGCCGCAC GACCGCATCG CCTTTGAAAA GGTCGGAATC
GCCAAGCCGG GCGCACCGCT GGTGACACAG GCCTATCCGG CGTCGATGCG CGCGGTGATC
GCCGCGCGCG CCGCCGCCGT CGGCGCCCCG CTCATCGAAC GCGGCGCGGC ATGGGATGCC
GCCATCGCCG ACGGGTGCCT TGCCTATCGC GACGCGCGCG GCGCCCTCGA CCTGCCCCTG
CCCGCAATGG CCGGCGCGCA TCAGGCCGAC AATGCCGCGC TCGCAGTCGC GATGCTCCGC
CATCAGGACG CCGTCCGCAT CGACCAGGCC GCTTTCGCCG CCGCCATGAC GCATGCGACC
TGGCCCGCCC GAATGCAGCG GCTCGGCGCA GGGCCGCTGA CCGACCTGCT GCCACAGGGC
ACCGCCGTCT GGCTCGACGG CGGTCACAAC CCCGACGCGG GCATCGCCCT TTCCCGCGCG
CTGGAAGGCG TGGCGCCGCT CCACATCGTC TGCGGCCTGC TCGCGAACAA GAATGCGATG
GGGTTCCTGC GTCCCTTCGC GGACCGCATT GCCTCGTTCA GCGCGGTTCC GATCCCGGGC
CACGAGCATC ATGACCCCAA GGATCTGTGC TGGTGGGTGC AGGACGGGCT TGGCATTGTC
GATGCCCGCC CGTGCGACAA TGTGCCGACC GCGCTGCGCC AGCTTGCGGA CGCGCGCGGC
GACGTCCTCA TCTGCGGCTC GCTCTATCTC GCGGGCGAAG TCCTTGCCGC GAATGGCGAG
GTGCCCGAAT GA
 
Protein sequence
MPDHAHSTDP AVQAQLDRLA ALSPGRDILG LERITELCHR LGDPQHRLPP VFHVAGTNGK 
GSTCAFLRAA IEAQGLTAHV YSSPHLVRFN ERIRLAGALI DDALLAALLA EVLDVAEDLQ
ASFFEVTTAS AFLAFARTPA DACIVEVGLG GRLDATNIIA APTVCGIASL GIDHEAFLLA
PEADVPAPPH DRIAFEKVGI AKPGAPLVTQ AYPASMRAVI AARAAAVGAP LIERGAAWDA
AIADGCLAYR DARGALDLPL PAMAGAHQAD NAALAVAMLR HQDAVRIDQA AFAAAMTHAT
WPARMQRLGA GPLTDLLPQG TAVWLDGGHN PDAGIALSRA LEGVAPLHIV CGLLANKNAM
GFLRPFADRI ASFSAVPIPG HEHHDPKDLC WWVQDGLGIV DARPCDNVPT ALRQLADARG
DVLICGSLYL AGEVLAANGE VPE