Gene Franean1_4226 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_4226 
Symbol 
ID5672581 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp5033145 
End bp5034080 
Gene Length936 bp 
Protein Length311 aa 
Translation table11 
GC content66% 
IMG OID641243099 
Productsulfate adenylyltransferase subunit 2 
Protein accessionYP_001508516 
Protein GI158316008 
COG category[E] Amino acid transport and metabolism
[H] Coenzyme transport and metabolism 
COG ID[COG0175] 3'-phosphoadenosine 5'-phosphosulfate sulfotransferase (PAPS reductase)/FAD synthetase and related enzymes 
TIGRFAM ID[TIGR02039] sulfate adenylyltransferase, small subunit 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.0954723 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGAAAGAA GTCCATTCAT GAGAACTCCA CTGATGCCGA GGATGACGCA TCTGCAGCGG 
CTGGAGGCGG AGAGCATCCA GATCTTCCGG GAGGCGGTGT CCGAGAGCGA GCGGCCGGTG
ATGCTGTACT CGGTCGGCAA GGACAGCTCG GTGATGCTGC ACCTGGCGAT GAAGGCGTTC
TACCCGTCGA AGCCGCCGTT CTCCCTGCTG CACGTGGACA CGACCTGGAA GTTCCGGGAG
ATGTACGAGT TCCGCGACAG TGTCGTCGAC CAGCTCGGCG TCGAGCTGCT GGTGCACCAG
AACCCGGAGT GCGTCAAGCG GGGGATCAAT CCGTTCGACC ACGGTTCGGC GACGCATACC
GACCTGTGGA AGACCGAGGG TCTCAAGCAG GCGCTGGACA GGTACGGCTT CGACCTGGCG
TTCGGGGGTG CGCGCCGCGA CGAGGAGAAG TCCCGGGCGA AGGAGCGGGT GTTCTCCATC
CGGTCCGCCC AGCACCGGTG GGATCCCAAG GCACAGCGGC CGGAGCTGTG GCGTCTCTAC
AACGCGCGCA CGCAGCCCGG GCAGAGCGTC CGCGTGTTCC CGCTGTCCAA CTGGACCGAA
CTGGACGTGT GGCAGTACAT CCACCGCGAG CGGATCCCGA TCGTTCCCCT CTACTTCGCC
GCGCACCGCC CCGTCGTCGA ACGGGACGGC GCGCTGATCA TGGTCGACGA CGACCGCATG
CCGCTGCGTC CCGGCGAGGT GCCGGCCCGA CGCAGCGTCC GTTTCCGCAC GCTGGGGTGC
TACCCGCTGA CCGGCGCGGT GGAGAGCACC GCCGGTACTC TCCCGCAGAT CATCCAGGAG
ATGTTGCTGA CGACGAGCTC GGAACGCCAG GGCCGGGTGA TCGACCATGA CTCGTCCGGG
TCGATGGAGA AGAAGAAGCA GGAGGGGTAC TTCTGA
 
Protein sequence
MERSPFMRTP LMPRMTHLQR LEAESIQIFR EAVSESERPV MLYSVGKDSS VMLHLAMKAF 
YPSKPPFSLL HVDTTWKFRE MYEFRDSVVD QLGVELLVHQ NPECVKRGIN PFDHGSATHT
DLWKTEGLKQ ALDRYGFDLA FGGARRDEEK SRAKERVFSI RSAQHRWDPK AQRPELWRLY
NARTQPGQSV RVFPLSNWTE LDVWQYIHRE RIPIVPLYFA AHRPVVERDG ALIMVDDDRM
PLRPGEVPAR RSVRFRTLGC YPLTGAVEST AGTLPQIIQE MLLTTSSERQ GRVIDHDSSG
SMEKKKQEGY F