Gene Franean1_4254 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_4254 
Symbol 
ID5672609 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp5073629 
End bp5074873 
Gene Length1245 bp 
Protein Length414 aa 
Translation table11 
GC content76% 
IMG OID641243127 
Producthomoserine O-acetyltransferase 
Protein accessionYP_001508544 
Protein GI158316036 
COG category[E] Amino acid transport and metabolism 
COG ID[COG2021] Homoserine acetyltransferase 
TIGRFAM ID[TIGR01392] homoserine O-acetyltransferase 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.218969 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value0.848752 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCGACC AGCGTCAGGA CGCCGCACCC GCCAGCCAGC CGGCGCCGCC ACCACGGCCA 
CCATCGGGTC CGCGGCCGAC GCCGGCACCG GCACCGCCGT CGCCGTCGCC GTCGCCGCCG
CCGCCGGCGT CGGCCGCGTG GCGCCCCGGC GACCCGGTGG GAGATCGCCG GTTCGTCAGG
TCGGCCGGCC CGCTGACCCT GGAACGCGGT GGTTCGCTGC CCGAGGTGAC GATCGCCTAC
GAGACGTGGG GCACGCTCGC GCCCGACGCG GGTAACGCCG TGCTGGTGCT GCACGCGCTC
ACCGGTGACA GTCACGCCGC CGGCGGCGCC GGGCCGGGGC ATCCGACCCC GGGCTGGTGG
GGTGGGCTGG TGGGGCCGGG CGCAGTCCTC GACACCGACC GGTTCTTCGT CGTCTGCCCG
AACGTCCTCG GCGGCTGTCA GGGCACGACG GGCCCGGCCT CCGCCGCATC AGACGGACGT
CCCTGGGGCG GCCGCTGGCC GGAGATCACC GTCGGCGACC AGGTCAGGGC GGAGGCCCTG
CTGGCCGACG AGCTCGGCGT GGGCCGGTGG GCGGCCGTCA TCGGCGGCTC GATGGGCGGG
ATGAGGGCGC TGGAATGGGC GCTGGCGTTC CCCGCCCGGG TACGGCGCGC CGTCGTGATC
GCCTGCGGAG CGACGGCGAC CGCCGAACAG ATCGCGCTCT ACGCCACCCA GCTCGCGATG
ATCCGCGCCG ACCCGAACTG GCACGGCGGT GACTACCACG ACCGCCCGCC CGGGGCCGGG
CCGCACGTCG GGCTGGGCCT GGCCCGGCAG ATGGGGCAGG TCAGCTACCG CAGCGAGCGC
GAGCTGGCCC ACCGGTTCGG CAACGCGGTG CAGGCGGACG GCCGCTACGC CGCCGCATCC
TACGTCGAGC ATCACGGGGC GAAGCTGGCG CACCGTTTCG ACGCCGGCAG TTACCTCACG
CTGACCGCGG CGATGATGAG CCAGGACGCC GGCCGCGGCC GTGGCGGCGT GCCGGCGGCG
CTGCGGGCCT GCCCCGTCCC GGTGACGGTC GCCGGCATCG ACAGTGACCG CCTCTACCCG
CCGCGCCTGC AGGCCGAGCT CGCCCGCCAC CTGGGCACCG AGCTGCGTCT CGTCCCGTCC
GCGTCGGGCC ACGACGGCTT CCTGCTGGAG ACGGCCGCCG TCGGCCAGAT CGTGCGCGAC
GCGCTCACTC CCGCGGCGAC GGGCCCAAGG ACACCGTCGT CATGA
 
Protein sequence
MTDQRQDAAP ASQPAPPPRP PSGPRPTPAP APPSPSPSPP PPASAAWRPG DPVGDRRFVR 
SAGPLTLERG GSLPEVTIAY ETWGTLAPDA GNAVLVLHAL TGDSHAAGGA GPGHPTPGWW
GGLVGPGAVL DTDRFFVVCP NVLGGCQGTT GPASAASDGR PWGGRWPEIT VGDQVRAEAL
LADELGVGRW AAVIGGSMGG MRALEWALAF PARVRRAVVI ACGATATAEQ IALYATQLAM
IRADPNWHGG DYHDRPPGAG PHVGLGLARQ MGQVSYRSER ELAHRFGNAV QADGRYAAAS
YVEHHGAKLA HRFDAGSYLT LTAAMMSQDA GRGRGGVPAA LRACPVPVTV AGIDSDRLYP
PRLQAELARH LGTELRLVPS ASGHDGFLLE TAAVGQIVRD ALTPAATGPR TPSS