Gene Franean1_4257 details

Gene Information

Locus tag: Franean1_4257
Symbol:
ID: 5672612
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. EAN1pec
Kingdom: Bacteria
Replicon accession: NC_009921
Strand:
Start bp: 5078084
End bp: 5079490
Gene Length: 1407 bp
Protein Length: 468 aa
Translation table: 11
GC content: 74%
IMG OID: 641243130
Product: O-acetylhomoserine/O-acetylserine sulfhydrylase
Protein accession: YP_001508547
Protein GI: 158316039
COG category: [E] Amino acid transport and metabolism
COG ID: [COG2873] O-acetylhomoserine sulfhydrylase
TIGRFAM ID: [TIGR01326] OAH/OAS sulfhydrylase
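
The start and end coordinates, gene length, and protein length in this record are mutually consistent. The short sketch below checks that arithmetic, assuming the coordinates are 1-based and inclusive and that the final codon is a stop codon that is not counted in the protein length. The variable names are illustrative and are not part of the database.

```python
# Values copied from the record; coordinates assumed 1-based and inclusive.
start_bp = 5078084
end_bp = 5079490

gene_length = end_bp - start_bp + 1         # inclusive span in base pairs
codons, remainder = divmod(gene_length, 3)  # a CDS should be a whole number of codons
protein_length = codons - 1                 # assumes the last codon is an untranslated stop

print(gene_length, remainder, protein_length)  # 1407 0 468
```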


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0911184
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 11
Fosmid unclonability p-value: 0.390443
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGACCGACA CCATCGGCGC CTCCCCGGCC CGCCAGAACA CCGCCGCGCC GGCAGGCCAG 
AAGGCCGTCA CGCCGGCGCC CCGCGAGAGG GGCGAGCCGG GGCGGCCGGC GGCGGGCACG
CCGGACTGGT CGTTCGAGAC CCGGCAGATC CACGCCGGGA CCCGCCCCGA TCCCACGACC
GGAGCCCGTG CGGTGCCGAT CTACCAGAGC ACCTCGTTCG TGTTCCCGGA CGCGGCCTAC
GCCGCCGGTC TGTTCGCCCT CACCGAGACG GGTTTCACCT ACACACGGGT GGTGAACCCG
ACGCAGGACG CCCTCGAGCA GCGCGTCGCC GACCTCGAGG GCGGTGTCGC CGCCCTGGCG
ACAGCCAGCG GGCAGGCCGC GCAGACCCTG GCCGTCCTCA ACCTCGCCTC CGCCGGCGAC
CACCTGGTGT CGTCCGCGTC GCTCTACGGC GGAACGTACG CCCTGTTCCG CCACACCCTG
GCCGACCTGG GCATCGAGAC CACGTTCGTC GACGACCCGG ACGACCTCGA CGCCTGGCGC
GCGGCAATCC GACCGAACAC CGCGGCGTTC TACGGCGAGA CGATCGGCAA CCCGCGCGGG
AACGTCCTCG ACATCGCCGG CGTCGCGGGG GTCGCGCACG AGGCCGGGAT ACCGCTGCTC
GTCGACAACA CGCTGGCCTC GCCCTACCTG GCCCGCCCAC TCGAGCATGG GGCTGACGTC
GTGCTGCACT CGGCCACGAA GTTCATCGGC GGGCACGGGA CATCCATCGG CGGGATCATC
GTTGACGGCG GCCGGTTTGA CTGGGGCAAC GGGCGCTACC CGCGGTTCAC CGAACCGGAC
CCGGCTTACG ACGGCCTGGT GTTCCTCGAC GCCTTCCCCG AGACGGCCTT CATCCTGCGG
GCCCGCGCGC GGCTGCTGCG CGACCTCGGC CCCGCGCTGG CGCCGATGAA CGCGTTCCTG
CTGTTGCAGG GCCTGGAGAC GCTGTCGCTG CGGATGGAGC GGCACAGCGC CAACGCGCTG
CGCGTCGCCG AGTGGCTGGC CGCGCGCGAC GAGGTCGGCT GGGTGGCCTA CCCGGGGCTG
CGCTCCAGCC CGTGGCACGC CGCAGCGGCG CGTTACCTGC CGCGCGGCAC CGGGGCGGTG
GTCGCGTTCG GCCTGCGCGG CGGGGCGGCG GCCGGGCCCG CGTTCATCGA GGCGCTGGAG
CTGCACAGCC ACCTGGCGAA CGTCGGCGAC GTCCGCTCGC TGGCCATCCA CCCGGCGTCC
ACCACGCACG CGCAGCTGCG CGAGGACGAA CGGCTCACCA GCGGGGTATC CGCCGACCTG
GTCAGGCTCT GCGTCGGCCT CGAGGGCGTG GACGACATCC TCGCCGACCT CGACGGGGCG
TTCCGCGCGA TCGCGGGACC CGCCTGA
 
Protein sequence
MTDTIGASPA RQNTAAPAGQ KAVTPAPRER GEPGRPAAGT PDWSFETRQI HAGTRPDPTT 
GARAVPIYQS TSFVFPDAAY AAGLFALTET GFTYTRVVNP TQDALEQRVA DLEGGVAALA
TASGQAAQTL AVLNLASAGD HLVSSASLYG GTYALFRHTL ADLGIETTFV DDPDDLDAWR
AAIRPNTAAF YGETIGNPRG NVLDIAGVAG VAHEAGIPLL VDNTLASPYL ARPLEHGADV
VLHSATKFIG GHGTSIGGII VDGGRFDWGN GRYPRFTEPD PAYDGLVFLD AFPETAFILR
ARARLLRDLG PALAPMNAFL LLQGLETLSL RMERHSANAL RVAEWLAARD EVGWVAYPGL
RSSPWHAAAA RYLPRGTGAV VAFGLRGGAA AGPAFIEALE LHSHLANVGD VRSLAIHPAS
TTHAQLREDE RLTSGVSADL VRLCVGLEGV DDILADLDGA FRAIAGPA
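
The gene and protein sequences above can be related with a small translation routine. The following is a minimal sketch, not the database's own method: it builds the codon table in the standard NCBI layout (translation table 11 assigns the same amino acids as the standard code) and assumes that an alternative initiator codon such as GTG or TTG in the first position is read as methionine. Table 11 permits further start codons that this sketch does not handle. The function names `clean`, `gc_content`, and `translate` are hypothetical helpers. Only the first and last lines of the gene sequence are used as input.

```python
# Codon table in the NCBI layout: bases ordered T, C, A, G at each position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}
# Assumption: only these common bacterial alternative starts are handled.
ALT_STARTS = {"GTG", "TTG"}


def clean(seq):
    """Remove the spaces and line breaks used to group the sequence into 10-mers."""
    return "".join(seq.split()).upper()


def gc_content(seq):
    """Return the fraction of bases that are G or C."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq):
    """Translate an in-frame DNA fragment, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        codon = seq[pos:pos + 3]
        residue = CODON_TABLE[codon]
        if residue == "*":
            break
        if pos == 0 and codon in ALT_STARTS:
            residue = "M"  # alternative initiator codon read as methionine
        protein.append(residue)
    return "".join(protein)


first_30 = "GTGACCGACA CCATCGGCGC CTCCCCGGCC"  # start of the gene sequence
last_27 = "TTCCGCGCGA TCGCGGGACC CGCCTGA"      # final line, ends with the TGA stop

print(translate(first_30))  # MTDTIGASPA
print(translate(last_27))   # FRAIAGPA
print(gc_content(first_30))
```

The two translated fragments match the first ten and last eight residues of the protein sequence listed above. Passing the complete gene sequence to `gc_content` would give a value to compare with the GC content reported in the record.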