Gene Francci3_2831 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFrancci3_2831 
Symbol 
ID3904743 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. CcI3 
KingdomBacteria 
Replicon accessionNC_007777 
Strand
Start bp3334237 
End bp3335460 
Gene Length1224 bp 
Protein Length407 aa 
Translation table11 
GC content74% 
IMG OID637880152 
Producthomoserine O-acetyltransferase 
Protein accessionYP_481918 
Protein GI86741518 
COG category[E] Amino acid transport and metabolism 
COG ID[COG2021] Homoserine acetyltransferase 
TIGRFAM ID[TIGR01392] homoserine O-acetyltransferase 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value0.630893 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCGGGTT CCGTGCGGCC AAGAGCTGAG TCGCGTGGCA CCGGTGCGGC CCTTTCGGTC 
GAACCGGTAC CGCCGACGGT CCAACCGATA CCGCCGACGG TCCAACCGAT ACCGCCGACG
CCACCGCCCG CCTCCGGGGC ATGGCGCGCC GGGATCGACC CGGTGGGACG GCGCCGGTTC
GTCGACCTGC CGGGACCGCT GCAGCTGGAA CGCGGCGGCA TCCTGCCCGG CGTGACGGTG
GCCTACGAGA CGTGGGGCCG CCTCGACGCC GCGGCCACCA ACGCGGTGCT GGTCCTGCAC
GCGCTCACCG GGGACAGTCA CGCCGTCGGC CCGCCCGGGC CGGGCCATCC CACCCCAGGC
TGGTGGGATG GCCTGATCGG GCCTGGGCGG GCCCTCGATA CCGATCGCCT CTTCGTGGTC
TGTCCGAATG TGCTGGGCGG CTGTCAGGGC ACGACCGGGC CAGCCAGTCC CGCGCCGGAC
GGCCGACCCT GGGGCGGCCG ATGGCCCGAG ATCACGATCT CCGATCAGGT CACGGTGGAG
GTCGCCGTCG CCGACGCGCT CGGCATCCGG CGCTGGGCCG CGGTGGTCGG CGGCTCGATG
GGGGGCATGC GGGCCCTGGA GTGGGCTGTC GGCCATCCCG ACCGGGTCGA CCACGCCGTG
GTCCTGGCCT GCGGCGCGGC TGCGACGGCG GAGCAGATCG GGTTGTCCGC GGTGCAGCTT
CGTGCGATCA TCGACGACCC GGCCTGGAAC GGCGGCGACT ACCACGGCCG GCCCGGCGGA
CGCGGCCCGG ACGCCGGCAT GGGTCTGGCC CGGCGGGTGG CCCAGATCAG CTATCGCAGC
GAGGCCGAAC TGGAGGAGCG GTTCGCGGAT CGGACCCGGC CCGACGGGTT GTTCGAGGTC
GCCTCCTACC TCGACCACCA TGCCGGCAAG CTGGCCGCTC GGTTCGACGC CGGCACTTAC
GTCGCACTGA CCCGGGCGAT GATGACCCAG GACGTCGGCC GGGGGCGCGG GGGGCGCGCG
TCGGCGCTAC GGTCCTGCCC GGTGCCGTTC ACCGTCGCGG GGGTCGACTC CGACCGGCTC
TATCCCCTCC ATCTGCAGGA GTACATCGCC GAGCGCGTCG GCGCGCCGTT GCGCGTCGTC
CACTCGCGGC GCGGGCACGA CGGGTTTCTG ATCGAGACCG AGCAGGTCGC CGCAATCGTC
CACGACGCCC TCCGGACGGC CTGA
 
Protein sequence
MPGSVRPRAE SRGTGAALSV EPVPPTVQPI PPTVQPIPPT PPPASGAWRA GIDPVGRRRF 
VDLPGPLQLE RGGILPGVTV AYETWGRLDA AATNAVLVLH ALTGDSHAVG PPGPGHPTPG
WWDGLIGPGR ALDTDRLFVV CPNVLGGCQG TTGPASPAPD GRPWGGRWPE ITISDQVTVE
VAVADALGIR RWAAVVGGSM GGMRALEWAV GHPDRVDHAV VLACGAAATA EQIGLSAVQL
RAIIDDPAWN GGDYHGRPGG RGPDAGMGLA RRVAQISYRS EAELEERFAD RTRPDGLFEV
ASYLDHHAGK LAARFDAGTY VALTRAMMTQ DVGRGRGGRA SALRSCPVPF TVAGVDSDRL
YPLHLQEYIA ERVGAPLRVV HSRRGHDGFL IETEQVAAIV HDALRTA