Gene Franean1_3389 details

Gene Information

Locus tag: Franean1_3389
Symbol:
ID: 5671760
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. EAN1pec
Kingdom: Bacteria
Replicon accession: NC_009921
Strand:
Start bp: 4014956
End bp: 4016098
Gene Length: 1143 bp
Protein Length: 380 aa
Translation table: 11
GC content: 69%
IMG OID: 641242277
Product: Propanoyl-CoA C-acyltransferase
Protein accession: YP_001507697
Protein GI: 158315189
COG category: [I] Lipid transport and metabolism
COG ID: [COG0183] Acetyl-CoA acetyltransferase
TIGRFAM ID:
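
The coordinate and length fields above are related by simple arithmetic, which can be checked with a short sketch. It assumes the start and end positions are 1-based and inclusive, and that the protein length excludes the terminal stop codon; both assumptions are inferred from the numbers in this record, not stated by it. The function names are hypothetical.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 4014956
end_bp = 4016098
reported_gene_length = 1143     # bp
reported_protein_length = 380   # aa


def span_length(start, end):
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length_from_cds(cds_length):
    """Residue count for a complete CDS: number of codons minus the stop codon."""
    if cds_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length // 3 - 1


gene_length = span_length(start_bp, end_bp)
protein_length = protein_length_from_cds(gene_length)

print(gene_length, protein_length)
print(gene_length == reported_gene_length,
      protein_length == reported_protein_length)
```

Under these assumptions the computed values agree with the reported Gene Length and Protein Length.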


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0952002
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 14
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAACGACG TCGCCATCAT CGGCGTGGGC CTGCATCCCT TCGGTCGCTT CGAGAAGTCG 
GCCATGGAGC TCGGCGCGGA CGCGATCCAG CTGGCGCTGA AGGATGCCGG GATCGAGTGG
AAGGACATCC AGTTCGGTGT CGGTGGCAGC CTGGAGGTCG CCAACCCGGA CGCGGTGACG
AGGCTCGTCG GGCTGACCGG CATCCCGTTC ACCGACGTGT TCAACGCCTG CGCGACCGCG
GCCAGCGCCA TCCAGCTGTG CGCCGACACG ATCCGGCTCG GCAAGTACGA CATCGGCATC
GCCGTGGGCA TGGACAAGCA CCCGCGCGGT GCTTTCACCG CCGACCCGTC GATGCTCGGC
CTGCCGTCGT GGTATGCCGA GAACGGCCAG TTCGTCACCA CGCAGTTCTT CGGGATCAAG
GCCAACCGCT ACCTCCACGA GCACGGCATC TCGCAGCGGA CGCTGGCGAA GGTCGCCGCC
AAGAACTACC GCAACGGCGT GCTCAACCCG AACGCCTTCC GGCGTAAGCC GTTGAGCGAG
GAGGAGATCC TCGGCTCGCC CATGCTCAAC TATCCGTTGA CGCACTACAT GTTCTGCTCG
CCGGACGAGG GCGCCGCCGC CGCCATCATG TGCCGCGCCG ACATCGCGCA CCGGTTCACC
TCGCAGCCGA TCTACCTGCG CGCCGCGGAG ATCCGCACCC GCCGCTTCGG CGCCTACGAG
GTGCACAGCA CCTTCGCACC GGTCGACGAG GACGTCGCGC CGACCGTCTA CGCCGCCCGC
GCCGCCTTCG AGGCGGCCGG CGTCGGCCCG GGCGACGTCG ACGTGATCCA GCTTCAGGAC
ACGGATGCCG GCGCGGAGAT CATTCACATG GCCGAGTGCG GCTTCTGCGC CGACGGTGAG
CAGGAGAAGC TGCTCGCCGA GGGCGCGACC GAGATCAACG GCCCGTTGCC GGTCAACACC
GACGGCGGCC TCATCGCCAA CGGCGAGCCG ATCGGCGCAT CCGGGCTCCG CCAGGTGCAC
GAGCTGGTCC GCCAGCTGCG TGGTCAGGCG GGTGACCGTC AGGTCGCCGG CAACCCGCGC
GTCGGATTCG CCCAGGTCTA CGGCGCCCCC GGCACGGCCG CGGCCACCGT CCTCACCGTC
TGA
 
Protein sequence
MNDVAIIGVG LHPFGRFEKS AMELGADAIQ LALKDAGIEW KDIQFGVGGS LEVANPDAVT 
RLVGLTGIPF TDVFNACATA ASAIQLCADT IRLGKYDIGI AVGMDKHPRG AFTADPSMLG
LPSWYAENGQ FVTTQFFGIK ANRYLHEHGI SQRTLAKVAA KNYRNGVLNP NAFRRKPLSE
EEILGSPMLN YPLTHYMFCS PDEGAAAAIM CRADIAHRFT SQPIYLRAAE IRTRRFGAYE
VHSTFAPVDE DVAPTVYAAR AAFEAAGVGP GDVDVIQLQD TDAGAEIIHM AECGFCADGE
QEKLLAEGAT EINGPLPVNT DGGLIANGEP IGASGLRQVH ELVRQLRGQA GDRQVAGNPR
VGFAQVYGAP GTAAATVLTV
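
The gene and protein sequences above can be related with a small sketch that translates nucleotides to amino acids and computes GC content. It uses only the first 60 bases of the gene sequence (the first line of the listing), so the GC figure it prints describes that fragment and not the whole gene. The codon table is the standard one, on the assumption that translation table 11 assigns the same amino acids to internal codons and differs only in permitted start codons. The helper names `translate` and `gc_percent` are hypothetical.

```python
# Build the standard codon table in NCBI order (bases T, C, A, G).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq):
    """Remove the spaces and line breaks used in the record's layout."""
    return "".join(seq.split()).upper()


def translate(seq):
    """Translate in frame 1, stopping at the first stop codon."""
    seq = clean(seq)
    residues = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[pos:pos + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


def gc_percent(seq):
    """Percentage of G and C bases in the sequence."""
    seq = clean(seq)
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


# First line of the gene sequence listing in this record.
first_60 = "ATGAACGACG TCGCCATCAT CGGCGTGGGC CTGCATCCCT TCGGTCGCTT CGAGAAGTCG"

print(translate(first_60))   # MNDVAIIGVGLHPFGRFEKS
print(round(gc_percent(first_60), 1))
```

The translated fragment matches the first 20 residues of the protein sequence listed above. Passing the full gene sequence to the same functions would allow a comparison with the complete protein sequence and the reported GC content.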