Gene Franean1_4716 details

Gene Information

Locus tag: Franean1_4716
Symbol:
ID: 5673058
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. EAN1pec
Kingdom: Bacteria
Replicon accession: NC_009921
Strand:
Start bp: 5630060
End bp: 5631421
Gene Length: 1362 bp
Protein Length: 453 aa
Translation table: 11
GC content: 70%
IMG OID: 641243573
Product: DitF protein
Protein accession: YP_001508989
Protein GI: 158316481
COG category: [I] Lipid transport and metabolism
COG ID: [COG0183] Acetyl-CoA acetyltransferase
TIGRFAM ID:
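
The coordinates and lengths listed above are mutually consistent, which the following minimal sketch checks. It assumes the start and end positions are 1-based and inclusive (an assumption, though it matches the listed gene length) and that the final codon is a stop codon that is not counted in the protein length. The function names are illustrative and are not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS whose last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


# Values taken from the record above.
length = gene_length(5630060, 5631421)
print(length)                           # 1362
print(expected_protein_length(length))  # 453
```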


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 10
Fosmid unclonability p-value: 0.300284
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGGGTCCCC GGGTACGAGC CGCCCTATGT CGTGGCCCGG GTGGTGCTCG CCGAGCAGCC 
CGACCTGCAC CTGTTGACCA ATGTGGTCGA CTGCAACGTC GAGGCGGTCA GCACCGGTAT
GGAGGTCGAG GTGACCTTCG AACAGCGAGG GGACGTGTTC GTACCCATGT TCCGGCCGGT
CGTATGAACT CCGCCGGTCC CGTGAACGTC GAGCGCCAGT CGATCGTCTC CGGCATCGGG
CGGTCACCGT CCGGGCGGCG GCTGAACCGG TCCGCAATGG ATCTGACTCT GGACGCCTGT
CTGGCGGCCA TCGCTGATGC CGGTCTGACC CCGCGCGACG TCGACGGGCT CACCTCCTGG
CCCGACCACG CCGCCCCGCA CGGCTTCGGC GGGCCCCGGG TCGGTGAGCT GCACACGCTC
CTGCGCCTGG ACCTGTCGTG GATCCTGGGC TGCGGGGACG GCGCCAACGT CATCGGCATC
CTCGGGATCG CCGCCCACGC GGTGGCCACA GGCCTCGCCC GGCATGTGCT GGTCTACCGG
ACCGTCGGCG AGGCGACCAG CCAGGGCACC GGCCGCCGCC CGGCGGTGAT GGCCGACCCC
GGGGCCGCGC CGTGGAAGGC CACCTACGGG GTCGGTTCGC CGGTGCAGTT CGCGGCCCTG
TGGGCCCAGC ACCATTTCGA TCGTTACGGC ACCACCCGGG AGCAGCTCGG CTGGGTCGCG
GTGAACGACC GGCGTAACGC CGCCGGTAAC CCGGACGCGA TCTACCGCGA CCCGATGACG
ATCGACGACT ACCTCGCGGG GCGGATGATC AGCGAACCGC TGTGTCTGTT CGACTGCGAC
GTGCCGGCGG ACGGGTCGAT CGCCTTCGTT GTCTCGCGCG CCGACCACCG CCGTGACGTC
GACCGGCCCG TCTTCTTCGA AGCCCTGGGT GGTGGGCGGC CGATGACGTC GAGCTGGGAG
TTCTGGCCGG ACCTTGACGT CATGGCCGCG ATGAAGGCCG CCGAGCAGCT GTGGTCGCGT
ACCTCGCTGC GGCCCGGCGA CGTCGACGTC GCCGGTCTCT ACGACGGCTT CAGCATCTTC
GTCCTGTACT GGCTGGAGGC GCTCGGGTTC TGCGGCCGCG GCGAGTCCGG GCCGTTCGTC
GAAGGCGGCA CCCGCATCGC CCGTGACGGT GAGCTGCCAC TCAACACCTC CGGCGGCCAA
CTGTCGGAGG GCCGCTACCT CGGCTTCGGT CTGGCCTACG AGACCTTCCT GCAGTTACGG
AACCAGGCCG GCACCCGGCA GGTCACCGAC GCCGAGGTCG GCCTCGTCAC GGGCGGCGGC
GGCCCGCTCG CCCAGGCATT CCTCTTCACC AACGACCGCT GA
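
The GC content reported for this gene can be recomputed from the sequence itself. The sketch below is one simple way to do so, assuming the sequence is pasted in as a string that may still contain the spaces and line breaks used in the listing. The function name `gc_content` is illustrative.

```python
def gc_content(seq: str) -> float:
    """Fraction of G and C among the A/C/G/T characters in seq.

    Whitespace and any other characters are ignored, so the
    10-base blocks of the listing can be pasted in unchanged.
    """
    bases = [b for b in seq.upper() if b in "ACGT"]
    if not bases:
        raise ValueError("no nucleotide characters found")
    return sum(b in "GC" for b in bases) / len(bases)


# First line of the gene sequence above, blocks left as listed.
first_line = "GTGGGTCCCC GGGTACGAGC CGCCCTATGT CGTGGCCCGG GTGGTGCTCG CCGAGCAGCC"
print(f"{gc_content(first_line):.0%}")
```

Passing the full gene sequence instead of the first line gives the value to compare against the GC content field.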
 
Protein sequence
MGPRVRAALC RGPGGARRAA RPAPVDQCGR LQRRGGQHRY GGRGDLRTAR GRVRTHVPAG 
RMNSAGPVNV ERQSIVSGIG RSPSGRRLNR SAMDLTLDAC LAAIADAGLT PRDVDGLTSW
PDHAAPHGFG GPRVGELHTL LRLDLSWILG CGDGANVIGI LGIAAHAVAT GLARHVLVYR
TVGEATSQGT GRRPAVMADP GAAPWKATYG VGSPVQFAAL WAQHHFDRYG TTREQLGWVA
VNDRRNAAGN PDAIYRDPMT IDDYLAGRMI SEPLCLFDCD VPADGSIAFV VSRADHRRDV
DRPVFFEALG GGRPMTSSWE FWPDLDVMAA MKAAEQLWSR TSLRPGDVDV AGLYDGFSIF
VLYWLEALGF CGRGESGPFV EGGTRIARDG ELPLNTSGGQ LSEGRYLGFG LAYETFLQLR
NQAGTRQVTD AEVGLVTGGG GPLAQAFLFT NDR
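
The gene sequence begins with GTG while the protein begins with M. This is consistent with translation table 11, in which GTG can serve as an initiation codon and is then read as methionine. The following minimal sketch illustrates this, under two assumptions: the sense codons follow the standard genetic code, and only ATG, GTG and TTG are treated as start codons (a simplification). The names `CODON_TABLE`, `START_CODONS` and `translate` are illustrative.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order (standard genetic code).
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}
# Simplifying assumption: a subset of possible initiation codons.
START_CODONS = {"ATG", "GTG", "TTG"}


def translate(seq: str) -> str:
    """Translate a CDS, reading a recognised start codon as M.

    Whitespace is ignored; translation stops at the first stop codon.
    """
    dna = "".join(seq.split()).upper()
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        codon = dna[i:i + 3]
        if i == 0 and codon in START_CODONS:
            protein.append("M")
            continue
        aa = CODON_TABLE[codon]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence above.
print(translate("GTGGGTCCCC GGGTACGAGC CGCCCTATGT"))  # MGPRVRAALC
```

The output matches the first ten residues of the protein sequence listed above.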