Gene Franean1_5411 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_5411 
Symbol 
ID5673742 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp6527763 
End bp6529334 
Gene Length1572 bp 
Protein Length523 aa 
Translation table11 
GC content71% 
IMG OID641244266 
Producthypothetical protein 
Protein accessionYP_001509672 
Protein GI158317164 
COG category 
COG ID 
TIGRFAM ID[TIGR02946] acyltransferase, WS/DGAT/MGAT 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0824617 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCGCTC GGCCGCTTAC TCCGATGGAC GCGCTCATAC TCGGATACCA GCGAGCATTT 
CCGAAGACGC CGCTCGCGGT CGGTTGCCTG TTGGTCGCCG ACGGCCCCGT GCCAGGAGTC
CAGGCGTTGC GCGAGCTCGT CGCGGACCGG GCACACGACT TTCCTCCACT GGCCCACCGG
ATCGCCTCGG CCGGGCGTGG CCGGCCGGTC TGGGTGGCGG ACGCCGATTT CGACCCGGCC
CGGCACGTTC ACGAGTACCG GCTCCCCGCG GAGTCGGGGC TCGCCGGACT GCGCGAGGCG
GTGGGGCGGC TGTCAGCGGT CGAGATCTCG TTGGACGCGC CGCCGTGGCA GCTCTGGTTG
CTCCACGGCC TCCGCCGCAA CGGCTTCACC CTGCTGTACC GCGCCAGCCA CGTGTGGACA
GACGGGACGG CACTGAACCT GGTGCTGGAA AAGCTGTTCG GCCTGTCCGA CCCAGGATCG
GAGCGCGCAC CTCTGCGGGT GTCGCCGGAT CGCCGGCCCG GCCCCCGAAC CGTCTGTCGG
GCCGCCACTC ACTCACTCGG ATGGCTTACC CGGACGTCCA CGATCGGTCC GCTCTCGGCG
GCACCCACGG GATACCCGCA CCACACCTGG CTGGAGGTGG ACCTGTCCAG GCTACGGGCG
ATCAGCCGCG CCTACGAGGT CACCGTGAAC GACATCTTCC TGGCCGCGCT CACCGGCGCT
CTGCGTGCCT GGCCGCGCCC CGGCTCCGAT CGGCCCGGCC AGCGCCGGGG CCAGATGCAC
GCCGCGATGC CGGTCAGTAC CCGCCGGGCA GCCCAGCGGG ACCACATGAG CAACTACCTG
ACCACCGTAC GGATCGCGCT GCCTTACGGT GAGTCGTTGA TCCACCGGCG TGTGGAGGCG
ATCCACCGGC AGACCGTCCG ACACAAGCGG GGCGGGACTC CGGGCGTCGC GGAACATCTC
TTCCTCTGGG CGATTCCCGA ACCGTTGCGA CCGGCCGTGC TGTCCACCGG GATCATGTCC
CACGTCTTCG CGCTGACCGC ATCCAACCCG GGCGGCCTGA CCGGCCCGCT GGAAATCCTC
GGCCGACCAG TCACCGCCGC CGTGCCCACC CCTCCGCTCC CCGCAGGCCA ACGCCTGGCA
GTCCTGCTCG GCGGGCTGGA CGGGCAGGCG TGCATCGGTT TCACGATGGA CGGGTCGGTG
CGGGACGGGG CGCGACTGCC GGAACTCGTC GAGGCCGAAC TGGACGCGCT CGAGGCAGCG
GCCGGCCTCC GCCACGGCCC GGCCCGGAGC CCGGCCGAGA CCACGCATTC GGCACGGCCG
CATCCGGTCG CCATGACCGG TATCGGCACC AGCGCTATCG AGGCGGCGGG TCTGGTGGGG
CGCGGTGCTT ATCACCTTCG TCGCTGGCTC ACACGGGGCC AGCGGACCGA CACGCCTGGC
ACGGAACTGG TTCAGGGCAC CAGCGACGCG GCAGGTCGAA CGCCCTACAG TCAACTCTGC
TCCCGACGGC TCCAGCAACC CCGGCCAGCC GGTGCTTCGC CAGCAGAGGA GAGGCGAGTA
CTACATTCCT AG
 
Protein sequence
MTARPLTPMD ALILGYQRAF PKTPLAVGCL LVADGPVPGV QALRELVADR AHDFPPLAHR 
IASAGRGRPV WVADADFDPA RHVHEYRLPA ESGLAGLREA VGRLSAVEIS LDAPPWQLWL
LHGLRRNGFT LLYRASHVWT DGTALNLVLE KLFGLSDPGS ERAPLRVSPD RRPGPRTVCR
AATHSLGWLT RTSTIGPLSA APTGYPHHTW LEVDLSRLRA ISRAYEVTVN DIFLAALTGA
LRAWPRPGSD RPGQRRGQMH AAMPVSTRRA AQRDHMSNYL TTVRIALPYG ESLIHRRVEA
IHRQTVRHKR GGTPGVAEHL FLWAIPEPLR PAVLSTGIMS HVFALTASNP GGLTGPLEIL
GRPVTAAVPT PPLPAGQRLA VLLGGLDGQA CIGFTMDGSV RDGARLPELV EAELDALEAA
AGLRHGPARS PAETTHSARP HPVAMTGIGT SAIEAAGLVG RGAYHLRRWL TRGQRTDTPG
TELVQGTSDA AGRTPYSQLC SRRLQQPRPA GASPAEERRV LHS