Gene Franean1_5497 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_5497 
Symbol 
ID5673828 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp6653778 
End bp6656711 
Gene Length2934 bp 
Protein Length977 aa 
Translation table11 
GC content74% 
IMG OID641244352 
Productacyltransferase 3 
Protein accessionYP_001509758 
Protein GI158317250 
COG category[I] Lipid transport and metabolism 
COG ID[COG1835] Predicted acyltransferases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.114019 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGCTGCTCC CGGTACGCGA ACGGACACGG CACCGTCCGT GCCAGGCCCC GTCCGGCTCG 
ATCCGGCGGC CTGCCGCCAC ACCTGCGCAC CTACGACATC GGAAGGGGAC CGGGCTGAGC
CGCCGAGACA CCGACACCTA CCCGCTGGCT CGCCCCCGCC CCGGTCAGCC GGAGGACGAC
CACGCCGCCG AGCGCAACCC GCGGCGCCCC GTCCGGGACC GGCGCAGAGC CGCGGACCGA
CGCGCGGACA CGGACAGTGG CACGGACCAC GACGAGGTCA CAGACCATTT GGACAGCCTG
TATCTCGACC GGTCCCGGCT CGACGCCGCC TACCAGGACG ACGCCTACCC GGACCCCGAG
GGCCGCCTGC GCCGGGCCGG CGCGTACAGC GGCGACGAGA CCGAGCACAT CGAGCGCCGG
GCCGCCGCCT CACGCGACCC ACGTGAGGCG GCACGCGACG CCGACGATCA CACCCGCGTC
ATCGCCCGCC CCCGCGTCCC AGCCCGTGCC GGGCGAGCCG CCCGCACGGG GCGCCCCGCG
GGCGACCGGA GCCCGCGGGG CGGCCACGAG CTCGTCGGGA GCGGCCCGCC GCGCCCGCAC
GGCGCCGAGC CGGCCGAGGC CCGCCGGTCC GGTTCCCGCC AGCCGGTGGC CGACCCGTCC
AGGCGACGCG AGCCGGGCGG GCCGCGCACG CCGCGACAGC CGGGCACGTC ACGGCAGACA
GGCACGTCAC GGCAGCCGGC CGGCGGGCGA GGCGCGGGCG AGGCGGACGA CCTCCTCGAA
GCGGACGACC TCCTGGAGGC GGACAGGCCC GGTGCCCGCG CCCGCGGCGC CGCCGACCCG
CATGCGACCG GCCCGCTCGA GACCGGGCAG CCGGACGACG GATCGCCGGA GACCGGGCCG
CTCGAGACCG GACCACTCAC GGCACCGGTA CTGGCACCAT TGGGCCACTC TCCCGCGCTG
GACGGCCTGC GCGCCCTCGC CGTCCTCGGG GTGATGGCCT ACCACGCGGG CCTGAGCTGG
ATGCCGGGCG GACTGCTCGG CGTCGACGCC TTCTTCGTCC TCTCCGGATT CCTGATCACC
GGCCTGCTGG TGGCCGAGTA CCGGACCACC CGGCGCATCG ACCTGAAGTC GTTCTGGATC
CGCCGCACCC GACGTCTGAT GCCGGCACTG CTGACGATGA TGCTGGGCGT GGCGGCCTAC
GCCCGGTTCG TCGCCCAGCC CAGCGAGGTC GAGACACTGC GCGTGGACGC GCTGTCGACC
CTCGGGTACG TCGCCAACTG GCGGTTCGCG CTGTCGGACC AGAGCTATTT CGACCACTTC
TCCGCCCCGT CCCCGCTGCT GCACACCTGG TCGCTGGCGG TGGAGGAACA GTTCTACGTC
CTGTGGCCGC TCGCCGTCTA CCTGCTGATG TGGCACAGCG GGCGCACCAC GGCGCGCTGG
CGCAACCAGC GCCACAAGGC GCAGACCCTG ATCCTGACCG TCGCCGTGCT GGGCGCGGAG
GCGTCCGCGC TGCTCGGGCT GCTGCTGGTC GTGCTGGGCG CGGACGCCTC CCGCATCTAC
TACGGCACGG ACACCCGCGC GCAGGCGCTG CTGGCCGGCG CCGCACTGGC TGTGTGGCGG
ATCCAGCGGC GGACACCGCT CACCGAGCGG ACGAAGAAGG GCCTGTCCGT CGCCGGCTGC
CTCGCCGGGG TGGCGATGCT CACGGTCTGG GCGACCGTCG ACGGCGAGAG CCGGGTGCTC
TACTCCGGCG GCTTCCTGGG CGTGGCGGTG ATCGTGATGC TGCTGATCGC CTCGATCGTC
GAGGTGCCCC GCGGGCCGGC CGCGCGGGTG CTGTCGCTCC CGCCACTACC CACCATCGGC
CGGGTCTCCT ACGGCCTCTA CCTGTGGCAC TGGCCGGTCT TCCTGACCGT CACCGCGGCC
CGCACGGGGC TGAGCGGCGC GGCGCTGCTG GGCGTGCGCG CGGCCGTCAC GGCGGCGATC
ACCATCGCCT CGTTCCACCT GGTCGAAAAC CCGATCAGGC GACGGAAGAT CCGTTTCCCC
ATGCCGCGGG TCACCGTCCC GGCCTCGATC TGCGCGGTCG TCGCCCTGGT GATGGTCGCC
ACGATCGCGG ACCCGTCCGC CGGCCGCGGC GCGGCGGACC TGGAGCAGCT GGCCGCGCGG
GCCGCCACCG CGGCACCGCC GACGGCCCGG GCCGAGCCGG TGACGGGCCG CCCGCTGCGG
GCCGTCCTGG CCGGGGACTC GCTCGCGCTC ACCCTCGGCT TCAGCGAGTT CTCCACGGCC
GCCGCCCGCG AGGGCCTGGA GATCCACGAC GCCTCCCAGC TCGGGTGCGG GGTGACCCGC
TCGCCGCGCC GCCTCCTCAT GGGCGAGGTG AACCTGCCAC CTCCGGGGTG CGACCAGTGG
CCCACCCGCC TCAGCCGGAA AGTCGACGAG CTCCGTCCCG ACCTGGCCAT GCTGCTGGTC
GGGCGCTGGG AGGTAACCGA CCAGGACCTC GAGGGCCGAC GGACCCACAT CGGTGATCCG
GCGTTCGACG CCTATCTCGG GCGCGAGCTC GACCTGGCGA TCACCACCCT CTCGGCCCGC
GGAGCGAAGG TGGTGCTGCT GACCACCCCG ATGTTCGAGG AGGGTGAGGC GGCGGACGGC
AGCATCTACC CGGAAACCCG GGCCGACCGG GTGATCCAGT TCAACAAGCT GCTTCGCCAG
GCCGCCGCGC GGCACTCCGC CGTCACCACC GTCATCGATC TCGCCGCGGC ACTCAGCCCG
GGCAACGAGT ACCGCGGTGA GATCGACGGC GTTCGCGTGC GCGACGACGA CGGTGTTCAC
ATCAGCAACG GCGGTGGAGC GCGGGCCGGC CAGATGGTGC TACCCGAGCT TCTCAAGCTG
GCCCGCCCGG CCACGGCGCG TGGTGCCGAC GGGCCGGCCC CGGCACCCGA GTGA
 
Protein sequence
MLLPVRERTR HRPCQAPSGS IRRPAATPAH LRHRKGTGLS RRDTDTYPLA RPRPGQPEDD 
HAAERNPRRP VRDRRRAADR RADTDSGTDH DEVTDHLDSL YLDRSRLDAA YQDDAYPDPE
GRLRRAGAYS GDETEHIERR AAASRDPREA ARDADDHTRV IARPRVPARA GRAARTGRPA
GDRSPRGGHE LVGSGPPRPH GAEPAEARRS GSRQPVADPS RRREPGGPRT PRQPGTSRQT
GTSRQPAGGR GAGEADDLLE ADDLLEADRP GARARGAADP HATGPLETGQ PDDGSPETGP
LETGPLTAPV LAPLGHSPAL DGLRALAVLG VMAYHAGLSW MPGGLLGVDA FFVLSGFLIT
GLLVAEYRTT RRIDLKSFWI RRTRRLMPAL LTMMLGVAAY ARFVAQPSEV ETLRVDALST
LGYVANWRFA LSDQSYFDHF SAPSPLLHTW SLAVEEQFYV LWPLAVYLLM WHSGRTTARW
RNQRHKAQTL ILTVAVLGAE ASALLGLLLV VLGADASRIY YGTDTRAQAL LAGAALAVWR
IQRRTPLTER TKKGLSVAGC LAGVAMLTVW ATVDGESRVL YSGGFLGVAV IVMLLIASIV
EVPRGPAARV LSLPPLPTIG RVSYGLYLWH WPVFLTVTAA RTGLSGAALL GVRAAVTAAI
TIASFHLVEN PIRRRKIRFP MPRVTVPASI CAVVALVMVA TIADPSAGRG AADLEQLAAR
AATAAPPTAR AEPVTGRPLR AVLAGDSLAL TLGFSEFSTA AAREGLEIHD ASQLGCGVTR
SPRRLLMGEV NLPPPGCDQW PTRLSRKVDE LRPDLAMLLV GRWEVTDQDL EGRRTHIGDP
AFDAYLGREL DLAITTLSAR GAKVVLLTTP MFEEGEAADG SIYPETRADR VIQFNKLLRQ
AAARHSAVTT VIDLAAALSP GNEYRGEIDG VRVRDDDGVH ISNGGGARAG QMVLPELLKL
ARPATARGAD GPAPAPE