Gene Franean1_0238 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_0238 
Symbol 
ID5668663 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp289991 
End bp292210 
Gene Length2220 bp 
Protein Length739 aa 
Translation table11 
GC content68% 
IMG OID641239167 
Producthypothetical protein 
Protein accessionYP_001504611 
Protein GI158312103 
COG category[C] Energy production and conversion 
COG ID[COG0247] Fe-S oxidoreductase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.855121 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value0.712535 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGAGAATCG CGATCGGTCT AGCGATCACG CTGATCGCTC TCGCGGTGGC CGGCCGCCGC 
GTCTTCTGGC TGACCAGGCT GATCAGGTCC GGTCAACCGG CGGAGGGCCG GCTCGACGAC
CTGCCGACCC GGATCTGGAC CGAGATCAGC GAGGTCGGCG GCCAGCGCAA GCTGCTCAAG
TGGTCGGTGC CCGGCCTGGC GCACGCCTTC ACCTTCTGGG GCTTCACCAT CCTCGGCCTG
ACGATCGTCG AGGCGTACGG GGCCCTGTTC GACCACGACT TCCACATCCC GCTGTTCGGG
CACTGGGCGG CCATCGGGTT CCTCGAGGAC TTCTTCGCAG TCGCCGTGCT CGCGGGCCTG
ATCACCTTCA CGGCCATCAG GCTGCGCAAC GCCCCGGCCC GGCTTGACCG CAAGTCCCGG
TTCTACGGCT CCCACATCGG CCCGGCCTGG GTCATCCTCG GGATGATCAC TCTGGTCATC
GTGACCCTGC TGATCACCCG TGGCGCCCAG TTCAACGCGG GCACCCACCC GCAGGGCGAC
ACCAAGTGGG CGTTCGCGTC CTGGCTGGTC AGCCTGCCGC TGGATGCCTT CTCGGAGCAC
ACCAACGAAC ACATCGAGAC GGTGTTCCTG CTCCTCAACA TCGCGATCAT CATGGGCTTC
CTGGTGCTGG TGGTCTACTC CAAGCACCTG CACATCGGCC TGGCGCCGAT CAACGTCATC
CTCAAGCGGG AGCCGGTCGC GCTCGGCCCG CTGGGCACCA CGCCCGACAT CGAGAAGCTG
ATGGAGGAGG ACGAGCCGAT CGTCGGGGTC GGCAAGGTCG AGGACTTCTC CTGGAAGGCC
ATGCTCGACT TCTCCACCTG CACCGAGTGC GGGCGGTGCC AGAGCCAGTG CCCGGCCTGG
AACACCGGCA AGCCGCTGTC GCCCAAGCTC CTGATCATGG ACCTCCGGGA CCACCTCTTC
GCGAAGGCTC CCTACCTGCT CGCGCCCAAG GGCGCCGAGG ACGGCGAGAG CGCCGAGGAG
ACCACGAAGG CGGCCACCAG CGCGTCCGAG GACGGCTCCG GCAAGCACAA GGTGCACCAC
GTGCCCGAGT CCGGCTTCGG CCGTGTCCCC GAGCCCGGTC AGCCCCAGGT GGACCGCCCG
CTCGTCGGCA CCGCGGAGGA GGGCGGGGTC ATCGACCCCG ACGTCCTGTG GTCGTGCACC
AACTGCGGGG CCTGTGTCGA GCAGTGCCCG GTGGACATCG AGCACGTCGA CCACATCGTC
GACATGCGCC GCTACCAGGT CATGATCGAG TCGGCGTTCC CGTCCGAGGC CGGCGTGATG
CTGCGCAACC TGGAGAACAA CGGCAACCCG TGGGGTGTCT CGCCGCGCTC GCGCACCGAG
TGGACCGACG GCCTGCCGTT CGAGGTGCGC ATCCTCGACG AGGGCGAGCA GATCCCGGAC
GAGGTCGAGT ACCTCTACTG GGTCGGCTGC GCCGGCGCCA TCGAGGACCG GGCCAAGAAG
GTCGCGCGCT CCTTCGCCGA GCTGCTGCAC ACCGCCGGGG TCGAGTTCGC CATCCTCGGC
AGCCAGGAGT CCTGCACCGG TGACCCGGCG CGCCGCCTCG GCAACGAGTA CCTCTACCAG
GAGATGGCGA AGGCCAACAT CGAGCTGTTG AACGAGACGG GCGTCAAGAA GATCGTCGCG
ACCTGCCCGC ACTGCTTCAA CAGCCTCGCC CGGGAGTACT CCTCGCTCGG CGGGACGTTC
GAGGTCGTCC ACCACACGCA GCTGCTCGGC AAGCTCGTCG AGGAGCGCAA GCTCGTCCCG
ATCACCCCGA TCGACTCCTC GGTGACCTAC CACGACCCGT GCTTCCTCGG CCGGCACAAC
AAGGTCTACA CCCCGCCCCG GGAGATCCTC GAGGCCATCC CGGGCATCCG TGGCCAGGAG
ATGCACCGCT GCAAGGACCG TGGCTTCTGC TGCGGCGCCG GCGGCGCGCG GATGTGGATG
GAGGAGAAGA TCGGAAAGCG GGTCAACGTC GACCGCATGG AGGAGGCCCT CGGCCTCGAT
CCCGATGTGG TCTCGACGGC CTGCCCGTTC TGCATCGTGA TGCTCTCCGA CGCCGTCACC
GAGAAGAAGC TCGCCGGCGA GGCCAAGGAG AGCGTCGAGG TGCTCGACGT CTCGCAGCTG
CTGGCCCGTT CGCTGGTCGC GCCGACGCCC ACGCCGGCAG CGGAGCCGCT GAGCAGCTGA
 
Protein sequence
MRIAIGLAIT LIALAVAGRR VFWLTRLIRS GQPAEGRLDD LPTRIWTEIS EVGGQRKLLK 
WSVPGLAHAF TFWGFTILGL TIVEAYGALF DHDFHIPLFG HWAAIGFLED FFAVAVLAGL
ITFTAIRLRN APARLDRKSR FYGSHIGPAW VILGMITLVI VTLLITRGAQ FNAGTHPQGD
TKWAFASWLV SLPLDAFSEH TNEHIETVFL LLNIAIIMGF LVLVVYSKHL HIGLAPINVI
LKREPVALGP LGTTPDIEKL MEEDEPIVGV GKVEDFSWKA MLDFSTCTEC GRCQSQCPAW
NTGKPLSPKL LIMDLRDHLF AKAPYLLAPK GAEDGESAEE TTKAATSASE DGSGKHKVHH
VPESGFGRVP EPGQPQVDRP LVGTAEEGGV IDPDVLWSCT NCGACVEQCP VDIEHVDHIV
DMRRYQVMIE SAFPSEAGVM LRNLENNGNP WGVSPRSRTE WTDGLPFEVR ILDEGEQIPD
EVEYLYWVGC AGAIEDRAKK VARSFAELLH TAGVEFAILG SQESCTGDPA RRLGNEYLYQ
EMAKANIELL NETGVKKIVA TCPHCFNSLA REYSSLGGTF EVVHHTQLLG KLVEERKLVP
ITPIDSSVTY HDPCFLGRHN KVYTPPREIL EAIPGIRGQE MHRCKDRGFC CGAGGARMWM
EEKIGKRVNV DRMEEALGLD PDVVSTACPF CIVMLSDAVT EKKLAGEAKE SVEVLDVSQL
LARSLVAPTP TPAAEPLSS