Gene Franean1_1656 details


Gene Information

Locus tag: Franean1_1656
Symbol:
ID: 5670058
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. EAN1pec
Kingdom: Bacteria
Replicon accession: NC_009921
Strand:
Start bp: 1978461
End bp: 1980530
Gene Length: 2070 bp
Protein Length: 689 aa
Translation table: 11
GC content: 75%
IMG OID: 641240574
Product: hypothetical protein
Protein accession: YP_001506000
Protein GI: 158313492
COG category:
COG ID:
TIGRFAM ID:
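
The coordinates and lengths in this record can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are hypothetical and do not come from the database.

```python
# Values copied from the Gene Information record.
START_BP = 1978461
END_BP = 1980530


def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one trailing stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


if __name__ == "__main__":
    length = gene_length(START_BP, END_BP)
    print(length, expected_protein_length(length))
```

Under those assumptions the script prints 2070 and 689, matching the Gene Length and Protein Length fields.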


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.329518
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 11
Fosmid unclonability p-value: 0.419184
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGTGTTGCC TGATCGTCAC GCTGTCAGAG CGCCGATCAA GCCGAAAAGA GCAGCCGGTC 
AGTAAAGTTT CGTGGGTGTT TCGACCGGAC CTGGAAAGCC GTCAGATTTC TCTCCCCGAG
CAGACCGTCG AGCTGCCGGC GGACCTCCTG GATCCGCTCG CGCACGTCGT CGTGGAGGTA
TTTCCGCAGC TGGTGGCCCA TATTCCGCCC ACCCCCGCGG CTGAGCTCGG CCGGCTCGGC
GCGACGGCGC CCGCTCCGGT CCCGCTCCCG CCCGCACCCG CCGGACCGGT GTTGACCGCG
TTGCGTTCGG CCGGGCCGCG CCTACCGGAA CGACTGCTCG GCGCGCTCGA GCTCGCCATC
GACGAGCTTC CGCTGCGCGC CCCGCGCGGC CTGGCCAGGG GCCTGGTCGA CGCCCCGCCG
AACGGGCCGC GCGGCAGGAA CGGATCGCGC GCGTCCGCGG CCGCCGCAGG CGAGTCCGAG
GCGGTCCAGG CGGTCGCGAT CCTGGATGAG GTCCATCCCG GGGTGTCCGA TCTGATCCAC
AGCTACCTGC GCGCGCTGGT CGAGCATCCC GCCGTGGCCC CGCTGCTGGC GGCGGACGAC
CCTGCCGCGG CCGGCCCGCT CGACGGCCAC GGCTCGGACG ACCGCCCGCT CGACGGCACC
GCATTCGGCA GCACCGCGAA CGGCGGGTCC GCGTTCGGCG GGTCCGCGTT CGGCGGGAGC
CTGTTCGGGC CGCCGGACGA GCCGGCCGAC GAGCCCACCC TGGCCGCCCG GCACGGGGCC
GCCCACCTGG CGCTCGCGGT CACCGTCGCC GCCGCCGTCC TGCGGGAGCT GGACCCGCCG
ATGATCGGCA CCGGCGCGCC GGGGATCGTC GGCACGGCCG TGGGGTCCGT CGCGCTGGTG
CTGCCGGGCC GGCCGATGCC GCTCGCCTAC CCGGCCGCGC TGCTGGCCCG GCGCCGCGCG
GAGTACCGGC TGCCGCGGCA GGCCGCCGGC TGCGTCACCG TCGACGGGCA CTGCTTCGGG
CTGGTGGAGG GCGAGCTGCC GAATCCGCCG AGCTTCGCCC GCAACGGCCT GGTGGAGGCC
GTCTCCGGCG GTGTCCTCGT CCGGACCGGC ATGGGCACGG GCCGGGTACG GGTGTCGCTG
CGGGTGCTGG CGACACCGCC GGCCCCGCCG GTCCCCGCCG ACGCCGTCCG CTGGGACGAG
ATCGTCGACG TGAGCTGGAC GGCCGCGAAC GGCGCCGCCG CGGTGGCCGG CGCCGGAGCG
AGCACCGCCG GCGGCACCAC AACAACGGGC GTCGGCGCCG GGGCCGGCTC AGCCACCGAC
CTCGGCCACC TGACGACACC GCCCTGGCCC GGGGACTACC GGCTGCGGGT TTACGCCCAC
GGCCGGGACG GCGCCGGCGA GGACGAGACC TACGAGCTGG TGGTGTGGAG CGCCCCGGCC
GCGCCCGAGA CCGTCCACCG GCGCACCGAC CAGCTGGGGC ACCGGCTGCG CGGCGAGGAG
CTTCCGCCGG TCGTGACGGT GCCGGAGACC CGCTACCGGT GGGTGCGCCG GCGCAGCGCC
TTCCGCGAGG CCGCCACGTT CACCATCGTG GTCGGCGCCT CGCCCGAGGA CGTCGTGCGC
TGCTTCGACG CCGATCCCGG CGCGCCGTGC TCGCTGTCCC GGCTGCGCGC CGACCGGCGC
ACCGACCCGT ACGTGCTGGT CCTGCCGCTC GACGGCGACG ACCGCGCGGT GCTCGCCGTC
GAGGACAACG GCTTCCAGGG CTCCCGGCAC CCGGTGCTGT CCGCGGTCTC CCGGCACGGC
CTGGCGGCGA GCATGTTCTG GAACATCAAC GCGCTCACCC GGCTCTCGCT CGCCCGGGAC
GGCGAGGTGC TCGCCGCGTT CGAACCGGGG CCGGACGCCG TCCCCGACGC GGTCGTGCCG
CTCCTGCGGG ACGTCGACCT GGCCGGCGCT ACGGACCGGG TCGCCAAGGG CCTCGTCGTC
GTCGAGCGGT TCACCGGCCA TCCGGTCCTC TCCGAGCACC TGGACCGGAT CATCGAGAAC
GACGTGGCAT ACCTGATCAA CCAGCACTGA
 
Protein sequence
MCCLIVTLSE RRSSRKEQPV SKVSWVFRPD LESRQISLPE QTVELPADLL DPLAHVVVEV 
FPQLVAHIPP TPAAELGRLG ATAPAPVPLP PAPAGPVLTA LRSAGPRLPE RLLGALELAI
DELPLRAPRG LARGLVDAPP NGPRGRNGSR ASAAAAGESE AVQAVAILDE VHPGVSDLIH
SYLRALVEHP AVAPLLAADD PAAAGPLDGH GSDDRPLDGT AFGSTANGGS AFGGSAFGGS
LFGPPDEPAD EPTLAARHGA AHLALAVTVA AAVLRELDPP MIGTGAPGIV GTAVGSVALV
LPGRPMPLAY PAALLARRRA EYRLPRQAAG CVTVDGHCFG LVEGELPNPP SFARNGLVEA
VSGGVLVRTG MGTGRVRVSL RVLATPPAPP VPADAVRWDE IVDVSWTAAN GAAAVAGAGA
STAGGTTTTG VGAGAGSATD LGHLTTPPWP GDYRLRVYAH GRDGAGEDET YELVVWSAPA
APETVHRRTD QLGHRLRGEE LPPVVTVPET RYRWVRRRSA FREAATFTIV VGASPEDVVR
CFDADPGAPC SLSRLRADRR TDPYVLVLPL DGDDRAVLAV EDNGFQGSRH PVLSAVSRHG
LAASMFWNIN ALTRLSLARD GEVLAAFEPG PDAVPDAVVP LLRDVDLAGA TDRVAKGLVV
VERFTGHPVL SEHLDRIIEN DVAYLINQH
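
The sequences in this record are printed in space-separated blocks of ten letters, so they need to be joined before any computation. The sketch below shows one way to do that and to compute a GC fraction from the result. It assumes the reported GC content is the share of G and C bases over the gene sequence, which the record does not state; the function names are hypothetical.

```python
def clean_sequence(text: str) -> str:
    """Join a sequence printed in 10-letter blocks by dropping all whitespace."""
    return "".join(text.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in a DNA string (whitespace is ignored)."""
    seq = clean_sequence(dna)
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


if __name__ == "__main__":
    # First two blocks of the gene sequence, used only as a small demo input.
    demo = "GTGTGTTGCC TGATCGTCAC"
    print(clean_sequence(demo), gc_fraction(demo))
```

Passing the full gene sequence text to gc_fraction gives a value that can be compared with the GC content field.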