Gene Franean1_0223 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_0223 
Symbol 
ID5668648 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp271810 
End bp273219 
Gene Length1410 bp 
Protein Length469 aa 
Translation table11 
GC content74% 
IMG OID641239152 
Productmembrane-flanked domain-containing protein 
Protein accessionYP_001504596 
Protein GI158312088 
COG category[S] Function unknown 
COG ID[COG3428] Predicted membrane protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.346085 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACGAGAG CCGAGCCGGA GCCGTCGGTC GAGGACGCGG CTCCGCCGCT CCAGCCGCCG 
CCCGCCGGCG CGGAGCCGGA CGACGGCTAC CACCATCTTC ATCCGCTCAC GCCACTGCTG
CGCGGTTGGC TGCTGGTGGC GGCGGTCGCC GTCAGTATTC TGAAGAACTT CGCCGAGGAA
CTGACGGTTC GCAGTGTCAC GCTCACCCTC TCCGCGCTGC TGCCCGCGGC CTCGGCATAC
GGCTACTGCG CCTGGCGGTT CACCCGTTAC CGCGTGGAAG CCGAGGATCT GCGGCTGGAG
ACCGGCCTGC TGATCAAGCG TATCCGGCAT GTGCGGCTCG ACCGCATCCA GTCGGTGGAC
GTCGCCCAGC CGCTGATCGC CCGGGCGGCC GGGCTCGCCG TGCTGCGGCT GGACCTCGCC
GGCACGCACG ACGGGGACGC GGAGGAATCC GGCACGAGGC TCAGCTACCT CTCGCTCGAA
CGCGCGAGGC TGCTGCGGGC GGAGCTGCTC GCCCGCGCAG CCGGCGTGGC GCCCGGAGCC
GGAGAGGCAC CCGAACGCCC GCTCGCGTAC GTTCCGCCGC TGCGCCTGGC AGCCGCGATC
ACGCTCTCCC CCGCACCGTG GCTGGCACTC GGCGCGGCCT TCATCCTGGT CACCCCGGCC
CTGCTCACCG GCACCTTCGC CGGGTTCATC GCGGTCGCGC CCGCGCTGGT CGGAGTGTGG
CGGACGACCT TCGACCGCTT CGCGACCGGC TTCCGGTTCA CGGTGTCGGA ATCGCCGGAC
GGCCTGCGTA TCCGCGGCGG CCTGCTCGAC CGTGCCCATC ACACCGTGCC GCACGGCCGG
GTCCAGGCGG TGAGCCTGCG CGGCTCGCCG CTGTGGCGCT GGCTCGGCTG GGTGGAGCTC
CAGATCAACG TCGCCGGCGA CGCCCGGAGC ATGCTGCTGC CCGTCGCCCC CCGCGGGCAG
GCCGTGAACG TGATCGAACG GCTGCTGCCG GCCGTGGACG TCGACGGCAT GGCGCTGCCG
CGCCCGCCGG GCCGGGCCCG GCTGATCGCG CCGGTCCGGT GGCGCCGGAT GCGCTGCGGC
GCGGACGACC GGGTGTTCGT CGTCCGCGGC GGAGTGCTGT GGCAGCGCAC CGACATCATT
CCGCACGGCA AGGTGCAGAG CATCCGGGTC GTGCAGCACC CGGCCCATCG CCTGCTGCGC
CTCGCGGATG TCCACCTCGA CTGCGCGGGC GGCCCCGTGC GGATCCACGC CCGGCTGCTG
GACGTCCGCC AGGCGCACGC GATCGCGGCC GCGCAGGCCG AGCGCTCCCG GCTCAGCCGG
GTCGCCGCCG TACCGATCGT GACCGGCTCG CCGGCCTGGA CGCCGCTGCG CCCTCTGTCC
GGGGCACGCG ACCCCGACGC GGTGCACTGA
 
Protein sequence
MTRAEPEPSV EDAAPPLQPP PAGAEPDDGY HHLHPLTPLL RGWLLVAAVA VSILKNFAEE 
LTVRSVTLTL SALLPAASAY GYCAWRFTRY RVEAEDLRLE TGLLIKRIRH VRLDRIQSVD
VAQPLIARAA GLAVLRLDLA GTHDGDAEES GTRLSYLSLE RARLLRAELL ARAAGVAPGA
GEAPERPLAY VPPLRLAAAI TLSPAPWLAL GAAFILVTPA LLTGTFAGFI AVAPALVGVW
RTTFDRFATG FRFTVSESPD GLRIRGGLLD RAHHTVPHGR VQAVSLRGSP LWRWLGWVEL
QINVAGDARS MLLPVAPRGQ AVNVIERLLP AVDVDGMALP RPPGRARLIA PVRWRRMRCG
ADDRVFVVRG GVLWQRTDII PHGKVQSIRV VQHPAHRLLR LADVHLDCAG GPVRIHARLL
DVRQAHAIAA AQAERSRLSR VAAVPIVTGS PAWTPLRPLS GARDPDAVH