Gene Franean1_3791 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_3791 
Symbol 
ID5672155 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp4494953 
End bp4497442 
Gene Length2490 bp 
Protein Length829 aa 
Translation table11 
GC content68% 
IMG OID641242670 
Producthypothetical protein 
Protein accessionYP_001508090 
Protein GI158315582 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.805476 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCCGTCA GGGGAATGCT GGCGCGGCGC CAGTCCACAT CTCCTCCCCC TGCTGGCGCG 
AAGCCGGATG CCGGCCCGGG GGCGAGCATC CGGCTTCGGA ACGCGTCTGG CGTCCCGCAG
GTCCAGCGCG AGGACGTCCC TGCCGTGACG ACGACCCCGA AGGAGCTCAT TTCGAAGTAC
ACGACGCTCC TCATGCTCGA CGAGGCGGGG CTCGGGAAGG ATCTCGCCGC GCGCGTGGTG
GAAAGACGGG ACGTCATGCT CACTCACGGG GTGTTCGACA ACCTCAGGAC CACCGACCGC
GACGATGTCG CCGAGGAACT GGCCCGGTCT GCCGGCAGCA GGCTCCGGGA GGTGGACGAG
GGACTCCGGA TCCGCCTCAT CCGGGAGATG CTCGACACCG TCGTCGACGA GGACGCGGAA
AAGCAGGTCA CGGCGGTCTG GCTGAGCTTC GAGAACGAAG GGAAGCTGGG AACGGTCATC
ACCAACCAGC GCGCCCTGTG GGATCGCTCG CTCAGCGAGT GCACGCCGCT GAGCGACCGT
TTCGCCGCCT ACACGGCAGC GTTCGGCATG GATATCCTGA AGGCAGCCGG CGTCTATCTC
GACGAGAACC GAAAGATCAC GGAGAAGGAA GCCAAGGGGT TCGGGCTTAA TCTCGGTGGT
GCGGCCGAGA CGGCACCGCC GGAGCAGGCG GACGCCTATC TCGACGAGGT GCGCAAGGCG
GCCAGGGTCG TGGTGCAGCT GCAGGCCCTG ACGTATGAGC TGCGGCTGGT GCCTGTCGGC
TATCGCAGCA CCCCCGAGCT CCAACAGAAG CTTTCGGCGG ACGCCAGGAC AGACGCGGAG
GCCTACAAGG CCGTCACCGC GCCGCCCCCA GACACGCTAC GGAGGTACGG GCAGCCGGAG
CTCTTCGATC CCGCCCGGCG CCCCGACTAC CCGCCGACCG GCAAGGAGGA GCCGGAGATG
GCGCAGTGGA CGGAGGCGGA GTCACACCAC AGCAAGATCG GGGCGCTGAT CGCCGAGTAC
GCCCGAAACT ACCCCGCCGT CTACGCGTCG ATCACGCAGG GAAACATCGG GGAGCTGGCC
GAGGCGACGG ACGCAGGCAA GGCCCGCGGA CTGGTGGAGA ACATTCTCCG GAAGACGCTT
ACGGCGATAG AAGAGACCAG AAGCAGACTC GGGACGGGTA TCACACACTA CGATCTCGCC
CCGATTCAGC AGCAGCTGTT TACGAACACC CTCGCCACGC CGGCCAAAGC GTCCGTCAAC
TGGCAGGATC CGCTGTACGC GGCGCTCGGG CGCATGGATC TGGAAAAGCA GAAGGCGAAG
GACTTCTGGA CCGACCTGGG ACTCAATATG GTGTCGGCGT TCGCGCTGAT CGCCGCGCCG
TTCACCGGAG GGATGACGGC CGCCGTTCTG GTCGGAGCCG GCCTCGCCGT CGGCGCCGGC
ATGGCCGCCG CGAGCTGGGA CCGCTATCTC CAGCTCCGCC CACTCGAGAA CGCCGCGATC
AGGGACGACC TGTCCTTCGT CCAGAAAGGA GCCGTCGACG CGGCGCTCCT CGAGGCGACG
GTCGCGACGG TCGGCGTGTT CCTGGACGCT CTGGGTGTGC ATGGGGACAT GAAGCGGGCC
GCCGGCGTCA ACCGGGTGAA GGCACTGGCG GACCTCCACG GGGCGGTCGA GGCGCAGGAG
GCCGCCGCGC AGGCGCGCAT GAAGATGCAG GCGGAGGGTC TGAAGGACGC TGGCGCGGCC
ACCGCGGGAG CGGCGGCCGC GATCGGGGCC CACGAGCTCG AGGACGCCAT CCCGGAGCCC
GACATCGAGG TGAGGGCGGG CGGGCTGGAG GTCGATCTGC CCTCGGTCGC AGCCCCGGCA
CAACGTAGCG TGATCAGCAC CAGAAGTGTT CAGCGCGCGC CGAACAAGCA GCTCGCAGCC
GTCACTGCGA AAATGGCCGC GATCGATACC GCGAGAGATG TTCCGCGCGA CTGGGGAAAC
CGCTTCGAGC TGTCGGTGGT GGCGAGCGTT CTCCGCGGCG AGGTCCCGGA GATGTCCGGT
GTGGTCCACG CGTTTCAGGC GCAGCACAAC GCGAGTGGAC ACGGGATCGA CATCATCGCC
GTCGGGACCG GCTCCAGGGG AAGGCTCAAG TTCTGGCAGA TCGAGTGCAA GTGGGCCGGG
CCGGAATCGG GCTACCCGCG GCACCTCGGC GGATCACGCG CCGGCATCCA GACCAGCGCC
GGATGGACCA AGGACAACTT CGTCAGATGG TGGGAGGCCG CTCCCCCGGG GGAGAAACGG
CAGCTCCTCA ACGCCGTGAA GGCAGCGAAC GGCGGCCGCG CGATCGAGGT GGAGAAGCTG
ACGGATCTGA TCAGCAGGGC TGAGGTGATC ATAGCCGCGC CGCTCGGCGC CGGAGCCGCC
GGCGTGATGC GCAGGATATG GGGCGAGATG GGTGCGCTCA CGCGGTTCGG CGGCCGGAAG
ATGAGCTACC GGGAGTTCCG ACCGAGATGA
 
Protein sequence
MAVRGMLARR QSTSPPPAGA KPDAGPGASI RLRNASGVPQ VQREDVPAVT TTPKELISKY 
TTLLMLDEAG LGKDLAARVV ERRDVMLTHG VFDNLRTTDR DDVAEELARS AGSRLREVDE
GLRIRLIREM LDTVVDEDAE KQVTAVWLSF ENEGKLGTVI TNQRALWDRS LSECTPLSDR
FAAYTAAFGM DILKAAGVYL DENRKITEKE AKGFGLNLGG AAETAPPEQA DAYLDEVRKA
ARVVVQLQAL TYELRLVPVG YRSTPELQQK LSADARTDAE AYKAVTAPPP DTLRRYGQPE
LFDPARRPDY PPTGKEEPEM AQWTEAESHH SKIGALIAEY ARNYPAVYAS ITQGNIGELA
EATDAGKARG LVENILRKTL TAIEETRSRL GTGITHYDLA PIQQQLFTNT LATPAKASVN
WQDPLYAALG RMDLEKQKAK DFWTDLGLNM VSAFALIAAP FTGGMTAAVL VGAGLAVGAG
MAAASWDRYL QLRPLENAAI RDDLSFVQKG AVDAALLEAT VATVGVFLDA LGVHGDMKRA
AGVNRVKALA DLHGAVEAQE AAAQARMKMQ AEGLKDAGAA TAGAAAAIGA HELEDAIPEP
DIEVRAGGLE VDLPSVAAPA QRSVISTRSV QRAPNKQLAA VTAKMAAIDT ARDVPRDWGN
RFELSVVASV LRGEVPEMSG VVHAFQAQHN ASGHGIDIIA VGTGSRGRLK FWQIECKWAG
PESGYPRHLG GSRAGIQTSA GWTKDNFVRW WEAAPPGEKR QLLNAVKAAN GGRAIEVEKL
TDLISRAEVI IAAPLGAGAA GVMRRIWGEM GALTRFGGRK MSYREFRPR