Gene Franean1_4063 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_4063 
Symbol 
ID5672421 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp4843622 
End bp4845349 
Gene Length1728 bp 
Protein Length575 aa 
Translation table11 
GC content73% 
IMG OID641242939 
Producthypothetical protein 
Protein accessionYP_001508356 
Protein GI158315848 
COG category[S] Function unknown 
COG ID[COG2187] Uncharacterized protein conserved in bacteria 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value0.824829 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTTCGTGC AGAAGGACGC CTTTCCAGGG CAGGACACCG AGGTCCGCCC CGCGCCGTCG 
GAAATCCCGG GCCCTGCGCC CGTCCGGGAG CAGGCGGCCC GGGAAATCCG TCTCGACAGC
CGCGAGACCG TGCAGACTCC CACCGCCGTG GTCTTTCTCA CCGAGGACCG CGCGTACAAG
CTGCGCCGGG CGGTCAACCA CGGTTTCGTG GACTACCGCT CCCGCCGGGC CCGGCTGATC
GCCTGCGAGG ACGAGGTACG GCTCAACCGG CGCCTCGCCC CGGACGTGTA CCTCGGCGTG
GCCGACATCC GGGACGAGAC CGGGGCACTG CGCGACCACA TGGTCGTCAT GCGACGGCTG
CCGGCCGACC GCCGGCTCTC CGCGCTGATG ACAGCCGACG TCTCCGGCGA GCTGCGTGAG
CTCGCCCAGC GGATCGCGGC GTTCCACGAA GGGTGCGAGA CCACGCCCGA GATCACCCGC
ACCGGTGGTC TGTGCGCGTT GGAGGCACTC TGGCTGGAGG CGATGGACGG CCTCGCGCCG
TTCCGCGGCC GGATCCTCGA CGCCGCCACC GTCGACGAGA TCGGCCGGCT CGCGCTGCGC
TACCTGACCG GCCGGGGCCC ACTACTCGCG GAGCGCCAGG CCGCCGGCCG GATCCGCGAC
GGCCACGGCG ACCTGCTCGC CGACGACATC TACTGCCTGA ACGACGGCCC GCGGGTCCTC
AACTGCGTCA ACGTCGACCC CGCGCTGCGG GCCGGTGACG TCCTCGGCGA CGCAGCCTCC
CTCGCGATGG ACCTCGAACG GCTCGGCAAC GCCACCGCGG CCCGGACGTT CCTCGACGCC
TACCGTGAGT TCTCGGGCGA GACCCATCCA ACGTCGCTGG AGGACCTCTA CATCGCCTAC
CGGGCGGTCG TCCGCGCCAA GACCGCCTGC GTCCGCGACC ACCAGGGTGA CCCGGCCGCC
GCCGACGAAG CCCGCCGGCT CACCGACCTC GCGCTACGCC ACCTACGGCG CGGCCGTCCC
CGGCTCATCC TGGTCGGCGG CCTGCCCGGC ACCGGTAAGT CGACACTGGC CAGCCATCTC
GTCTCCGGCG AGGATGACTG GGTGCTGCTG AGCTCGGCCG CCGTCCGCGG CGAGCCCGTC
GGAGCGGGCG CGACCGCCCC CGAGTCCGCC TCCACTTCCG CCTCCGATTC GGCCGGGACG
GAGCCCGCGG CGGGGTGCTA CGGCGCGGAC GCGACCGAGC ACAGCTACGT GGAGGTGCTC
ACGCGGGCCC GCCACGCGCT CGAAAGAGGG AGGAGCGTCG TGATCGATGC CTCCTGGTCC
TCGAGGCGGA TGCGCGCACG GGCCGCCGAG CTGGCGGCGG AGTGCGACGC CGACCTGATG
CAGCTGCGGT GCGTGGTCCC GCCCCGGGTC GCGGTCGCCC GCATAGCCGA CCGCGCGACC
GTCCCCATCG CACTCGGCTC CACCACGGAC CGGTCCGGTC CGGACCACTC GACCGCGACC
CGTTCCGCTC CGGCGGGCGT CGTTCCGATC GGTGCCCTTC CGATCGGCGC CGTCGCGGAT
GGTGCCACCA CCGGGCACCG CGCCGACCTG GTCAACGCCC TGACTCCCAA CGAGTGGATC
TACCTGGACG TGGCCACCCG CACCGATCCG TGGCCCGACG CACGCGACAT CGACACCTCC
GCCCCAGCCG AACACGCAGT CACCGCCGCG TACCTTCTGA TCAACTAG
 
Protein sequence
MFVQKDAFPG QDTEVRPAPS EIPGPAPVRE QAAREIRLDS RETVQTPTAV VFLTEDRAYK 
LRRAVNHGFV DYRSRRARLI ACEDEVRLNR RLAPDVYLGV ADIRDETGAL RDHMVVMRRL
PADRRLSALM TADVSGELRE LAQRIAAFHE GCETTPEITR TGGLCALEAL WLEAMDGLAP
FRGRILDAAT VDEIGRLALR YLTGRGPLLA ERQAAGRIRD GHGDLLADDI YCLNDGPRVL
NCVNVDPALR AGDVLGDAAS LAMDLERLGN ATAARTFLDA YREFSGETHP TSLEDLYIAY
RAVVRAKTAC VRDHQGDPAA ADEARRLTDL ALRHLRRGRP RLILVGGLPG TGKSTLASHL
VSGEDDWVLL SSAAVRGEPV GAGATAPESA STSASDSAGT EPAAGCYGAD ATEHSYVEVL
TRARHALERG RSVVIDASWS SRRMRARAAE LAAECDADLM QLRCVVPPRV AVARIADRAT
VPIALGSTTD RSGPDHSTAT RSAPAGVVPI GALPIGAVAD GATTGHRADL VNALTPNEWI
YLDVATRTDP WPDARDIDTS APAEHAVTAA YLLIN