Gene Franean1_4898 details


Gene Information

Locus tag: Franean1_4898
Symbol:
ID: 5673238
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. EAN1pec
Kingdom: Bacteria
Replicon accession: NC_009921
Strand:
Start bp: 5880199
End bp: 5881683
Gene Length: 1485 bp
Protein Length: 494 aa
Translation table: 11
GC content: 65%
IMG OID: 641243753
Product: hypothetical protein
Protein accession: YP_001509169
Protein GI: 158316661
COG category: [H] Coenzyme transport and metabolism
COG ID: [COG1232] Protoporphyrinogen oxidase
TIGRFAM ID:
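
The gene and protein lengths listed above can be cross-checked against the start and end coordinates. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive and that the final codon of the CDS is a stop codon; the variable names are illustrative and not part of the record.

```python
# Coordinates as listed in the record (assumed 1-based, inclusive).
start_bp = 5880199
end_bp = 5881683

# Inclusive coordinates: both the start and the end base are counted.
gene_length = end_bp - start_bp + 1

# Each codon is 3 bases; the last codon is assumed to be a stop codon
# and therefore does not contribute an amino acid.
protein_length = gene_length // 3 - 1

print(gene_length)     # 1485
print(protein_length)  # 494
```

Both values agree with the Gene Length and Protein Length fields of the record.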


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.00550747
Plasmid hitchhiking: No
Plasmid clonability: decreased coverage
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.0440629
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGACCGAAA AGCCAGAGGT AGTCGTCATC GGAGCCGGGC CGGCCGGTCT CTCCGCCGGC 
TGGGAGCTGA TGAAGCGGGA GATCCCCGTG ACGATTATCG AGGGTGACTC GGTGGTCGGC
GGAATCAGCC GTACGGCCCA GCGGGACGGA TGGCGTTTCG ACATCGGGGG CCACCGTTTC
TTCACCAAGG TCCCCGAGGT CGAGAAGCTG TGGCACGAGA TCCTGCCGGA CGAGGACTTC
CTACTCCGGC CGCGGTCGAG CCGCATCTAC TACAACGGCA AGTTCTTCGA CTACCCACTG
AAGGCCGGTA ACGCCCTCGG CGGGCTCGGG GTCGCCGAGG CGGCACGCTG CATCGGCTCG
TACGCGCTGG CGAAGCTGCG CCCGCCGAAG GACCAGTCGA ACTACGAGAA CTGGCTGGTC
GCCCGGTTCG GCTGGCGGCT CTACCGCACC TTCTTCAAGA CCTACACCGA GAAGCTCTGG
GGTGTGAAGG TCAGCGAGAT GCCGTCCGAC TGGGCCGCCC AGCGGATCAA GAGCCTCTCT
CTGATGAACG CCATCACCAA CGCGGTGCTG CCCAAGCGGA ACCAGAAGGA CATCACCTCC
CTCATCGAGG AGTTCCAGTA CCCGAAGTTC GGGCCGGGAA TGATGTGGGA GACGGCCGCG
GACAAGATCG TCAAGCAGGG CGGTCGGATC GTCTTCGAGG AGAAGGTCCG CAAGATCCAT
CACGAGAACG GGCGCGCGAC CGGTGTCACG ACAGTGGTCA CTGGCGGCTA CGGGCCGGGC
GCCGGCGCGC CGGAGTCGTC CCGGGATGAC CTAGGCACCG AGTACCAGTA CACCGGCGAC
CACTTCATCT CCTCGATGTC GTTCTCGTCG CTGGTGCGCG TGATGGACCC GCCGGTCCCG
GCGCGCGTCC TGGCCGCCGC GAACGCCCTG AAGTACCGCG ACTTCCTCAC CGTCGCGCTG
GTCGTTCCCA AACCGGCCGG ATTCCCAGAC AACTGGATCT ACATCCACGC TCCGGACGTC
AAGGTCGGCC GCATCCAGAA CTTCGCGTCC TGGTCGCCGT TCCTGGTGAA GGACGGCCGG
ACCTGTCTCG GCCTGGAGTA CTTCGTCTTC GAGGGCGACG AGATGTGGAA CTCCTCGGAC
GAGGAGCTGA TCGCGCTCGG CACCAAGGAG CTGGCCAAGC TGGGCCTCGT CCAGGCGGAC
CAGGTCGAGG GCGGCTATGT CGTGCGGATG CCCAAGGCGT ACCCGTACTA CGACATGGAC
TACAAGAAGA ACGTCGACAT CATCCGCGGC TGGCTCGAGG ACTACGCTCC CAACGTCCAC
CCCGTCGGCC GTAACGGAAT GCACCGCTAC AACAACCAGG ACCACTCGAT GCTCACCGCG
ATGCTCACCG TCGAGAACAT CATCGACGGG AAGAGCCACG ACGTGTGGGA GGTCAACGTC
GAGGAGGACT ACCACGAGGA GGTCTCCTCC CCGGGCCGGT CGTAG
 
Protein sequence
MTEKPEVVVI GAGPAGLSAG WELMKREIPV TIIEGDSVVG GISRTAQRDG WRFDIGGHRF 
FTKVPEVEKL WHEILPDEDF LLRPRSSRIY YNGKFFDYPL KAGNALGGLG VAEAARCIGS
YALAKLRPPK DQSNYENWLV ARFGWRLYRT FFKTYTEKLW GVKVSEMPSD WAAQRIKSLS
LMNAITNAVL PKRNQKDITS LIEEFQYPKF GPGMMWETAA DKIVKQGGRI VFEEKVRKIH
HENGRATGVT TVVTGGYGPG AGAPESSRDD LGTEYQYTGD HFISSMSFSS LVRVMDPPVP
ARVLAAANAL KYRDFLTVAL VVPKPAGFPD NWIYIHAPDV KVGRIQNFAS WSPFLVKDGR
TCLGLEYFVF EGDEMWNSSD EELIALGTKE LAKLGLVQAD QVEGGYVVRM PKAYPYYDMD
YKKNVDIIRG WLEDYAPNVH PVGRNGMHRY NNQDHSMLTA MLTVENIIDG KSHDVWEVNV
EEDYHEEVSS PGRS
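
The relationship between the gene sequence and the protein sequence can be illustrated with a short translation sketch. The record lists translation table 11 (the bacterial, archaeal and plant plastid code), which assigns the same amino acids as the standard code but accepts additional start codons such as GTG, read as M at the first position. The helper names `clean` and `translate` below are hypothetical, and the code is a sketch under those assumptions rather than the database's own procedure; it does not handle ambiguous bases.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G; third position varies fastest).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

# Start codons accepted by translation table 11.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def clean(seq):
    """Remove the spaces and line breaks used to group bases in blocks of 10."""
    return "".join(seq.split()).upper()


def translate(dna):
    """Translate a CDS, reading an alternative start codon as M; stop at '*'."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        codon = dna[i:i + 3]
        if i == 0 and codon in START_CODONS:
            aa = "M"
        else:
            aa = CODON_TABLE[codon]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First line of the gene sequence above (60 bases, 20 codons).
first_line = "GTGACCGAAA AGCCAGAGGT AGTCGTCATC GGAGCCGGGC CGGCCGGTCT CTCCGCCGGC"
print(translate(first_line))  # MTEKPEVVVIGAGPAGLSAG
```

The output matches the first 20 residues of the protein sequence, including the initial M encoded by the GTG start codon.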