Gene Franean1_2072 details

Gene Information

Locus tag: Franean1_2072
Symbol:
ID: 5670473
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. EAN1pec
Kingdom: Bacteria
Replicon accession: NC_009921
Strand:
Start bp: 2495711
End bp: 2496733
Gene Length: 1023 bp
Protein Length: 340 aa
Translation table: 11
GC content: 75%
IMG OID: 641240994
Product: putative OxPP cycle protein OpcA
Protein accession: YP_001506415
Protein GI: 158313907
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG3429] Glucose-6-P dehydrogenase subunit
TIGRFAM ID: [TIGR00534] opcA protein
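
The coordinate and length fields above agree with each other under two assumptions that the record does not state explicitly: that the start and end positions are 1-based and inclusive, and that the protein length excludes the terminal stop codon. The following minimal Python sketch shows that arithmetic. The function names are hypothetical and are not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming the final codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record above.
length = gene_length_bp(2495711, 2496733)
print(length)                     # 1023
print(protein_length_aa(length))  # 340
```

Both results match the listed gene length (1023 bp) and protein length (340 aa).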


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value: 0.833679
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.0837693
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACCACCC TGTGGGACAC CACCGGATCG GCGGTGGTCA AGGCGCTGTC CGCCGAGCGG 
CGGGCCGCCG GCGCGCTCGC GTTCGGCCTG GCGCTGACGC TCGTGGTGGT CGTGGACGAG
CAGCACGTCA GCCAGGCGGA GAGCGCCGCC ACCGCGGCCG CCGCCGCCCA CCCGTGCCGG
CTGCTGATCG TCGTGCGCCG GCAGATCGAC TCGCCGCACC CTCGGCTGGA CGCCGAGGTG
TCGATCGGCG GCCGGCTGGG GCCCGGCGAG GCCGTCGTGA TGCGGATGTC GGGCCGGCTC
GCGCTGCACG CCGAGTCGGT CGTGCTCCCG CTGCTCGCGC CGGACGCCCC CGTGGTCACC
TGGTGGTACG ACGCTCCCCC GGAGAAGATC GCCTACGACC CGCTCGGCGT GTTCGCCGAC
CGCCGGGTCA CCGGGACCTA CGCGGCCCAC GACCCGCTGG CCGCGCTGCT GCAGCGGGCC
GAGGACTTCG TCCCCGGTGA CACCGACCTG GCCTGGACCC GCATCTCGGG GTGGCGCACC
CTGCTGGCGG CCGCGTTCGA CCAGGTCTCC GAGCCGGTGG GGCCGGCGAC GGTCGTCAGC
GAGCCGGGCA ACCCCAGCGC CCGCCTGTTC GCCGGCTGGC TGCAGGCGAA GCTGCGGGTC
CCGGTAACGA TCACCGACGC GCCGGGTAAG AAGGGCATCC AGAGCGTCCG CCTGGCGGTG
GGCGACGGCG AGCTCTCACT GGCCCGCACG GACAGCCGTT CGGCCGGTAT CACCCGCACG
GGTTACCCGA CCAGGGTGCT GCCGCTGCCC GAACGAGGGC TGGGTGACCT GCTCGCGGAG
GAGCTGCGCC GCCTCGACGA CGACAGCGTG TACGCCGAGG CGCTCTCGGC CTGGAGCGGC
GTCCCGGATC TGAACAGCCG ACCGCTGCAC CGCGAGCACA TCTGGCGCGA TCCGGCGCTG
GAGCGCTCCG AGGCGGCATT CGCGGCGATC CCACCCGCGC CGATCCCGCC CGCGGCGTCG
TGA
 
Protein sequence
MTTLWDTTGS AVVKALSAER RAAGALAFGL ALTLVVVVDE QHVSQAESAA TAAAAAHPCR 
LLIVVRRQID SPHPRLDAEV SIGGRLGPGE AVVMRMSGRL ALHAESVVLP LLAPDAPVVT
WWYDAPPEKI AYDPLGVFAD RRVTGTYAAH DPLAALLQRA EDFVPGDTDL AWTRISGWRT
LLAAAFDQVS EPVGPATVVS EPGNPSARLF AGWLQAKLRV PVTITDAPGK KGIQSVRLAV
GDGELSLART DSRSAGITRT GYPTRVLPLP ERGLGDLLAE ELRRLDDDSV YAEALSAWSG
VPDLNSRPLH REHIWRDPAL ERSEAAFAAI PPAPIPPAAS