Gene Franean1_4840 details

Gene Information

Locus tag: Franean1_4840
Symbol:
ID: 5673181
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. EAN1pec
Kingdom: Bacteria
Replicon accession: NC_009921
Strand:
Start bp: 5803125
End bp: 5804375
Gene Length: 1251 bp
Protein Length: 416 aa
Translation table: 11
GC content: 77%
IMG OID: 641243696
Product: oxidoreductase domain-containing protein
Protein accession: YP_001509112
Protein GI: 158316604
COG category: [R] General function prediction only
COG ID: [COG0673] Predicted dehydrogenases and related proteins
TIGRFAM ID:
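
The length fields above are consistent with the coordinates, which can be checked with a few lines of arithmetic. This is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon is a stop codon that is not counted in the protein length; the variable names are illustrative only.

```python
# Coordinates copied from the record above (assumed 1-based, inclusive).
start_bp = 5803125
end_bp = 5804375

# Inclusive coordinates: both end positions belong to the gene.
gene_length = end_bp - start_bp + 1

# A CDS should be a whole number of codons; the last one is the stop codon.
codons, remainder = divmod(gene_length, 3)
protein_length = codons - 1

print(gene_length, remainder, protein_length)  # 1251 0 416
```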


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value: 0.364458
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.0440629
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGAGCGCCC CGCTGACGGG CGCACTGGGC GCGCAGCTGG TGCCTGTTCA GTCGCGGGTG 
ACCCAGCCGC AGCTCCAGCC CCAGCCGCCG CAGTCGCCGC GGCCGGTTCG GCTGGCGCCG
CCGCGGCCCG CGACCGGCGC CGTCCGGGTC GCCGTCATCG GGCTGGGCTG GGCCGGCAGG
TCGATCTGGC TGCCCCGGCT GCGTGACCAC CCGCGGTTCG CGGTCGCCGC GGCGGTCGAC
CTCGACGCGG ACGCCCGCGC CGCGGTTGCG GCGGACGGCG TCGACGCCCC GCTGCTTGCC
AGCCCGGATC TGCTGAGCCC CGACGAGATC GACCTGGCGG TCGTCGCCGT GCCGAACCAC
CTGCACAGTG TGGTCGGCGG CCGGTTGCTC GCCGCGGGCC TGCCGGTCTT CCTGGAGAAG
CCGGTCTGCC TCAGCGGCGC GCAGGCGGAG GAACTGGCCC GTGCTGAGCG GGCAGGTGGG
GCGGTCCTGC TCGCGGGCAG CGCCGCTCGC TGCCGGGCGG ACGTCCGGGC GCTCTACACC
CTCGCGCGGG CCTGCGGGAT GATCCGGCAC GTCGATCTCG CCTGGGTACG TGCCCGGGGC
GTGCCCGACG CGGGCGGCTG GTTCACCGAC ACCACGCGCG CCGGGGGCGG GGCGCTGCTC
GACCTCGGCT GGCACCTGCT CGACACGGTC GCTCCACTGG TCGGGACGGC GGACTTCACC
CAGGTCGTAG GCACGGTCTC CGCGGACTTC GTCCGCAGCG GGGCGGGGGG AGCCACCTGG
CGCCATGCCG GCGGCCCCGG CTCGGGCCGG AGCGAAGCGG ACCGCGGCCG GCGCGGCGAC
GTCGAGGACA CGGCCCGGGG GTTCCTGGTC ACCGAACGCG GTGTGTCGGT GTCACTGCGG
GCGAGCTGGG CCTCCCACGA GGCGCTCGAC AGCACGGTGA TCAGAGTGGA GGGCAGCGCG
GGAACCGCCA CGCTGACCTG CACCTTCGGC TTCAGCCCGA ACCGGCGCGA CGGATCGGTG
CTGACGTACA CGCGGGACGG CGACACCGTC CGGGTGCCGG TGCCGTCCGA GCCGGTGGGC
GCCGAGTACC GGCGCCAGCT CGACGAGCTC CCGGCCCTGC TCGCCGACCC GGGAGCGCGG
GGACGCGCGG TCGCGGAGGC GCGGCGTGCC GTCGACGCTG TGGAGCGGTT CTACCGTTCG
GCGCGCCCGC CAGGTGCGGC GGCGGGCGAC CACCGGTCGG GACGGATCTG A
 
Protein sequence
MSAPLTGALG AQLVPVQSRV TQPQLQPQPP QSPRPVRLAP PRPATGAVRV AVIGLGWAGR 
SIWLPRLRDH PRFAVAAAVD LDADARAAVA ADGVDAPLLA SPDLLSPDEI DLAVVAVPNH
LHSVVGGRLL AAGLPVFLEK PVCLSGAQAE ELARAERAGG AVLLAGSAAR CRADVRALYT
LARACGMIRH VDLAWVRARG VPDAGGWFTD TTRAGGGALL DLGWHLLDTV APLVGTADFT
QVVGTVSADF VRSGAGGATW RHAGGPGSGR SEADRGRRGD VEDTARGFLV TERGVSVSLR
ASWASHEALD STVIRVEGSA GTATLTCTFG FSPNRRDGSV LTYTRDGDTV RVPVPSEPVG
AEYRRQLDEL PALLADPGAR GRAVAEARRA VDAVERFYRS ARPPGAAAGD HRSGRI
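
The relationship between the gene sequence and the protein sequence above can be illustrated with a short translation sketch. It assumes the standard codon assignments (which translation table 11 shares for elongation codons) and that the first codon, here GTG, is read as methionine because it is the initiator. The function names `translate_cds` and `gc_percent` are hypothetical helpers written for this example, not part of any database tool.

```python
from itertools import product

# Standard codon assignments in TCAG order; table 11 uses the same
# assignments and differs only in the set of permitted start codons.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def clean(dna):
    """Remove the spaces/newlines used for 10-base grouping and upper-case."""
    return "".join(dna.split()).upper()


def translate_cds(dna):
    """Translate a CDS, reading the first codon as Met and stopping at a stop codon."""
    seq = clean(dna)
    usable = len(seq) - len(seq) % 3  # ignore a trailing partial codon
    residues = []
    for index in range(0, usable, 3):
        aa = CODON_TABLE[seq[index:index + 3]]
        if aa == "*":
            break
        # Assumption: the initiator codon is always translated as M.
        residues.append("M" if index == 0 else aa)
    return "".join(residues)


def gc_percent(dna):
    """Percentage of G and C bases in the sequence."""
    seq = clean(dna)
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


# First 30 bases of the gene sequence listed above.
head = "GTGAGCGCCC CGCTGACGGG CGCACTGGGC"
print(translate_cds(head))  # MSAPLTGALG
print(gc_percent(head))
```

The translated fragment matches the first ten residues of the protein sequence. Passing the full gene sequence to the same functions would be the natural way to check the listed protein and GC content fields.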