Gene Francci3_2407 details

Gene Information

Locus tag: Francci3_2407
Symbol:
ID: 3906390
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. CcI3
Kingdom: Bacteria
Replicon accession: NC_007777
Strand:
Start bp: 2790337
End bp: 2791557
Gene Length: 1221 bp
Protein Length: 406 aa
Translation table: 11
GC content: 69%
IMG OID: 637879737
Product: FMN-dependent alpha-hydroxy acid dehydrogenase
Protein accession: YP_481503
Protein GI: 86741103
COG category: [C] Energy production and conversion
COG ID: [COG1304] L-lactate dehydrogenase (FMN-dependent) and related alpha-hydroxy acid dehydrogenases
TIGRFAM ID:
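
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The following is a minimal Python sketch, assuming the start and end positions are 1-based and inclusive and that the gene length includes the stop codon. The function names are hypothetical and do not belong to any database tooling.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive start and end coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for a CDS whose length includes one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the Start bp and End bp fields of this record.
length = gene_length_bp(2790337, 2791557)
print(length, protein_length_aa(length))
```

With the values listed in this record, the sketch gives 1221 bp and 406 aa, which agrees with the Gene Length and Protein Length fields.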


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value: 0.463668
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.0231198
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGAGGGTGC AGGACGCAAT CAATGTCGAG GACTTCCGGG AACTGGCGCG CCGCAGGCTT 
CCCCGGGCGG TTTTCGACGC CATGGAGGGC GGCGCCGGCG ACGAGGTGTC GCTCAGGCGT
AACCGCACGG CGTTCGACCG CATCGAGTTC CGCCCGCGGC CGCTCGCCGA CGTCGCCACA
CGAGACCTCT CGACTACCGT GTTCGGCGAG CGGCTGTCGA TGCCGATCAT GCTGGCGCCC
ACGGGCGCGG GCCGCCTCGC GCGCTCGTCC GCGGAGATCG CTGTGGCGAG GGCCGCGGCG
CGGGCGGACA TCGTGTACAT GCAGAGCACC GTGGCAGCCT TCCCGCTCGA GGATGTCGCC
GCGCGCTCGA CCGGCACGCT GTGGTATCAG CTCTACCTGC CCCCCAACCG CGCCGAGGTC
GGGAACCTCG TCCGGCGGAT CGCGGCGGCG GGTTACCGGG CCCTGGCCAT CACCATCGAC
ACGCCGGTCC TGGGCAACCG TGAACGCGAC ACCCGCAACA AGCTCATGAG TCGGCCGCCG
CATCCCAAGA TGCTGCTGCA GGGGGCCAGC AAGCCGGCGT GGGCGACCGA TTTCATCCGC
GGCAAATTCG ACTTCATGCG CAAATTCGAC TTCATGCGCA AATTCGACTT CATGCGCGGC
CAATTCGGCG CCGGCTGTCC CGGCGGCCCG TCCCGGCTAA GCCTCAACCA GACCCGAGCG
ACAATCAAAT CGTCGTCGGA CTGCGTCACC TGGGAGGACG TCGAGCGGAT CCGGTCGTTG
TGGGAAGGCC CGTTGCTCCT CAAGGGCCTG ATGCGCGGGG ACGAGTGCGA CCGGCTCGTC
GAGCTGGGCG TCGACGGTGT CGTGGTCTCG AACCATGGCG GGCGGCAGTT GGACGGCGTC
CCCGCGACCA TCGATATTCT GCCGGAGGTG GTTGACGCGG CGGCGCGCAG ACTCACGGTG
TTCCTCGACG GAGGTGTCCG GCGAGGCAAC GACGTGGTCA AGGCGCTGGC CCTCGGCGCC
GCAGGGGTCT TCGTGGGTCG GCCCTACCTG TACGGCCTCG CCGCGGGCGG CGAGGCCGGT
GTCCTACGGA TGATTGAACT GCTGCGCGTG GAATTCGACC ACGCGATGGC ACTGCTCGGC
GCCGCGACCG TGGCGGACCT CGACCGCAGC CTCGTCTCGG GCGCACGTAT CCCCAGCACC
CTCTCACCGT CGACGCGATA G
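
A GC content figure such as the one reported for this gene can be recomputed from the nucleotide sequence. The sketch below is one plausible way to do so in Python, assuming GC content means the fraction of G and C among the A, C, G and T characters, with the spaces and line breaks of the listing ignored. The function name `gc_content` is hypothetical.

```python
def gc_content(seq: str) -> float:
    """Fraction of G and C among the A/C/G/T characters of seq."""
    # Keep only nucleotide letters so that spaces and newlines are ignored.
    bases = [b for b in seq.upper() if b in "ACGT"]
    if not bases:
        raise ValueError("no nucleotides found")
    gc = sum(1 for b in bases if b in "GC")
    return gc / len(bases)


# First ten bases of the gene sequence, used here only as a short demo input.
first_block = "GTGAGGGTGC"
print(f"{gc_content(first_block):.0%}")
```

Passing the full gene sequence as a single string would give the value for the whole gene.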
 
Protein sequence
MRVQDAINVE DFRELARRRL PRAVFDAMEG GAGDEVSLRR NRTAFDRIEF RPRPLADVAT 
RDLSTTVFGE RLSMPIMLAP TGAGRLARSS AEIAVARAAA RADIVYMQST VAAFPLEDVA
ARSTGTLWYQ LYLPPNRAEV GNLVRRIAAA GYRALAITID TPVLGNRERD TRNKLMSRPP
HPKMLLQGAS KPAWATDFIR GKFDFMRKFD FMRKFDFMRG QFGAGCPGGP SRLSLNQTRA
TIKSSSDCVT WEDVERIRSL WEGPLLLKGL MRGDECDRLV ELGVDGVVVS NHGGRQLDGV
PATIDILPEV VDAAARRLTV FLDGGVRRGN DVVKALALGA AGVFVGRPYL YGLAAGGEAG
VLRMIELLRV EFDHAMALLG AATVADLDRS LVSGARIPST LSPSTR
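
The relationship between the gene sequence and the protein sequence can be illustrated with a small translation routine. This is a sketch under stated assumptions, not the method used to produce this record: it assumes translation table 11 assigns codons to amino acids as the standard code does, that the initiation codon (GTG in this gene) is read as methionine, and that the start codon set shown is the one used by table 11. The names `CODON_TABLE`, `START_CODONS` and `translate_cds` are hypothetical.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order; '*' marks a stop codon.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}
# Assumed set of initiation codons for translation table 11.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def translate_cds(seq: str) -> str:
    """Translate a complete CDS; the first codon is read as M."""
    dna = "".join(seq.split()).upper()  # drop spaces and line breaks
    if not dna or len(dna) % 3 != 0:
        raise ValueError("CDS length must be a non-zero multiple of 3")
    codons = [dna[i:i + 3] for i in range(0, len(dna), 3)]
    if codons[0] not in START_CODONS:
        raise ValueError(f"unexpected start codon: {codons[0]}")
    protein = ["M"]
    for codon in codons[1:]:
        residue = CODON_TABLE[codon]
        if residue == "*":  # stop codon ends translation
            break
        protein.append(residue)
    return "".join(protein)


# First 30 bases of the gene sequence listed in this record.
print(translate_cds("GTGAGGGTGC AGGACGCAAT CAATGTCGAG"))
```

The first 30 bases translate to MRVQDAINVE, which matches the first block of the protein sequence.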