Gene Francci3_3360 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFrancci3_3360 
Symbol 
ID3905942 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. CcI3 
KingdomBacteria 
Replicon accessionNC_007777 
Strand
Start bp3988032 
End bp3989669 
Gene Length1638 bp 
Protein Length545 aa 
Translation table11 
GC content74% 
IMG OID637880683 
Productglucose-methanol-choline oxidoreductase 
Protein accessionYP_482444 
Protein GI86742044 
COG category[E] Amino acid transport and metabolism 
COG ID[COG2303] Choline dehydrogenase and related flavoproteins 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0587927 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGTCCCAC CGGGCTACGG ACCGGCTCCC ACCCCGCAGG AGCTGGCGGA CGAGTACGAC 
ATCTGCGTGG TGGGCAGCGG GGCGGCGGGT TCGGTCGTCG CCTGGCTGCT CGCCCGGGCC
GGGCTGTCGG TGGCCGTGGT GGAGCAGGGT GGGTTCGTCA CGGACGAGGA CAGCTACGAC
GACGTGCTGG CCGCGGGGGA GTCCGCCTGG GTGCGGCAGG AGAACGGCAC CTGGGCCAAG
GTGGGCTCAC CCTGGACGAC CTGCAACGTG GGTGGCGGCA CGCTGTTCTT CGGGGGGGTC
CTGTTCCGCC ACCGCCCGCT CGACTTCGAC CCGGAGACCG TTCTGGGCCG GGCCGACCTG
CCGCTGCGCT GGCCGCTTGA GCCGGCGGAG CTGACGGACT ACTACGACGC CGTCGAGGAT
CTGATCGGCG TCGCGGGTGT CGCGGACGGC GACCCCAGCC TGCCGGTCCG GTCCAGGCCG
TATCCGCTGC CCCCGGTGGC CACCACGGCG GAGGGGCGGC GGCTGACGGA GGCCGCGAGG
TCCATGGGCT GGGCACCGTT TCCCACCCCC CTGGGCGTCA ACAGCATCGA GTACCGCGGT
CGCCCGGTGT GCGCCGCGGA TGCCCCCTGC ATCTCACGGC GCTGCCCGAT CCACGCCAAG
GGGGACGCGC TGGACCGCTT CCTGCGGCCG GCGATGGCCG CGGGCGCACG CCTGTTCACC
GGGCTGAAGG CGGAGGCCCT GCTCGGGGAC GCGCGTCGCG ACGCCACCGC GTTGCGGTGC
GTCCGGATGC CGGACGGCGA GCGCGTCGTC CTGCGGGCCC GGCACTTCGT GCTGTGCGCG
AACGCCGTGC AGACTGCCGC GCTGCTGCTG CGTTCGACCA CCGTGCGGCA TCCGGCCGGG
CTGGGCAACT CACACGACAT GGTCGGCCGG GGGCTCTGCT TCAAGATCGG TGAATATCTG
GTCGGGTACT GCCACGAGCC GACCTCGGCG CCCGCCCGCA GCCGGCTGAT GGGCCTGGGA
CCCATCTCCA CCTGCTGCGT GACCGACCTC TACCAGGACC CGGCGGCGCC GGGCGGGCTG
GGTGGCCTGC TCTACGAGAA CCGGCCCGAG CGGACCTACC GGTTACGGGA CACCGAACAC
CTGCTGCGGA TCGAGGCGCT GGTACCGGAC GAACCCCAAC CGGGCAACCG GGTCCGGCTG
GGGCCGGGGA CCGACGCCCA CGGCGTGCCC GACGTCCTGA TGGACTACCA GGCCCATCCG
CGCGACCTCG CCCGCTCCGA GTACATGCTC GGGCAGGGCG AGGCGCTGCT GCGGGCCGCC
GGCTGCGACG TCATCGTGCG GGAGGCGTCC GGGTGGGCGC TCGGCAGCGG GCACCTGCAC
GGCACCTGCC GCATGGGTGA GGACCCGGCC ACCAGCGTGA CCGGGCCCGA CGGCCGCCTG
CACGACGCGG ACAACGTCTT CGTAGCCGAC GGCGGCCTGT TGCCGTTCCC CGGCGGGGTC
AATCCGACGC TGACCATCCA GGCGCTCGCG CTACGGGTGG CCCATCGGCT CCTCGCGGAG
CGCTACGCCA CCGGTCGCGT CCCGATCGGG GAACTGGTCG GGCCGAGCGT GACCGCGGCG
AACCGGTCGC CGAGGTAG
 
Protein sequence
MVPPGYGPAP TPQELADEYD ICVVGSGAAG SVVAWLLARA GLSVAVVEQG GFVTDEDSYD 
DVLAAGESAW VRQENGTWAK VGSPWTTCNV GGGTLFFGGV LFRHRPLDFD PETVLGRADL
PLRWPLEPAE LTDYYDAVED LIGVAGVADG DPSLPVRSRP YPLPPVATTA EGRRLTEAAR
SMGWAPFPTP LGVNSIEYRG RPVCAADAPC ISRRCPIHAK GDALDRFLRP AMAAGARLFT
GLKAEALLGD ARRDATALRC VRMPDGERVV LRARHFVLCA NAVQTAALLL RSTTVRHPAG
LGNSHDMVGR GLCFKIGEYL VGYCHEPTSA PARSRLMGLG PISTCCVTDL YQDPAAPGGL
GGLLYENRPE RTYRLRDTEH LLRIEALVPD EPQPGNRVRL GPGTDAHGVP DVLMDYQAHP
RDLARSEYML GQGEALLRAA GCDVIVREAS GWALGSGHLH GTCRMGEDPA TSVTGPDGRL
HDADNVFVAD GGLLPFPGGV NPTLTIQALA LRVAHRLLAE RYATGRVPIG ELVGPSVTAA
NRSPR