Gene Francci3_4446 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFrancci3_4446 
Symbol 
ID3907422 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. CcI3 
KingdomBacteria 
Replicon accessionNC_007777 
Strand
Start bp5313550 
End bp5315301 
Gene Length1752 bp 
Protein Length583 aa 
Translation table11 
GC content70% 
IMG OID637881778 
Productglucose-methanol-choline oxidoreductase 
Protein accessionYP_483521 
Protein GI86743121 
COG category[E] Amino acid transport and metabolism 
COG ID[COG2303] Choline dehydrogenase and related flavoproteins 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGGCTGATT CCGGGTCGGA CGGCGGCTAT GACGCCGACG TGATCGTGAT CGGGAGCGGG 
TTCGGCGGCA GCGTCAGCGC GCTACGGCTC GCGGAGAAGG GCCATCGTGT CCTCGTGCTG
GAGGCCGGTC GCCGGTTCAC CCCGGACACG CTGCCGAAGA ACAGCTGGCA CGTTCGCCGG
TATCTGTGGA TGCCGCGGTT GGGCTGCCAC GGTATCCAGA AGATCACCTG GTTGGGTCGG
GTCGTGGTGC TGTCCGGAAC GGCGGTCGGT GGCGGCTCGG TCGTCTACGC GAACACGCTG
TACCGCCCGC TCGACGCCTT CTATACCGAC CCGCAGTGGC GGGACATCAC CGACTGGCGC
CGTGAGCTGG AGCCGTTCTT CGACCAGGCC GAACGCATGC TCGGGGTGAA TCCCAACCCG
ACGACGACGT ACTCCGACGA GGTGTTCCGC ACGGTCGCCG AGGAGATGCG GGTCGGGTCC
ACCTTCGCGG CGGCGCCGGT CGGTGTCTTC TTCGGCCGCG ACGGCAGCCG GGAACCCGGC
GTGCGTGTCC CTGACCCGTA CTTCGGCGGC GCCGGCCCGG CTCGGACGGG ATGCGTAGAG
TGCGGGGAGT GCATGAGCGG CTGCCGTCGA GGGGCCAAGA ACACCCTCGA CTGTAACTAC
CTGTACCTGG CCGAGCGGGC CGGTGTGCGG GTAGTCCCGG ACACCACGGT CACGGCTGTC
CGGCCACGCT CGACCGGCGG GTATGAGCTC GACATGGTCC GGACGGGGGG TCTCGTCCGG
CGTCGGCGAC GTACCCTCAC CGCGGAGCAG GTCGTGTTCG CCGCCGGCAC CCTGGGCACC
CAGCGGCTGC TGCTGGCGAT GAAACAGAGC GGTGACCTGC CGGCGTTGTC CGACCGGCTC
GGTCATCTCA CCCGGACGAA CTCCGAGGCG ATCCTGGGCG CATCCAGGCT GCGCCCCGAT
CAGCGGATCG CCCGGGGAGT CGCGATCACC TCGTCCTTCC ACCCCGACGA GCACACCCAC
ATCGAACCCG TGCGCTACGG CCGGGGGAGC AACCTGATGG CACTGATCAG CGCGTCGATG
ACCGACGGCG GCGGGCGGGT GCCGCGCTGG CTGAAATACC TGCGTCAGGT CGTGCTCCAC
CCACATCAGG CGCTGGCGTC GTCACTGCCC TGGCGATGGT CCGAGCGGAC CATCATTGCG
CTGGTCATGC AGTCGTGGGA CAACTCGTTG ATCGTCTCCC TGCGCCGTGG GCCGTTCGGT
CTCGGCTGGC TGACGAGTCG GCAGGGACAC GGTGAGCCGA ACCCCTCCTG GATCCCGGAG
GGCAACGATG CCGCCCGCCG GATCGCGGCC AGGATGGGCG GCTACCCCGG CGGGTCGATC
GGCGAGATCG CCAACATCCC GCTGACCGCG CACATCCTCG GCGGGGCCCC CATCGGCACG
GATCCCACCA CCGGAGTGAT CGATCCCTAC CACCGGGTCT TCGGATACGA GGGGCTGCAC
GTCGTCGACG GCGCCGCGGT CTCGGCGAAC CTCGGCGCGA ACCCGTCGTT GACCATCACT
GCGCAGGCCG AACGGGCGAT GTCCTTCTGG CCGAACAAGG GGGAACCCGA CCCGCGCCCA
CCTCTCGGGG CGGCCTACCG GCCGGTTGCG CCGAGAGCAC CGGCGCACCC CGCCGTCCCA
GCCGGCGCTC CCGGGACCTA CCGCGTGGTC GGCCCGGTCG GCCCGGTCGG ACCGGTCGCG
GGGACGAGGT GA
 
Protein sequence
MADSGSDGGY DADVIVIGSG FGGSVSALRL AEKGHRVLVL EAGRRFTPDT LPKNSWHVRR 
YLWMPRLGCH GIQKITWLGR VVVLSGTAVG GGSVVYANTL YRPLDAFYTD PQWRDITDWR
RELEPFFDQA ERMLGVNPNP TTTYSDEVFR TVAEEMRVGS TFAAAPVGVF FGRDGSREPG
VRVPDPYFGG AGPARTGCVE CGECMSGCRR GAKNTLDCNY LYLAERAGVR VVPDTTVTAV
RPRSTGGYEL DMVRTGGLVR RRRRTLTAEQ VVFAAGTLGT QRLLLAMKQS GDLPALSDRL
GHLTRTNSEA ILGASRLRPD QRIARGVAIT SSFHPDEHTH IEPVRYGRGS NLMALISASM
TDGGGRVPRW LKYLRQVVLH PHQALASSLP WRWSERTIIA LVMQSWDNSL IVSLRRGPFG
LGWLTSRQGH GEPNPSWIPE GNDAARRIAA RMGGYPGGSI GEIANIPLTA HILGGAPIGT
DPTTGVIDPY HRVFGYEGLH VVDGAAVSAN LGANPSLTIT AQAERAMSFW PNKGEPDPRP
PLGAAYRPVA PRAPAHPAVP AGAPGTYRVV GPVGPVGPVA GTR