Gene Francci3_2520 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFrancci3_2520 
Symbol 
ID3904664 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. CcI3 
KingdomBacteria 
Replicon accessionNC_007777 
Strand
Start bp2976786 
End bp2978435 
Gene Length1650 bp 
Protein Length549 aa 
Translation table11 
GC content63% 
IMG OID637879850 
Productmethane monooxygenase 
Protein accessionYP_481616 
Protein GI86741216 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGAGCCGTC AAAGCTTGAC AAAGGCACAC AAAAAGATCA CGGAGCTGTC ATGGGAACCG 
ACGTTCGCGA CGCCCGCGAA GCGGTTTGCC ACCGACTATA CCTTCGACAA GTCTCCGAAG
AAGGACCCGC TCAAGCAGAT CTTGCGCTCC TACTTCCCGA TGGAGGAGGA GAAGGACAAC
CGCGTGTACG GCGCGATGGA CGGCGCGATT CGCGGCAACA TGTTCCGCCA GGTCCAGCAG
CGCTGGATGG AGTGGCAGAA GCTCTTCCTG TCGATCATCC CGTTCCCGGA GATCTCCGCG
GCCCGGGCGA TGCCGATGGC GATCGATGCC GTCCCCAACC CTGAGATCCA CAACGGGCTC
GCGGTACAGA TGATCGACGA GGTGCGTCAC TCGACGATCC AGATGAACCT CAAGCGCCTG
TACATGAACC ACTACATCGA TCCGGCCGGC TTCGACATCA CTGAGAAGGC GTTCGCCAAC
AACTACGCCG GAACGATCGG CCGCCAGTTC GGTGAGGGCT TCATCACCGG TGACGCGATC
ACCGCGGCGA ACATCTACCT GACCGTCGTC GCCGAGACGG CGTTCACCAA CACCCTGTTT
GTCGCGATGC CGTCCGAGGC GGCCGCCAAC GGTGACTACC TGCTACCGAC GGTGTTCCAC
TCGGTCCAGT CCGACGAGTC TCGCCACATC AGCAACGGGT ACTCCATCCT GCTCATGGCG
CTCGCCGACG AGGCCAACCG GCCGCTGCTC GAGCGTGACC TGCGCTACGC GTGGTGGAAC
AACCACGCCG TGGTGGATGC CGCGATCGGC ACCTTCATCG AGTACGGCAC GAAGGACCGC
CGCAAGGACC GCGAGAGCTA CGCGGAGATG TGGCAGCGCT GGATCTACGA CGACTACTAC
CGCAGCTACC TCATCCCGCT CGAGAAGTAC GGCCTGGTGA TCCCCCACGA CCTGGTCGAG
GAAGCCTGGA ACCGTATCTA CAACAAGGGG TACGTGCACG AGGTCGCGCA GTTCTTCGCC
ACCGGATGGC CGGTGAACTA CTGGCGGATC GACCCGATGA CCGACGACGA CTTCGAGTGG
TTCGAGTCGA AGTACCCGGG CTGGTACAAC AAGTACGGCA AGTGGTGGGA GAACTACAAC
CGGATGCGTT ACCCCGGCCG GAACAAGCCC ATCGCGTTCG AGAACGTCGA CTACCAGTAC
CCCCAGCGGT GCTGGACGTG CATGGTGCCC TGCGTGATCC GCGAGGACAT GGTGCATGAC
AAGGTGGACG ACCAGTGGCG CACCTACTGC TCCGAAGCCT GTCACTGGAC TGACAAGGTG
GCGTTCCGGC CGGAGTACCA GGGCCGCCCG ACGCCGAACA TGGGCCGGCT GACCGGCAAG
CGGGAGTGGG AGACTCTTTA CCACAACTGG GATCTCGCCG ATGTCATTTC CGACCTCGGG
TACGTCCGTG ATGACGGCAA GACCCTGATT CCGCAGCCCC ATCTCGAGCT CGACGACCCC
TCGAAGCTCT GGACGCTCGA CGATGTCCGC GGCATCACGT TCAGCAGTCC GAATGTCGCA
CTCAACGAGA TGAACGACGC CGAGCGTGAG GCGGCGATGG CGGCCTACCG GGCCGGCGGT
CCGGCTGGTC GGCCGGCACC CGTTTCCTAA
 
Protein sequence
MSRQSLTKAH KKITELSWEP TFATPAKRFA TDYTFDKSPK KDPLKQILRS YFPMEEEKDN 
RVYGAMDGAI RGNMFRQVQQ RWMEWQKLFL SIIPFPEISA ARAMPMAIDA VPNPEIHNGL
AVQMIDEVRH STIQMNLKRL YMNHYIDPAG FDITEKAFAN NYAGTIGRQF GEGFITGDAI
TAANIYLTVV AETAFTNTLF VAMPSEAAAN GDYLLPTVFH SVQSDESRHI SNGYSILLMA
LADEANRPLL ERDLRYAWWN NHAVVDAAIG TFIEYGTKDR RKDRESYAEM WQRWIYDDYY
RSYLIPLEKY GLVIPHDLVE EAWNRIYNKG YVHEVAQFFA TGWPVNYWRI DPMTDDDFEW
FESKYPGWYN KYGKWWENYN RMRYPGRNKP IAFENVDYQY PQRCWTCMVP CVIREDMVHD
KVDDQWRTYC SEACHWTDKV AFRPEYQGRP TPNMGRLTGK REWETLYHNW DLADVISDLG
YVRDDGKTLI PQPHLELDDP SKLWTLDDVR GITFSSPNVA LNEMNDAERE AAMAAYRAGG
PAGRPAPVS