Gene Francci3_4388 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFrancci3_4388 
Symbol 
ID3907362 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. CcI3 
KingdomBacteria 
Replicon accessionNC_007777 
Strand
Start bp5244793 
End bp5246571 
Gene Length1779 bp 
Protein Length592 aa 
Translation table11 
GC content76% 
IMG OID637881719 
Producthypothetical protein 
Protein accessionYP_483463 
Protein GI86743063 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0167388 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGCCCCGCG GGCGGCGTCA GGCCGCGCCC GTCCCGTCCG GACCGGGCAG CGGCAGGCCC 
CGACCGGCCG GCATCCTGGC CCCGACCAGG GTGACCCTGC TGGTCCTCGC CGGTGGCGGC
GCCGCGGTCG TGGCCATGAT GAGCGAGGGT CGGCTCGGTT CGCGCGATCC GCACACGACC
GACCCTTCCT CCTGGTGGGG CATTCTCCCA TCGGCGGCCT CGTCGGATAC CGCAAGGGGC
GTCCTGGCCG GCCTGGCGGC CCTGGGCATC ATCACTCTGT GTCTGTGCTG GGGCGGACTT
GTGCGCGCGA CGATCGCCGG CCGCACCTCC CCGCGGGCCG CCCTCGCGGC GTGGATCGTC
TGGAGCCTCC CGTTCGCGGT GGGCCCACCA CTGTTCAGCC GGGACATCTA CGCCTACGCC
GCTCAGGGTG AGCTGGCGAG GCTCGGCCTG GATCCGGCGA CACACGGCGT CTCCGCCCTG
CTGACCGCCG AGGGCTCGGG CTCGGCGTTC GTCCGGGCCG TTGACCCCCG GTGGTGGAAC
ACCCATGCGC CCTATGGCGG TGCCGCGGTC GCCGTGGAGA AGAGCGCCGC GGCCCTCGGC
GGCGGGCCGG CCGGGACGGT CGCGCTGCTC AAGATAGTCG CGATCCTCGC AACGATCGCG
ATGGTCGTCC TGACGCTACG GCTCGCCGGA CCGGAGCCGG GCCGACGGGC CCTGGTCATG
GTGCTCGTCG CGGCCAATCC GGTCGTCGTC GTGCACCTGA TCGGCGGAGC CCACCTCGAC
GCAACGGCCG CGGCGCTGCT CGTGGCGGCG CTGGTGGTGG ACCGCCGACG GCTCGCCTCG
GCACCGGCGC CCGGCCCGGA GGGAAGGAGC ACGCCGACGG CTCGGCAGGC CGCGCTCGGC
GCCGCGGCGA CGGCGCTGGT CAGCGTGGCC GGAAGCGTCA AGGCGACCGC CCTCCTCGGC
GTCGCCTGGC TGGTGCTGGC GCATCTGCGG GCCGCCCGGT CGGCGCGGCG TCCGATCCGC
GCCGGCGCCG TGCTGCTCGC GGCCGATGCC GCCGCGGTCG CGCTCGCCGC GGGGCTGAGC
ATGGTGGCTG GCGGCTTCGG CCCGACCTGG ATCCACGCGT TGTCCACGTC GGGCACCCTG
ACCACGGGGG CCGCACCGGC CTCGATTCTC GCCGCGGCGG TCGACGGGGT CGACGGCATG
CTTGGTCCGG TCGGCCGCCA TCCGTCCGGC GCCGAAACGC AGCGGGCGAC TCGGGCCCTG
TGCCTGGCCG GCGCGGCGCT GGTGGTGGCC TGGCTCGCCC TGCGCGCCTG GCGGGACCGG
GACGGGTACA CCGGCCGTCG GCACGACCTG ATCGTGCTCG GCTACGGCGG GCTGGCGGTG
GCGCTCGGCA GCCCGGTGCT CTACCCCTGG TACCTCGCGC TGTGCCTGCC GGCACTCGCG
GTCGTCGTGG CCCTCGCCCG GTCAACCCCG CCGACGCCCC ACCCTGGAAC GGCCCGTCCC
AGGACGCCGG GGATGCAACC ACGCCGCATG AGGCGTCTCG GGGCGGCCGT CGAGCGGATC
GTCGGGGGGG TCGGGGGGTG GACCATCGCC CTGGTGGTCG TCACGTCGGC CTGGCTGTGC
CTGACGACCC TCGCGCCGCT CGCCGCGACC TGGCGGCTGC TCGGGCGCGT CGAGACGTCC
CTGGCCGTCA TCGGCTGCGC CGGCCTTGCG GCCGCGGGAA CGGTCATCAC GCTGGTCGCG
ATCCGGCGGC GGCACGCCAC GCCCACCCGC AGACGGTGA
 
Protein sequence
MPRGRRQAAP VPSGPGSGRP RPAGILAPTR VTLLVLAGGG AAVVAMMSEG RLGSRDPHTT 
DPSSWWGILP SAASSDTARG VLAGLAALGI ITLCLCWGGL VRATIAGRTS PRAALAAWIV
WSLPFAVGPP LFSRDIYAYA AQGELARLGL DPATHGVSAL LTAEGSGSAF VRAVDPRWWN
THAPYGGAAV AVEKSAAALG GGPAGTVALL KIVAILATIA MVVLTLRLAG PEPGRRALVM
VLVAANPVVV VHLIGGAHLD ATAAALLVAA LVVDRRRLAS APAPGPEGRS TPTARQAALG
AAATALVSVA GSVKATALLG VAWLVLAHLR AARSARRPIR AGAVLLAADA AAVALAAGLS
MVAGGFGPTW IHALSTSGTL TTGAAPASIL AAAVDGVDGM LGPVGRHPSG AETQRATRAL
CLAGAALVVA WLALRAWRDR DGYTGRRHDL IVLGYGGLAV ALGSPVLYPW YLALCLPALA
VVVALARSTP PTPHPGTARP RTPGMQPRRM RRLGAAVERI VGGVGGWTIA LVVVTSAWLC
LTTLAPLAAT WRLLGRVETS LAVIGCAGLA AAGTVITLVA IRRRHATPTR RR