Gene Francci3_0933 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFrancci3_0933 
Symbol 
ID3906097 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. CcI3 
KingdomBacteria 
Replicon accessionNC_007777 
Strand
Start bp1092200 
End bp1094533 
Gene Length2334 bp 
Protein Length777 aa 
Translation table11 
GC content73% 
IMG OID637878267 
Producthypothetical protein 
Protein accessionYP_480046 
Protein GI86739646 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGGGGTGG GCACGCTAAC CACGACTGAG GCCCGGCTGA TCCAGGGGCT CTTCGTTCCA 
CGGGAGCGTT GGGGCTGGGA GTTCGTCGAC ACCGGTCCAT CGGCCCCGAG CCCGCTCGCC
GACCGTCAAC CGGTCTGGCA GGAGCCCGAA TCGCCCGATC TGACCGCGCT CACGGCCCGG
CGAAACCATG CCGTCCGCGC CCTGCTGACC AGGCTCGTCC TGCCGATGGC CTTCGTGGTG
TTCGGCATGC TGCTCGCGGG CGAGGGGGGG TTCTTCTTCG TCCCGGCGGG ACTCGCGCTC
GCCGTGGTCG CGTGCCTGCC TCCGCTGATC CTGCACTACC GGCTGGGCCG GACACGATCC
CGGGCCGCGT CGGCCCGGGC CGACGGATAC ACCCGCCACC TGCAGGCGTT GACGTCCTGG
CAGACCGAGG TCGACGCGCA CGACCGGGCC GAGCTGCGAC GCCGAGAGGC CACCGCCGTC
TTCTATCCCG TCTCGGCCCC CGCGGCGACC CCCAGGATCG ACGTGTTCGG CGGAACCGGC
GACGGATGGG CGAGCCTGCT GGCCACCGCG GGCTGCTCGG TGCTCACCGG CGGCACCGGC
ATCCTGCTCG TCGATCTCAG CGAGCGGGCG GTGGGCGGGG GGTTGGCACG CCTCGCCGGC
CAGGCGCGCT GGCCGGTGGA GTCCTGGGAC CTGCCCCGCC AGCTCGACGA GGTGGCGCTC
CTCGACGGCC TCGGGCCGCG GGAGGCCGGC GAGCTCGTCG CCGACGCGTT CGGCATGTCC
CAGGAGAGCG GCGATGATCC GCACCGACGC GCCCTGCACG CGGACCTGGT CACCACCGTC
GCCGTCGTTC TGTACGAACG GCTCAGCATT CGCCGTCTCG CCGAGGGCCT GCGCGTCCTC
GAAGGTCTCG ACGACGGCAG CGCGGCCACG TCGCTGTCGA CGGCCGAGGC CGGGGAGCTC
ACCGCGCACA TGGACGCGTT CGGCCGCGGT GACCGGATCG CCGACGAGCT GCGCTACCTG
CGTTCCTCGC TGGAGGCGCT CAGCCCGCCA CCACCAGCCA CGGCCGCGAC GGCACCGGCC
GCGACGGCAC CGGCCGCGAC GGTCGAGGCC CATATGTTGG CGGCCGCCGC CGATATGTCG
ACGGCCAGGA TGCCGACGGC CAGGATGCCG ACGGCCAGGG TCGACAGCGC GCCCCGTCCA
CTGACGGCCT GGTGGCCGGT ACCCGGGCTG CGGGTTCTCG CCACGTCGAG CCGGGACTCG
TCCGCCCACC GCAAGGAACT CACGGACCGG CTCCTCGTCC AGACCGTGCT CCACCAGCTC
CGTTGCCGCC AGCGGGGCGG GCCGACCGGC CGGGACATGC TGGTCGTCGC CGGGGCGGAC
CATCTCGGCC GAGCGGTCCT GACCGGGCTG ACCCGCCACG CCGAGCTCGC CGGGGTGCGG
CTCGTGCTGC TCTTCGAACG GCTCGCCGAC GACGCCGAAC GGGTGCTGGG CAGCCAGGGA
GGCTCGACGA TCATCATGCG ATTGGGGAAC GGTAAGGACG CCGCGACCGC GGCGGAATTC
ATCGGGCGTG GTTACCGCTT CGTCATGTCT CAGGTGACGG CCCAGGTGGG GCGCAGTTTC
ACGGCCGGCG ACAGCGACAG CGTCGGTTAC CAGGACGGCA CCTCGGAGAC GACCGGCACC
AGCGGCGGAA CCGGCCGGAC CTACGACGGC ACCCGGCTGC TGCCCTGGTT GCAGAACACG
TCGAAGAACG CCGGATGGCA GGAGTCGGTG ACGGCGTCGC GATCGCAGAC CTGGCAGCGC
ACGACAAACT CGTCGGTGAG CGACTCGACA ACCGACGGCA CGACCTCCCA GCGGGTATAC
GAGTTCACCG TGGAGCCGAC CACCGTCCAG ACCCTGCCGG CGACCGCGTT CCTGCTGGTC
AACCCGGTCG ATGGGGCGGG ACGGGTGGTG CCGGGGGACT GCAATCCGGG CACCGTCCTG
CTGCCTGGGG TCTCCGCGGC CGACCGCTGG TCCGTACCTG ACGCGGCCGC CACGCCCCTG
GCGGGTCACC ACGTCGGAAC CGCCGGAGCC GACGTGATCC AGGCCGGTAC GGACCAGGCC
GGCCCTGCGC GGACCGGCGC GGAGGAACAG CGTCGGATCG AGCCTGCCCC GCCCGTCGGG
TTTCCGGGCA TCGGCCAGCA ACCGCCGGCC GACGAGGAGG TCTACCGGCT CGTCCCGCCC
AAGGCGCGTC CCAGCGGCCC CGGCAGTCCC GGCGGTCCCG GCGGCCAGAG CACTCACCCG
GGGGCGCGGG GATGGGTTCC CGGCGTCGGG GACCACCACC CGGGGCATGG CTGA
 
Protein sequence
MGVGTLTTTE ARLIQGLFVP RERWGWEFVD TGPSAPSPLA DRQPVWQEPE SPDLTALTAR 
RNHAVRALLT RLVLPMAFVV FGMLLAGEGG FFFVPAGLAL AVVACLPPLI LHYRLGRTRS
RAASARADGY TRHLQALTSW QTEVDAHDRA ELRRREATAV FYPVSAPAAT PRIDVFGGTG
DGWASLLATA GCSVLTGGTG ILLVDLSERA VGGGLARLAG QARWPVESWD LPRQLDEVAL
LDGLGPREAG ELVADAFGMS QESGDDPHRR ALHADLVTTV AVVLYERLSI RRLAEGLRVL
EGLDDGSAAT SLSTAEAGEL TAHMDAFGRG DRIADELRYL RSSLEALSPP PPATAATAPA
ATAPAATVEA HMLAAAADMS TARMPTARMP TARVDSAPRP LTAWWPVPGL RVLATSSRDS
SAHRKELTDR LLVQTVLHQL RCRQRGGPTG RDMLVVAGAD HLGRAVLTGL TRHAELAGVR
LVLLFERLAD DAERVLGSQG GSTIIMRLGN GKDAATAAEF IGRGYRFVMS QVTAQVGRSF
TAGDSDSVGY QDGTSETTGT SGGTGRTYDG TRLLPWLQNT SKNAGWQESV TASRSQTWQR
TTNSSVSDST TDGTTSQRVY EFTVEPTTVQ TLPATAFLLV NPVDGAGRVV PGDCNPGTVL
LPGVSAADRW SVPDAAATPL AGHHVGTAGA DVIQAGTDQA GPARTGAEEQ RRIEPAPPVG
FPGIGQQPPA DEEVYRLVPP KARPSGPGSP GGPGGQSTHP GARGWVPGVG DHHPGHG