Gene Francci3_3078 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFrancci3_3078 
Symbol 
ID3904280 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. CcI3 
KingdomBacteria 
Replicon accessionNC_007777 
Strand
Start bp3647638 
End bp3649353 
Gene Length1716 bp 
Protein Length571 aa 
Translation table11 
GC content76% 
IMG OID637880399 
Producthypothetical protein 
Protein accessionYP_482164 
Protein GI86741764 
COG category[R] General function prediction only 
COG ID[COG3889] Predicted solute binding protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGAAGGCAG CGCGTGGTCG TGGCCGGTGG CCCGCCGGCC CGCGGCGGCC GACGTCGTGG 
TGGCGGGAGT CGTCGTGGTG GCGGGAGTCG TCGTGGCGGG GCATGGCCGG CGGTGGACGG
CGGGCCGAGC GGGTCGCCGA CCTGCTGGAC GGACAGCGCC CCCCGTATGA CGAGACCGAC
GCGCGGCTGA TCGACACCGT GGCGGCCCTC CGGGACCTGC CGTCACCGCG GCTGCATCCG
GCACGGCACG CCGCCCTGCG CGGACAGCTG TTCGCCGCCG TCACCGGCTC CCCGGCATGC
GATCCCGTCC GTTCCCCCGC CGCCGTTTCC CCCGGCACCT CCGTGAAGGG CCACGAGACC
GAGGCCCGTC CCACAAGAAC CCGTCCCGCA CGCACGCCTC CCGAGGGCGT CGATCCTGAC
CTGCCGGGAA ACGATCCGGT CGACGCGCGG TGGGTACGCC GGACCGGTGC CCGCGGTGTC
TCTGCCTGCG GTGTCTCCGG CCGGGGCAGG ATGACCCGGG CCGCCCGGCC GCTGCTGGCC
GGAGCACTCA CCGCCGCCGT CACCACGGCG GCCCTCGCGG TCAGCTCGGG GGACTCGCTA
CCCGGCGATA CCCTGTACGG CGTCAAACGA CAGGTCGAAG ACCTCCAGGT GTCGCTGGTC
CGCGATCCGG TCGAGCGGGC GAAGACCCGG CTGGGCATGG CCGGCTTGCG GATGAGCGAA
CTGCGCACGA TCACGGTGAA CGACGGCGGG GTGATCGCCC CGGAAACCGG TGCCGGCGCT
CCGGAGACGA GCCCGCGGGT GCCGGTGGTG AACCCCACGG CGACCGCCGC ACCACCGACC
ATCGCGGTGT CACCGACCAT CGCGGTGTCA CCCGCTGCGG TGTCACCCAC TGCGGGGACG
TGGCCTCCCG GTCCCCGCAC ACCCGCCGTC TCCGGCGACG CCGGCGAATC CGATGGCCCC
GACGCGCCGA GCGACCCGCC CGGCCCCGGC AACGGGGACC GCCTCGACCC CGAGCTGGTC
AACGCGCTGC TGCGGGACTG GATCGCCGAG GTGCGCGCCG GCACGCAGGT ACTGCTGGCC
CGGGTCGCCG CCGGGGACAC GGACGCCTGG ACCACGGTGA ACGCCTTCAC CACCGAGCAG
TCCCGCGGGC TGAAGAACCT GCTGAGATCG CTTCCCGTGG GCTCGGTCGG ACCGGCGCAT
GCGGCTCTGG ATCTCATCGA CGACGTCAGG CGCAGGCTCG GCCCGCGGGC ACCAGCGCCG
GTCCGGGCCC CCTCACCCGT CCGCCAGATC TCTCCGATCG TGCCGACCGG TGACGCCCTC
ACCCCCCCGC CCTACGTGGC GCCGCTGCTC TCCGCACCTC GGCCCACGGC GACCGGGGCC
ACCGCCGCGC CCGCCCTCAG CACTCCCACC CCCGGCACTC CCACCCCTGC CGGCACCGGC
ATCGGCTCGC CGGGGCCCCC GAGCCCGGCG CCGGGTGGTG CCACGAGCGG AACGACCATC
CCGACCCCGA CCCCCGCCAC GACCCCGACC CCCGCCACGA CCCCGACCCC CGCCACGACC
CCGACCCCCG CCACGACCCC GACAGACGAC ACGACCCCGA CAGACGACAC GACCCCGACC
CCCGCCACGA CCGCATCCGA TGTCACGCCG ACCACGGTGG GCGCGCCCTC GCCGCAGAGC
CCCCCGAACG GGGCGCCGAC GCCGGGATCC CGCTAG
 
Protein sequence
MKAARGRGRW PAGPRRPTSW WRESSWWRES SWRGMAGGGR RAERVADLLD GQRPPYDETD 
ARLIDTVAAL RDLPSPRLHP ARHAALRGQL FAAVTGSPAC DPVRSPAAVS PGTSVKGHET
EARPTRTRPA RTPPEGVDPD LPGNDPVDAR WVRRTGARGV SACGVSGRGR MTRAARPLLA
GALTAAVTTA ALAVSSGDSL PGDTLYGVKR QVEDLQVSLV RDPVERAKTR LGMAGLRMSE
LRTITVNDGG VIAPETGAGA PETSPRVPVV NPTATAAPPT IAVSPTIAVS PAAVSPTAGT
WPPGPRTPAV SGDAGESDGP DAPSDPPGPG NGDRLDPELV NALLRDWIAE VRAGTQVLLA
RVAAGDTDAW TTVNAFTTEQ SRGLKNLLRS LPVGSVGPAH AALDLIDDVR RRLGPRAPAP
VRAPSPVRQI SPIVPTGDAL TPPPYVAPLL SAPRPTATGA TAAPALSTPT PGTPTPAGTG
IGSPGPPSPA PGGATSGTTI PTPTPATTPT PATTPTPATT PTPATTPTDD TTPTDDTTPT
PATTASDVTP TTVGAPSPQS PPNGAPTPGS R