Gene Francci3_4186 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFrancci3_4186 
Symbol 
ID3907151 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. CcI3 
KingdomBacteria 
Replicon accessionNC_007777 
Strand
Start bp4995603 
End bp4998494 
Gene Length2892 bp 
Protein Length963 aa 
Translation table11 
GC content70% 
IMG OID637881514 
Producttetratricopeptide TPR_4 
Protein accessionYP_483263 
Protein GI86742863 
COG category[R] General function prediction only 
COG ID[COG0457] FOG: TPR repeat 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAACCCGG CTGGCACGGA TACCACCCAC GACGCCGGGA AGACGGTGGA CTTCTTCGTC 
TCCTACGCCG GCGTCGACCA GGCGTGGGCG GAGTGGATCG CGGACACCCT GGAGAGGGCT
GGCCGCCGCG TGCTGCTCGA GGCATGGGAC TTCGTCCCCG GGTCGAACTG GCCTGCGCTC
GTCCAGGCGG GACTGACCAC CGCAGGCCGG CTCCTGGCGG TGCTGACCCC GGCCTATCTC
GCCTCGGTCG GCGCGGCGGC GCAGTGGCAG GCGATCTGGG CGGCGGACAT GGGCGGGCGG
GGACGCCGGC TGATTCCGGT GCAGGTCGAG CCGTGCGAGC CCGACACGCT CGGCCTGCTC
GGCAGCCTCA CCTGGATCGA CCTGACCGAC CTGACCGACC TGGCCGGGCC GGGCGCCGCC
GCAGCCGCTC GCGCCCGGCT GCTGGACGGG ATACGTGCCG CCGTGACAGG GCAGGTGCGG
TCGGTCAATC CCACGCCGCT CCCCACCCGC GACACCCGCG ACGTGCCCGC GGCGGCGGAC
GAGTCCGAAC GGGCAGCGCG GTATCCGGGT CGGCCCCCGC TCGTGTGGGG GCTGCCGCCG
GAACTGGTGC GTAACCCCCG CTTTGTCGGG CGTGCATCCG AACTCGCCGA GCTGCGCGCG
CGACTGACGG GCTTCGACCG GGCCGTGCCC GGCCCCGGTT CCCCTGTGCC GGGGGTGCGG
GCCCAGGTGG TGCATGGCCT GGGCGGGGTC GGCAAGACGG AGCTCGTCGT GGAGTTCGCC
TACCGGTACA GCCACGCCTA CGACCTGGTC TGGTGGATCC GCGGCGACAA GATGACGGAG
GTCGTCGCCG GCCTCGCCGC CCTCGCGGCC CAGCTGGGAC TCGCCCACGT GGAGGGGGAC
GAGGCAGCGG CCGCCGCGGC GGCCGAGGCG TTGCGCGCCG GTCGACCTCA TCGCCGGTGG
CTGTTGGTCG TCGACAATGC CGTTATTGGC AATGCCGTTA TTGGCAATGC CGTTATTGGC
AATGCCGTCA TGGGCAACAA CGCCGGTGCT ACCGCCACCA ACCCCTACCT GTCCAGGCTT
CTGCAGGCCG CGGAGACCGC CGAGTTCGGA CATCTCGTGA TTACCTCGCG CAACCCCGAC
TGGGCCGGTC GGGCTACCGC AACCGAGGTC GCGGTGCTGC CCCGGAGGGA CACGATCGAT
CTGCTGAGGA GGCACCGAGC CGCGCTGGAG GACGCCGAGG CTGAGCGGCT CGCGGCCGCG
GTCGGGGACC TGCCGCTGGC GGCGGAACAG GCCGGAGCCT GGCTCGCCGC CAGCGGCATG
ACCGTTTCGG ACTACCTGGC TGCGTTGCGC GTGGAGACTC GGGAACTCCT TACCCGCGGA
AAACCGGACC GCTATCCGTT CATTGTCGCC GCGGCGTGGA ACCTGGCCCT GGACGAGATC
GGACCGGACG AACCGGCCGT CGTCGAGCTG CTGCAGCTGG CCGGATTCTT CGGCCCGGAG
CCCATCCCGC TCGACCTGTT TCCCGCGCTG GTGGTGCTGG GGGAGAGCAA CGGGGAGCTG
GGGGAGAGCA ACGGGGAGGG GACGCTCTGG TCGGCGCTGA GCGCGGCCTG TTCCTCCCCG
CTGCGCTGGG GGGACGTGGT CGCCCGCCTC CGTGCTCTCG GGCTGGGCAA GGTCGAACAG
GGAACCATCA CCTTGCATCG GCTGGTGCAG GCGGTGCTGC GGGACCGGGT TCCGACGTCG
CGGCACCTGG AGTTCCGGGC CACCGTCGGC TGGTTGATGA TCAGGGCCCT GCCCACCGCG
ATCCAGGGTA ATCCGGCGGC CTGGCCGCGC TGGGCCGCGC TGCTCCCACA CCTTCAGGCC
CAGATCGAAG CCGAGCCGTC CCCGGCGGAC GCGAACGCGG CGGGCATGCT GGGGCTGGGC
AATCTGGCGG CGCTCTATCT CCAGGAAACC GGACAGCTCG CCGCCGCCGT GGACCTGCAC
ACCTGGGTGC TGGCTCGAAC CGAGCAGGTC GTGGGAACCG ACCACCCCAA CACCCTGGTT
CTCCGGAACA ACCTCGCCCT CGCCCTGCAG AAGGCCGGAC GCATTTCCGA GGCCATCGGG
TTGTACGAGC GGATCCTGGC GCGGGCCAGC CGCATCCTCG AATCCGATGA TCCCAATCTG
GGGATCAGCC GTAACAATCT CGCCAGCGCC TATTGGGCCG CCGGACGCAT CGCGGAGGCC
GTCGAACTGC TCGAGCAGGT TGTCGACGAC GCTGGACGGA TCCGTGGCCC GGACCATCGC
GACACCCTGC TGGCCCGGAG CAACCTGGCC AACGCATACC AGACGGCGGG GCGCTTGGCG
GAGGCGATCA GGCTGCACGA AGAGGTTCTC GTCGACGTCG TCCGGGTTCT CGGCCAGGAG
CATCTCACGA CCTTCCTCGT GCGCAACAAT CTCGCCAGCT CCTATCAGGC CAGCGGACGG
ACGACTGAGG CCCTCGACCT GTACGAGCAG GTACTGACCG GCCGGGAAGG CGTTCTCGGC
GACAACCATC CCGACACCCT GCTCTCCCGC AGCAATCTCG CCGACGCCTA CCAGGCGCTC
GGACGTCTGC CCGAGGCGAT CGATCTGTAC GAGCGGGTTC TCTCCGATGC CCGCGGTGTT
CTCGGCGCTG ACCATCCACA CATCCTGATC ACCTGGAACA ACCTGGCCTG CGCCGTTGCG
GCCGCGGGGC GCCTCGCGGA GGCGATCGAC CTGTACGAGC GGGTACTGGC CGACCAGCGG
CGTGTCCTCG GACCGGATCA CCGCGACACC CTGACCTCGC AGGGAAACCT CGCCAACGCC
TACCGGGCCG CCGGACGCAC GAATGCCGAC GCCGCTGATG GTGCCGACGC CGCTGATGGT
GCCGGGGACT GA
 
Protein sequence
MNPAGTDTTH DAGKTVDFFV SYAGVDQAWA EWIADTLERA GRRVLLEAWD FVPGSNWPAL 
VQAGLTTAGR LLAVLTPAYL ASVGAAAQWQ AIWAADMGGR GRRLIPVQVE PCEPDTLGLL
GSLTWIDLTD LTDLAGPGAA AAARARLLDG IRAAVTGQVR SVNPTPLPTR DTRDVPAAAD
ESERAARYPG RPPLVWGLPP ELVRNPRFVG RASELAELRA RLTGFDRAVP GPGSPVPGVR
AQVVHGLGGV GKTELVVEFA YRYSHAYDLV WWIRGDKMTE VVAGLAALAA QLGLAHVEGD
EAAAAAAAEA LRAGRPHRRW LLVVDNAVIG NAVIGNAVIG NAVMGNNAGA TATNPYLSRL
LQAAETAEFG HLVITSRNPD WAGRATATEV AVLPRRDTID LLRRHRAALE DAEAERLAAA
VGDLPLAAEQ AGAWLAASGM TVSDYLAALR VETRELLTRG KPDRYPFIVA AAWNLALDEI
GPDEPAVVEL LQLAGFFGPE PIPLDLFPAL VVLGESNGEL GESNGEGTLW SALSAACSSP
LRWGDVVARL RALGLGKVEQ GTITLHRLVQ AVLRDRVPTS RHLEFRATVG WLMIRALPTA
IQGNPAAWPR WAALLPHLQA QIEAEPSPAD ANAAGMLGLG NLAALYLQET GQLAAAVDLH
TWVLARTEQV VGTDHPNTLV LRNNLALALQ KAGRISEAIG LYERILARAS RILESDDPNL
GISRNNLASA YWAAGRIAEA VELLEQVVDD AGRIRGPDHR DTLLARSNLA NAYQTAGRLA
EAIRLHEEVL VDVVRVLGQE HLTTFLVRNN LASSYQASGR TTEALDLYEQ VLTGREGVLG
DNHPDTLLSR SNLADAYQAL GRLPEAIDLY ERVLSDARGV LGADHPHILI TWNNLACAVA
AAGRLAEAID LYERVLADQR RVLGPDHRDT LTSQGNLANA YRAAGRTNAD AADGADAADG
AGD