Gene Francci3_3449 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFrancci3_3449 
Symbol 
ID3905689 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. CcI3 
KingdomBacteria 
Replicon accessionNC_007777 
Strand
Start bp4106416 
End bp4109586 
Gene Length3171 bp 
Protein Length1056 aa 
Translation table11 
GC content77% 
IMG OID637880772 
Producttetratricopeptide TPR_4 
Protein accessionYP_482532 
Protein GI86742132 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0502975 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGACCGGCG GGGGCGATGG GCACGGCGCT CCCGACTGTT TCGTGTCGTA CGCGGACGAC 
GGGCGGGCCT GGGCCGAATG GATCGCCGGG ACGTTGGAGG ATGCCGGCTA CCAGGTCCTG
CTTGAGTCCT GGGCGCCGGC GGGGACGCAT CGGGTGGGGT GGCTGGACCG GGCGGTGAGC
CAGGCCCGGC ACACGATCGC GGTGGTCTCC GACGGCTACC TGCGCTCCTC GCCCGCGGTC
GCCGAATGGG CGGCGGCGTG GTCGGCCGAC CCGACCGGAG GCGAACGCCG GCTGCTGGTG
ACCAGGGTGG CGGACCGGCC GCTGCCCGGT CTGCTAGGCC AACTCGTCCC GGTCGACCTG
TTCGACCGCA GCGAGATCGC CGTCCGGGCC AGCCTGCTCG CGGCCCTGCG CGGCGACCGG
CCACAACCGG AACAGCCGCC GGGCTTCCCG GGCCGGCGGG GGATCTTCCC GCCGGAGCTG
CCGGCGGTGT GGAACGTGCC GGATCAGCCG GCCCGGTTCG TCGGCCGGAC GGCCGCGCTC
GCCCGGCTCG ACGAGGCGCT GTCCCGCTCG CCGCTGGTCA CCGTGACGGG AATCGCGGGG
ATCGGGAAGA CCAGCCTGGT CACCGAGTAC GTCCGCCAGC ACCGGGCGGA TCTCGACGCC
GTGTGGTGGG TGCCGGCGGA ACGCCCCGAG CTGATCGGCG AGCGGGTGCG CGAGCTCGCC
CCGGCGGTGG GCCTGCCCGG CCACGCGGAA CCGGCCGCCG TGATCGCGAA CCTCGGCCGG
GCCGACGGCC GCTGGCTGAT CGTCCTCGAC GACGCCGCCG ACGCCGACAC GCTGCCGGAC
TGGCTGCGGC CCACCGAACC CGGCCGGCTG CTACTGACCT CCCGCAACCC CGGCTGGGAC
CATCTCGGCC CGGTCGTGCC GGTCGGACCG ATGGACCGGG CCGAGTCCGT CACCCTGCTC
GCCGGCCGGC TGCCCGCCGT CGAACGGACG GTCGCCGACC GGATCGCCGA GCGGCTCGGG
GACCACCCGC TCGCCCTCGA CCAGGCCGCC CACCGCATCA GCACCGGCCG CACCCCCGCC
GAGACCTACC TTCAGGTGTT AATCGACCGG CCGGAGGTCC TCCTGGCGCA GGGGGAGGTG
AGCGGGCGGC CCGGGGCCAC CGCCGCCACC CTCTGGGACG AGCCGATCCG CCGGCTCGAC
GCCGAGGCAC CGGCCGCCGG CCACCTGCTG CGGATCGTCG CGCACGGCGA CAACGAACCG
ATCCCGATCC GGCTGTTCAC CGCCGAGCCG GACGTGATCC CCGATTCCGG GCTGCGGGCG
GCGGCCGGCG ACCCGCTCGC CCTCGCCGAC ACCGTCGGCG TCCTGGAGCG GTACGGCCTG
GCCCACCGCG ACGCGGACAC CGTGACGATG CACCGGCTGG TGCGGGCCGC CGTGCACACC
CACACCGGCC CCGACCAGGC CGACGAGATC GTCGCGACCA TGGCCCGGAT GCTGCGCGCG
TCGCTGCCCG ACGCCGTCAC GGCCAACCCG GAAGCCTGGC CCGCCTGGCG GGAGCTGCTC
CCGCACACGC TCGCCGTGCT CGACGCCACC GCCACCGACT CCACCGACGC CGACTCCACC
GACGCCGACG GGCCGGAGGC CGACGGGCCG GAGGCCGACG GTCCGGAGGC CGCCTGGCTG
GCGGAACGCT CGGCGGCCTA CCTGCTCGGG CAGGGCCGAG CCGATCAGGC CCTCCCCCTC
GCGGCCCGCG CCCTGGCCGC CCGCGAGCGG CTCGACGGCC CGGTGCACGT CGACACCCTG
GCCTGCCGGG AGACCCTCGC GCGGGTCGCC CTCGGCGCCG GACGGATCGA CGCCGCCGGC
CGCCTGGCCG AACACACCGT CGCCGACCGC GAACGCATCC TCGGCCCCGA GCACCCCGAC
ACCCTGGTCA GCCGGGACAC CCTAGCCCAG CTATTCCAGA AGGTCGGCCA GACCGACCGC
GCGGTCGAGA TGTTCCAGCG CACCCTCGCC ACCCGGGAAC GCATCCTCGG CCCCGAGCAC
CCCGACACCC TGGAAGGCCG GCACAGCCTC GGCCGCGCCT ACGACGCCGC CGGCCGCGAC
GACGAGGCGG CCCGGCTGCT GCGGGACACC CTCGCCGACC AGCGGCGGAT CCTCGGCCCG
GAGCACCCCG CCACCCTCGA CACCCGCCAC AGCCTCGCCG TCGCCTACCG CCGGATCGGC
GCGGTGGGCG ACGCCGTGCC GCTGTTCGAG CAGACCCTCG CCGCTCGCGA ACGGGTCCTC
GGCCCCGAGC ATCCCGCCGC CCTCGGCACC CGCCACCAGC TCGCCGTCAC CCATCACCAG
GCCGGACGGC TCGACGACGC CACGCGCGAG TTCGGCCGCG CGCTGACCGG CCGCGAGCGC
ACCCTCGGCC CCGACCACCC CGACACCCTC GAAACGGTCG ACGGACTCGC CCGCGCCCAC
CTGTACGCCG GACGGCTCGA CGACGCCGCA CGCGAGTTCG GCCGCGCGCT GACCGGCCGG
GAACGCGCCC TCGGCCCCGA CCACCCCGAG ACCACCGAGA CCCGCGAGAG CCTCGCCACC
GCCCACCTCA GGCTGGGCCG GCCCGGCGAG GCCGTCCCCC ATCTCGAACG CGCCCTCGAC
CGCCACGAGC ACGTCCTCGG CCGCGCGCAC CCTTACACTG TCGAGGCCCG GAACGCCCTC
GCGACGACCT ACCGGCGCAC CGGGCAGCTT GAGGCGGCCG TGCCGCTGCT CGAACGGCTG
CTCGCCGACC GGAGCCGGGC GCACGGCGTC GGCGACGCGC GCACCCTGCG CACCGCCGAC
GGGCTCGCCG AGGTCTACCG GACCACCGGC CGGCTTCCGC AGGCCGTCGG CCTGAACGAA
CGTGTCCTCG CCGTCCGCGA GCGGCTCCTC GGCCCCGGCC ACCCCGACAC CCGCGCCACC
CGGGGCGCCC TCGCCGACAC CTACCGCCAG GCCGGCCGGC CCGCCGACGC CGTCCCCCTC
TACCGCGCCG CGCTCACCGA CGCGCTGCGC GAGCACGGCC CGTTCCATCC GGACAGCACC
CGGGCCCGCC GCACCCTCGC CGACACTGTC GGCGAGACCC GCCACGATCA GCCGCCGCCC
CTCGAACGGC CGATCCCGGT GGAGCCGAGA TTCCGCCACC GCGACCCCTG A
 
Protein sequence
MTGGGDGHGA PDCFVSYADD GRAWAEWIAG TLEDAGYQVL LESWAPAGTH RVGWLDRAVS 
QARHTIAVVS DGYLRSSPAV AEWAAAWSAD PTGGERRLLV TRVADRPLPG LLGQLVPVDL
FDRSEIAVRA SLLAALRGDR PQPEQPPGFP GRRGIFPPEL PAVWNVPDQP ARFVGRTAAL
ARLDEALSRS PLVTVTGIAG IGKTSLVTEY VRQHRADLDA VWWVPAERPE LIGERVRELA
PAVGLPGHAE PAAVIANLGR ADGRWLIVLD DAADADTLPD WLRPTEPGRL LLTSRNPGWD
HLGPVVPVGP MDRAESVTLL AGRLPAVERT VADRIAERLG DHPLALDQAA HRISTGRTPA
ETYLQVLIDR PEVLLAQGEV SGRPGATAAT LWDEPIRRLD AEAPAAGHLL RIVAHGDNEP
IPIRLFTAEP DVIPDSGLRA AAGDPLALAD TVGVLERYGL AHRDADTVTM HRLVRAAVHT
HTGPDQADEI VATMARMLRA SLPDAVTANP EAWPAWRELL PHTLAVLDAT ATDSTDADST
DADGPEADGP EADGPEAAWL AERSAAYLLG QGRADQALPL AARALAARER LDGPVHVDTL
ACRETLARVA LGAGRIDAAG RLAEHTVADR ERILGPEHPD TLVSRDTLAQ LFQKVGQTDR
AVEMFQRTLA TRERILGPEH PDTLEGRHSL GRAYDAAGRD DEAARLLRDT LADQRRILGP
EHPATLDTRH SLAVAYRRIG AVGDAVPLFE QTLAARERVL GPEHPAALGT RHQLAVTHHQ
AGRLDDATRE FGRALTGRER TLGPDHPDTL ETVDGLARAH LYAGRLDDAA REFGRALTGR
ERALGPDHPE TTETRESLAT AHLRLGRPGE AVPHLERALD RHEHVLGRAH PYTVEARNAL
ATTYRRTGQL EAAVPLLERL LADRSRAHGV GDARTLRTAD GLAEVYRTTG RLPQAVGLNE
RVLAVRERLL GPGHPDTRAT RGALADTYRQ AGRPADAVPL YRAALTDALR EHGPFHPDST
RARRTLADTV GETRHDQPPP LERPIPVEPR FRHRDP