Gene Franean1_2793 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_2793 
Symbol 
ID5671182 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp3301627 
End bp3304701 
Gene Length3075 bp 
Protein Length1024 aa 
Translation table11 
GC content72% 
IMG OID641241702 
Producttetratricopeptide TPR_4 
Protein accessionYP_001507122 
Protein GI158314614 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones31 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGTCCGGGC GGCCGGGCGA CGGCGGATCC GGACAGCCGT CCCGCCGCCA GCGGCCGGCG 
GATCATCGGG CTGGGTGTAC GGGGGCGCAC GTGGAACGCG ACAGGTCGCT GGACGCGGTG
GTGGTTCCAG ATCCGTCCGG CGTGAACTCT CCTGAAGGTC TCGCCGCCGC GTTGGGCGCG
CTGCGCCGCC GCGCCGGCCT GTCGGTCCGC GACATGGAGA TGACCGCGAG GAAGCGTGGC
TTGAGCCTGC CTCGGACCAC GGCGAGCGAC GCGGAGAACT CCGCCAGGCC ACGGCCGTCG
AAGCCGACCG TCAGCGCGTT CCTCGACGTG TGTGATGTGC CGGCGGCAGA GCGGGTGCAG
TGGTTGGCCG CGTGGGAACG CGCCGGCAGC AAGGTGAAGG ACCCACCTGC GGGGTGGGTA
CGCGTGGAGG ACGCCGACCC GTACCGGCTG GGCGTGCACC AGCCGATCCA GGTGGACGGG
GCAAAGGACG ACGATCTGCC GACCTATGTG GCCCGGGACG TCGACGAGGC GGTCGGCGGT
GTGCGCTCCA GGCTCGCGGC GGCGGCCGAG CATGGCGGGC TCGTGTTGTT GGTCGGAGAC
TCCTCGGTTG GTAAGACCCG CACGGCCTAC GAGGCCGTCC GGGCCGAGCT GCCGGGCTGG
TGGCTGGTCC ACCCCGCCGA CGCGGTCGAG GTCGCCGCGC TGGTCGCGCG CCGCCCCCGC
CGGCTGGTCG TCTGGCTCGA TGAGATCCAG AACTACCTCG ACGCCGACCC GGGCCTGACC
GCTGGGGTGC TGCGCGACCT GTTGGACGGG GCGGGTCCGG TGGTGGTGGT TGCCACGATC
TGGCCGTACT GGCACAGCCT CTACACCTCG CTGCCCAGCC GGGACGGCAA CGAGGACCTC
TACCGGGAGC AACGGATGCT GCTGCGCCTG GCCAGAGAGG TACACGTCCC GGGCGCGTTC
AGCGACGGTG AGCAAGGCCG AGCCGAGCTG GCCGCGCAGG CCGATCCCAA ACTGCGGACC
GCGCTGGCGA TGACCGGCTA CGGGCTGACC CAGACCCTGG CCGCCGCTCC GCAGCTGGTC
GCCCGCTGGG AACGGGCCCG AGGCGCTGGC GGGCCGAGCC GTGGGCCCTA CCGGTGGGCA
GTGCTGACCG CCGCGCTGGA TGCCGCCCGC CTGGGCGCCC GCGGACCCCT GCCGACCCGG
CTGCTGGAGG CGGCAGCGCC TGGCTACCTC GACGACCATG AACGGGCGCG TGCGCCCGCC
GACTGGTTCG ACGACGCCCG CGCCTATGCC ATTGACAACA CCACGATGCA CGGCGCCGCC
GCCGCCCTGG AGCCAACCGG CCCCGGCGGC ATGGGCCAGG TCACCGGCTA CACCCCCGCC
GACTACCTCG TCCAGTACGC CACCCGTACC CGTCGCCGCG AGCGGCCACC CGCCAGCCTC
TGGATCGCGC TGCGCGACCA TCTCACCGAT CCCGCGGACG TCCACCGCGT CGCTGATGCC
GCGTTCGACC GCTACCAGTA CGGCGCCGCG GTCCCGCTCC TGCATAAGGC TGCAGATGCT
GGCTACCGGT CCTCGGCCGG TCGGCTCGCC GGGCTGCTGC TCGAGGTCGG GGATGTCGAC
GGCCTGCGTG ACCGAGCCGC GGCCGGCGAC GGCGAGGCGG CCCGCGTGCT CTCCCAGCAG
CGGACCAGAG CCGGGGACCT TGAGGACGCG GCCGATGTCC TGCGGCCGGC TGCCGACGCA
GGCGACTGGC GTGCCGCCGC CCGGCTCATC GACCTGATGC TGGAGGTCGA TGACCTCGAC
GGGCTGGCCG CGCGTGCCGA AAAGGGAGAC AAGGAAGCAG CAGTCGCTCT TGCCCTGCGT
TTAGCCGAAG CCGGGGACGT CGACCAGCTG CGAGCACGCG CCGACACCGG CGATCCGGCG
GCCGCCGGGA CACTGGCCGA AATACTCGCC AAGGCCGGCG ACGTCGATGA ACTGCGCGCC
CGGGCGGGCG CGGGAGACAT CTGGGCCGCT GACTGGCTCG CCGAACTGCT TGTCAACGCC
GGCGACCGTG CCGGGGCGAT CGCCGTTCTG CGTGCCGCCG CCCGCGACGG CGTCGAGGAG
GTCAAGGATG TCGACCAGTT GGCCTGGCTC CGGGCCTGCC TTGCCGAGGT GCTCCTCGAG
ACCGGGGATC ACGACGGTGC GATCGCCGTG ATGCGCGCCG CCGACATGAG CGACCCCGAC
GCCGTTGACG ACCTGGTCGA CGTGCTGGTC TCCGTCGGTG ACCGGGACGG GGCGATGACC
ATCCTGCGTC CGGTCGCCGA CGCGGGTGGC CGGCACGCGG CCACCCATAT CGCCGAGCTG
CTGGTCGAAG GGGGCCACGT CGACGAACTA CGGGACCGTG CCCTCGGTGG CGACCAGGAG
GTCGTCTTCA GACTGGTGGA CGTTCTGCTC GCCGCGGGAG ATCGCGCCAG CGCCATCACC
ATCCTGGCGG CCGGCGCCGA CGCTGGCGAC TGGGAGGCCG CCGACTGGCT CGCCGCGCTC
CTGCTCGAGG ACGGCGATCG TGACGGTGCG ATCACCGTCC TGCGCGATCA CCTCGACGCC
GACGACGGTG GTAACTGGAT CATCCGCCAA CTCGTCAGGC TCCTTCACGA GGCTGAGGAT
CACGAAGGCC TGATGGATGT CCTGCGCGCC CGCACGTCCA CCGGCGACAC CTGGGCGACA
CAGCAGCTCG TTGAGCTCCT GGTCGAAGCT GGAGACGATG AGGGCGCACG TGCTGTCCTT
CGCGCCCGCG CGAATGCCGG TGACGCGCGC AGCGCCTTCC GGCTCGTGAC CATGTTGACT
GCGGCCGGAG ACCGCGCCGG TGCGATCGCC GTCCTGCGCG CCCATGCCGA CGCTGGCAAC
GGAGTAGCCG CATTTGAACT CGCCGCCCAG CTGAGTCGGG ACGGAGAACT CGACGAACTA
AGTGCCCGTG CGACCGCCGA CGACACCTGG GCCGGCGCCC GGCTGCACGC GGTACTGAGC
GCCGATGATG ATCAGCGGCT GCACCGCTAT GGCCTCACGA TGGAGGGGGA GGTCGCCGAC
GGCCCTACCT GGTGA
 
Protein sequence
MSGRPGDGGS GQPSRRQRPA DHRAGCTGAH VERDRSLDAV VVPDPSGVNS PEGLAAALGA 
LRRRAGLSVR DMEMTARKRG LSLPRTTASD AENSARPRPS KPTVSAFLDV CDVPAAERVQ
WLAAWERAGS KVKDPPAGWV RVEDADPYRL GVHQPIQVDG AKDDDLPTYV ARDVDEAVGG
VRSRLAAAAE HGGLVLLVGD SSVGKTRTAY EAVRAELPGW WLVHPADAVE VAALVARRPR
RLVVWLDEIQ NYLDADPGLT AGVLRDLLDG AGPVVVVATI WPYWHSLYTS LPSRDGNEDL
YREQRMLLRL AREVHVPGAF SDGEQGRAEL AAQADPKLRT ALAMTGYGLT QTLAAAPQLV
ARWERARGAG GPSRGPYRWA VLTAALDAAR LGARGPLPTR LLEAAAPGYL DDHERARAPA
DWFDDARAYA IDNTTMHGAA AALEPTGPGG MGQVTGYTPA DYLVQYATRT RRRERPPASL
WIALRDHLTD PADVHRVADA AFDRYQYGAA VPLLHKAADA GYRSSAGRLA GLLLEVGDVD
GLRDRAAAGD GEAARVLSQQ RTRAGDLEDA ADVLRPAADA GDWRAAARLI DLMLEVDDLD
GLAARAEKGD KEAAVALALR LAEAGDVDQL RARADTGDPA AAGTLAEILA KAGDVDELRA
RAGAGDIWAA DWLAELLVNA GDRAGAIAVL RAAARDGVEE VKDVDQLAWL RACLAEVLLE
TGDHDGAIAV MRAADMSDPD AVDDLVDVLV SVGDRDGAMT ILRPVADAGG RHAATHIAEL
LVEGGHVDEL RDRALGGDQE VVFRLVDVLL AAGDRASAIT ILAAGADAGD WEAADWLAAL
LLEDGDRDGA ITVLRDHLDA DDGGNWIIRQ LVRLLHEAED HEGLMDVLRA RTSTGDTWAT
QQLVELLVEA GDDEGARAVL RARANAGDAR SAFRLVTMLT AAGDRAGAIA VLRAHADAGN
GVAAFELAAQ LSRDGELDEL SARATADDTW AGARLHAVLS ADDDQRLHRY GLTMEGEVAD
GPTW