Gene Information |
Locus tag | Franean1_2793 |
Symbol | |
ID | 5671182 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | + |
Start bp | 3301627 |
End bp | 3304701 |
Gene Length | 3075 bp |
Protein Length | 1024 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | 641241702 |
Product | tetratricopeptide TPR_4 |
Protein accession | YP_001507122 |
Protein GI | 158314614 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
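The coordinate and length fields above are mutually consistent, which can be checked with a short calculation. The sketch below assumes that Start bp and End bp are 1-based inclusive coordinates and that the CDS includes its stop codon; the function name `cds_lengths` is hypothetical and is not part of any database API.

```python
def cds_lengths(start_bp: int, end_bp: int) -> tuple[int, int]:
    """Return (gene length in bp, protein length in aa) for an unspliced CDS.

    Assumptions (not stated in the record itself):
    - coordinates are 1-based and inclusive, so length = end - start + 1
    - the final codon is a stop codon and is not counted as a residue
    """
    gene_length = end_bp - start_bp + 1
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein_length = gene_length // 3 - 1  # drop the stop codon
    return gene_length, protein_length


# Values taken from the record for Franean1_2793
gene_len, protein_len = cds_lengths(3301627, 3304701)
print(gene_len, protein_len)  # 3075 1024
```

Under these assumptions the result matches the reported Gene Length (3075 bp) and Protein Length (1024 aa).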
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 31 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGTCCGGGC GGCCGGGCGA CGGCGGATCC GGACAGCCGT CCCGCCGCCA GCGGCCGGCG GATCATCGGG CTGGGTGTAC GGGGGCGCAC GTGGAACGCG ACAGGTCGCT GGACGCGGTG GTGGTTCCAG ATCCGTCCGG CGTGAACTCT CCTGAAGGTC TCGCCGCCGC GTTGGGCGCG CTGCGCCGCC GCGCCGGCCT GTCGGTCCGC GACATGGAGA TGACCGCGAG GAAGCGTGGC TTGAGCCTGC CTCGGACCAC GGCGAGCGAC GCGGAGAACT CCGCCAGGCC ACGGCCGTCG AAGCCGACCG TCAGCGCGTT CCTCGACGTG TGTGATGTGC CGGCGGCAGA GCGGGTGCAG TGGTTGGCCG CGTGGGAACG CGCCGGCAGC AAGGTGAAGG ACCCACCTGC GGGGTGGGTA CGCGTGGAGG ACGCCGACCC GTACCGGCTG GGCGTGCACC AGCCGATCCA GGTGGACGGG GCAAAGGACG ACGATCTGCC GACCTATGTG GCCCGGGACG TCGACGAGGC GGTCGGCGGT GTGCGCTCCA GGCTCGCGGC GGCGGCCGAG CATGGCGGGC TCGTGTTGTT GGTCGGAGAC TCCTCGGTTG GTAAGACCCG CACGGCCTAC GAGGCCGTCC GGGCCGAGCT GCCGGGCTGG TGGCTGGTCC ACCCCGCCGA CGCGGTCGAG GTCGCCGCGC TGGTCGCGCG CCGCCCCCGC CGGCTGGTCG TCTGGCTCGA TGAGATCCAG AACTACCTCG ACGCCGACCC GGGCCTGACC GCTGGGGTGC TGCGCGACCT GTTGGACGGG GCGGGTCCGG TGGTGGTGGT TGCCACGATC TGGCCGTACT GGCACAGCCT CTACACCTCG CTGCCCAGCC GGGACGGCAA CGAGGACCTC TACCGGGAGC AACGGATGCT GCTGCGCCTG GCCAGAGAGG TACACGTCCC GGGCGCGTTC AGCGACGGTG AGCAAGGCCG AGCCGAGCTG GCCGCGCAGG CCGATCCCAA ACTGCGGACC GCGCTGGCGA TGACCGGCTA CGGGCTGACC CAGACCCTGG CCGCCGCTCC GCAGCTGGTC GCCCGCTGGG AACGGGCCCG AGGCGCTGGC GGGCCGAGCC GTGGGCCCTA CCGGTGGGCA GTGCTGACCG CCGCGCTGGA TGCCGCCCGC CTGGGCGCCC GCGGACCCCT GCCGACCCGG CTGCTGGAGG CGGCAGCGCC TGGCTACCTC GACGACCATG AACGGGCGCG TGCGCCCGCC GACTGGTTCG ACGACGCCCG CGCCTATGCC ATTGACAACA CCACGATGCA CGGCGCCGCC GCCGCCCTGG AGCCAACCGG CCCCGGCGGC ATGGGCCAGG TCACCGGCTA CACCCCCGCC GACTACCTCG TCCAGTACGC CACCCGTACC CGTCGCCGCG AGCGGCCACC CGCCAGCCTC TGGATCGCGC TGCGCGACCA TCTCACCGAT CCCGCGGACG TCCACCGCGT CGCTGATGCC GCGTTCGACC GCTACCAGTA CGGCGCCGCG GTCCCGCTCC TGCATAAGGC TGCAGATGCT GGCTACCGGT CCTCGGCCGG TCGGCTCGCC GGGCTGCTGC TCGAGGTCGG GGATGTCGAC GGCCTGCGTG ACCGAGCCGC GGCCGGCGAC GGCGAGGCGG CCCGCGTGCT CTCCCAGCAG CGGACCAGAG CCGGGGACCT TGAGGACGCG GCCGATGTCC TGCGGCCGGC TGCCGACGCA GGCGACTGGC GTGCCGCCGC CCGGCTCATC GACCTGATGC TGGAGGTCGA TGACCTCGAC 
GGGCTGGCCG CGCGTGCCGA AAAGGGAGAC AAGGAAGCAG CAGTCGCTCT TGCCCTGCGT TTAGCCGAAG CCGGGGACGT CGACCAGCTG CGAGCACGCG CCGACACCGG CGATCCGGCG GCCGCCGGGA CACTGGCCGA AATACTCGCC AAGGCCGGCG ACGTCGATGA ACTGCGCGCC CGGGCGGGCG CGGGAGACAT CTGGGCCGCT GACTGGCTCG CCGAACTGCT TGTCAACGCC GGCGACCGTG CCGGGGCGAT CGCCGTTCTG CGTGCCGCCG CCCGCGACGG CGTCGAGGAG GTCAAGGATG TCGACCAGTT GGCCTGGCTC CGGGCCTGCC TTGCCGAGGT GCTCCTCGAG ACCGGGGATC ACGACGGTGC GATCGCCGTG ATGCGCGCCG CCGACATGAG CGACCCCGAC GCCGTTGACG ACCTGGTCGA CGTGCTGGTC TCCGTCGGTG ACCGGGACGG GGCGATGACC ATCCTGCGTC CGGTCGCCGA CGCGGGTGGC CGGCACGCGG CCACCCATAT CGCCGAGCTG CTGGTCGAAG GGGGCCACGT CGACGAACTA CGGGACCGTG CCCTCGGTGG CGACCAGGAG GTCGTCTTCA GACTGGTGGA CGTTCTGCTC GCCGCGGGAG ATCGCGCCAG CGCCATCACC ATCCTGGCGG CCGGCGCCGA CGCTGGCGAC TGGGAGGCCG CCGACTGGCT CGCCGCGCTC CTGCTCGAGG ACGGCGATCG TGACGGTGCG ATCACCGTCC TGCGCGATCA CCTCGACGCC GACGACGGTG GTAACTGGAT CATCCGCCAA CTCGTCAGGC TCCTTCACGA GGCTGAGGAT CACGAAGGCC TGATGGATGT CCTGCGCGCC CGCACGTCCA CCGGCGACAC CTGGGCGACA CAGCAGCTCG TTGAGCTCCT GGTCGAAGCT GGAGACGATG AGGGCGCACG TGCTGTCCTT CGCGCCCGCG CGAATGCCGG TGACGCGCGC AGCGCCTTCC GGCTCGTGAC CATGTTGACT GCGGCCGGAG ACCGCGCCGG TGCGATCGCC GTCCTGCGCG CCCATGCCGA CGCTGGCAAC GGAGTAGCCG CATTTGAACT CGCCGCCCAG CTGAGTCGGG ACGGAGAACT CGACGAACTA AGTGCCCGTG CGACCGCCGA CGACACCTGG GCCGGCGCCC GGCTGCACGC GGTACTGAGC GCCGATGATG ATCAGCGGCT GCACCGCTAT GGCCTCACGA TGGAGGGGGA GGTCGCCGAC GGCCCTACCT GGTGA
|
Protein sequence | MSGRPGDGGS GQPSRRQRPA DHRAGCTGAH VERDRSLDAV VVPDPSGVNS PEGLAAALGA LRRRAGLSVR DMEMTARKRG LSLPRTTASD AENSARPRPS KPTVSAFLDV CDVPAAERVQ WLAAWERAGS KVKDPPAGWV RVEDADPYRL GVHQPIQVDG AKDDDLPTYV ARDVDEAVGG VRSRLAAAAE HGGLVLLVGD SSVGKTRTAY EAVRAELPGW WLVHPADAVE VAALVARRPR RLVVWLDEIQ NYLDADPGLT AGVLRDLLDG AGPVVVVATI WPYWHSLYTS LPSRDGNEDL YREQRMLLRL AREVHVPGAF SDGEQGRAEL AAQADPKLRT ALAMTGYGLT QTLAAAPQLV ARWERARGAG GPSRGPYRWA VLTAALDAAR LGARGPLPTR LLEAAAPGYL DDHERARAPA DWFDDARAYA IDNTTMHGAA AALEPTGPGG MGQVTGYTPA DYLVQYATRT RRRERPPASL WIALRDHLTD PADVHRVADA AFDRYQYGAA VPLLHKAADA GYRSSAGRLA GLLLEVGDVD GLRDRAAAGD GEAARVLSQQ RTRAGDLEDA ADVLRPAADA GDWRAAARLI DLMLEVDDLD GLAARAEKGD KEAAVALALR LAEAGDVDQL RARADTGDPA AAGTLAEILA KAGDVDELRA RAGAGDIWAA DWLAELLVNA GDRAGAIAVL RAAARDGVEE VKDVDQLAWL RACLAEVLLE TGDHDGAIAV MRAADMSDPD AVDDLVDVLV SVGDRDGAMT ILRPVADAGG RHAATHIAEL LVEGGHVDEL RDRALGGDQE VVFRLVDVLL AAGDRASAIT ILAAGADAGD WEAADWLAAL LLEDGDRDGA ITVLRDHLDA DDGGNWIIRQ LVRLLHEAED HEGLMDVLRA RTSTGDTWAT QQLVELLVEA GDDEGARAVL RARANAGDAR SAFRLVTMLT AAGDRAGAIA VLRAHADAGN GVAAFELAAQ LSRDGELDEL SARATADDTW AGARLHAVLS ADDDQRLHRY GLTMEGEVAD GPTW
|
| |
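The gene sequence begins with GTG, yet the protein sequence begins with M. This is expected under translation table 11 (the bacterial, archaeal and plant plastid code), where GTG is one of the permitted initiation codons and is read as methionine at the first position. The following is a minimal sketch of that rule, not the pipeline used to produce this record; `translate_cds` and the constant names are hypothetical. The amino acid assignments are those of the standard code, which table 11 shares; only the set of start codons differs.

```python
from itertools import product

# Standard codon assignments, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

# Initiation codons permitted by NCBI translation table 11.
START_CODONS_11 = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def translate_cds(dna: str) -> str:
    """Translate a CDS given in frame; spaces in the input are ignored.

    The first codon is read as M if it is a table 11 start codon.
    Translation stops at the first stop codon.
    """
    dna = "".join(dna.split()).upper()
    usable = len(dna) - len(dna) % 3  # ignore a trailing partial codon
    protein = []
    for i in range(0, usable, 3):
        codon = dna[i:i + 3]
        aa = CODON_TABLE[codon]
        if i == 0 and codon in START_CODONS_11:
            aa = "M"  # alternative start codons still initiate with Met
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 nucleotides of the gene sequence above
print(translate_cds("GTGTCCGGGC GGCCGGGCGA CGGCGGATCC"))  # MSGRPGDGGS
```

The output matches the first ten residues of the protein sequence in the record. Passing the last 18 nucleotides (GACGGCCCTACCTGGTGA) gives DGPTW, which agrees with the C-terminal end and confirms the TGA stop codon.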