Gene Information |
Locus tag | Franean1_4251 |
Symbol | |
ID | 5672606 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | - |
Start bp | 5065935 |
End bp | 5068634 |
Gene Length | 2700 bp |
Protein Length | 899 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 641243124 |
Product | tetratricopeptide TPR_4 |
Protein accession | YP_001508541 |
Protein GI | 158316033 |
COG category | |
COG ID | |
TIGRFAM ID | |
| |
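The coordinate and length fields in the Gene Information table can be cross-checked with simple arithmetic. The sketch below is only an illustration, not part of the record: it assumes that Start bp and End bp are 1-based inclusive coordinates and that the last codon of the CDS is a stop codon that is not counted in the protein length (the gene sequence in this record ends in TGA). The function and variable names are hypothetical.

```python
# Values copied from the Gene Information table of this record.
start_bp = 5065935
end_bp = 5068634
reported_gene_length = 2700      # bp
reported_protein_length = 899    # aa


def gene_length(start: int, end: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumption)."""
    return end - start + 1


def protein_length(cds_length_bp: int) -> int:
    """Number of amino acids encoded by an unspliced CDS.

    Assumes the CDS length is a multiple of 3 and that the final
    codon is a stop codon, which does not encode an amino acid.
    """
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


computed_gene_length = gene_length(start_bp, end_bp)
computed_protein_length = protein_length(computed_gene_length)

print("gene length matches:", computed_gene_length == reported_gene_length)
print("protein length matches:", computed_protein_length == reported_protein_length)
```

Under these assumptions both computed values agree with the reported Gene Length and Protein Length fields.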
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.553547 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 11 |
Fosmid unclonability p-value | 0.492439 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | TTGCAGGGTT GGAATGACGC CGCGGGCGTG AGTGTTGGCT ATTTCATTTC CTACGCGGGG TCTGACCGGT TGTGGGCGGA GTGGGTCGCG GCTGAGCTCG AGACGGCAGG TGAGACCGTG GTGCTGCAGG CGTGGGACGC GGTTCCCGGC GAAAACATTG TCGTCTGGAT GAGCCGCTCC ATGGCGGCGG CTCGGCGGAC CATCGCGTTG TACTCGCCCT CCTATTTTGA ATCGAGCTGG TGTACGGCGG AGTGCACGGT GGCGTTGAGT CGGCAGGTGC TGCTGCCGTT CAAGGTAGCG GAATGCGACC CGCCCGCGGT GCTTGCCGCA ATCGGGCACA TCTCGCTCCA CGGGGTGGAC GAGGCCGCCG CACGACGGAA ACTGCTGCGA GCCGCGGGCC TGGAGGAGAC CCCACGGAGG TTCGACGGTC GATTTCCTGG CGGGTCAGCG CGCCGGGCGA GCGCGGGGAA CGATGCCGAC GAAGCACCTG TCGTGCCGTT TCCTGGGTCC CTGCCCAAAA TGTGGAATGT GCGTTGGCGC CGTCCCACGT GGTTCGTGGG TCGGGACGCG ATGCTTACGG GCATGTACGA CAGGTTTCGG GCAGCGGGTG TCGACAGAGT CAGCTCCCAG GTGGTGATCG GGATCGGCGG AGTCGGGAAG ACGCAGCTCG CGGTCGAGTA CGCCTACTGT TTTGCGGCCC GGTACTCACT GGTCTGGTGG GTGGATGCGG CTGCGTCGGC GGCGGTGGTG GAGTCGTTCC ACGGGCTGGC CGACGCGCTC CGGTTGCCCG ACGACCCCGA TGTTGAACGG CGGGCCAGGC GGGCACTGGC ATCGTTGCGT GACCGGACCG GCTGGCTGGT CGTGTTCGAC AACGTCGAGG ACCGTCACCT GCTCGCCGAC TGGTGGCCCG TCGCCGGCCG GGGAGACGTC CTGGTAACGG GCCGTAGTCG CATGTTGGGA GAGTTCGGCG AGATACGCAC TGTCGTACCG TTCTCACCCG ACGAGGCGGC GTCGCTGCTG CGGTGCCGCG CCGACCACCT CTCGGAGTCC GACGCCCTAC GCGTTGCAGA GGTGCTCGGC CACCTTCCGC TCGCCATCAG CCAGGCCGCG GCCTACCTCG CCACGACCGG GGTCAGCGCC GACGATTACC TCGAGCTGGT CGCGACAGCT GTGTCGACCG CGTTCGCCGA CAGCCCGTCC GACTACCGGG CCGGCCTGCT GGGCTCGGTG GCCACCGCGA TGGACCGGCT CGTGCGCAAC GACCCACCGG TTGCGCAGGC ACTGCGTCTC GCGGGGTTCC TCGCTCCAGC GCCGCTGGCA TCACACGTAC TGGACGCCGT CACGGCGGCG GTTCTGCCTG AGCTGCCACC GGTCGTCGCG AGGACCCGGG TACTTCGCGG CATCGACACG TCCGCGCTGG CGCAGGTCTC CAGCGGGACC TTCGAGCTGC ACCGGCTCAC CCAGGCCGTG CTGCGCGCCC AACTCGCCGT CGTGGACCGG GAACGGACGA TCGCGCAGGC CACGGACGTC CTGCTCGCGG CGGCGCCGGC CGACGCTGGC GACCCCGCTA CGTGGCCCGT GTTCGCTGAA CTCGCGGCGC ATGTGCCGGT GCTGTTCCGG TATGTGGACG GCGGAGGCCG CCCGGCGTTG CGTGAACTGG TGCTGGCGGT CGTCGACTAC CTGACGAGGA CCGGCCAGCA CGCCGCGGCG GTCCGTCTGG CGGGCACGGC GGTCGACACG TGGACGCGGC TTGGCGGGCT GGACAACCTT GATCGGCTCG CCGCCGCGCA CCGCCAGGGC 
GAGGCGCTAC GCGGGGCAGG GCGCTTCGGC GAAGCCGAGG TCGTCGACCG CGACACCCAT GCGCGGCGCC TGCGGGTCCT CGGCGCCCAG AACCGGGAGA CGTTGCGTTC AGCGGGCGCG GTCGGACTCG ACCTGCGGGG TGTCGGGGAC CGGGCCGGTG CCCGTGACTG GAACACCGCG GCGCTGGCTA CCGCGCGTGC CGTCCTGGGC GGTGATGACC CGCAGACGCT GGAGATCGCC GGCAGTCTTG CCCTTGACCT GCACGGTCTC GGGGAGGTAG CGGCGGCACG TGAACTTGAT GAGGAGGTCC TCGCCGGGCG ACGTGCCGTG CTGGGTGAGA CTCACTGGCA GACTCTGTCG TCGGCCCGCA ACCTCGCCCG AGATCTGCGC GCTCTCGGCC TGCAGGAGCA GGCCCGCGAC CTGGCGCAGT GGACCTTGGA GACCTCGCTT CGGGTACTCG GGGCGGACCA CCCCGACACA CTGCTGGCCG CCAGCAGCCT CGCGGTGCTG CACTACGTCC TCGGCGACCT CGAAGCGGCC CGTGATCTGC ACCAGGACTC GCACAGCAGG TCGAGTCGGG TCCTTGGCCC GGACCATCCG CATACTCTGC GCATCGCGAA CAGTCTCGCC GTTGACCTGT TCCGGCTCGG TGACCTGCAG GCCGCGCACG ACCTGCACCG TGACACCTTC GACCGGCTTC GCCGCGCCCT CGGTGACGAC CACCCGGAAA CCCTGCACGT GGCCCACAAC CTTGCCCGGG ACCTGGGCGG GCTCGGCCGG TACGACGATG CCGTCCGACT CCTCGAGGAC ACCCTCCGCC GCCGCAGATC CGTGCTCGGA TCCGAACACC CTGAGACCCG CCGCACCGAA AGACGCCTCG CCAGAACTCG CGGCAGGTGA
|
Protein sequence | MQGWNDAAGV SVGYFISYAG SDRLWAEWVA AELETAGETV VLQAWDAVPG ENIVVWMSRS MAAARRTIAL YSPSYFESSW CTAECTVALS RQVLLPFKVA ECDPPAVLAA IGHISLHGVD EAAARRKLLR AAGLEETPRR FDGRFPGGSA RRASAGNDAD EAPVVPFPGS LPKMWNVRWR RPTWFVGRDA MLTGMYDRFR AAGVDRVSSQ VVIGIGGVGK TQLAVEYAYC FAARYSLVWW VDAAASAAVV ESFHGLADAL RLPDDPDVER RARRALASLR DRTGWLVVFD NVEDRHLLAD WWPVAGRGDV LVTGRSRMLG EFGEIRTVVP FSPDEAASLL RCRADHLSES DALRVAEVLG HLPLAISQAA AYLATTGVSA DDYLELVATA VSTAFADSPS DYRAGLLGSV ATAMDRLVRN DPPVAQALRL AGFLAPAPLA SHVLDAVTAA VLPELPPVVA RTRVLRGIDT SALAQVSSGT FELHRLTQAV LRAQLAVVDR ERTIAQATDV LLAAAPADAG DPATWPVFAE LAAHVPVLFR YVDGGGRPAL RELVLAVVDY LTRTGQHAAA VRLAGTAVDT WTRLGGLDNL DRLAAAHRQG EALRGAGRFG EAEVVDRDTH ARRLRVLGAQ NRETLRSAGA VGLDLRGVGD RAGARDWNTA ALATARAVLG GDDPQTLEIA GSLALDLHGL GEVAAARELD EEVLAGRRAV LGETHWQTLS SARNLARDLR ALGLQEQARD LAQWTLETSL RVLGADHPDT LLAASSLAVL HYVLGDLEAA RDLHQDSHSR SSRVLGPDHP HTLRIANSLA VDLFRLGDLQ AAHDLHRDTF DRLRRALGDD HPETLHVAHN LARDLGGLGR YDDAVRLLED TLRRRRSVLG SEHPETRRTE RRLARTRGR
|
| |