Gene Information |
Locus tag | Franean1_3441 |
Symbol | |
ID | 5671812 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | - |
Start bp | 4071324 |
End bp | 4073876 |
Gene Length | 2553 bp |
Protein Length | 850 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 641242329 |
Product | TPR repeat-containing protein |
Protein accession | YP_001507749 |
Protein GI | 158315241 |
COG category | [R] General function prediction only |
COG ID | [COG0457] FOG: TPR repeat |
TIGRFAM ID | |
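The coordinate and length fields above are mutually consistent, which can be checked with a short sketch. It assumes that Start bp and End bp are 1-based inclusive coordinates and that the gene's final codon is a stop codon that is not counted in the protein length. Both are assumptions about how this record is encoded, and the function names are hypothetical helpers, not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record above.
start_bp, end_bp = 4071324, 4073876
length = gene_length(start_bp, end_bp)
print(length)                  # 2553
print(protein_length(length))  # 850
```

Both results match the Gene Length (2553 bp) and Protein Length (850 aa) fields listed in the record.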
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 22 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | ATGGGCGAGG AAGCCGGCTG GGACTTCTTC GTGTCCTATG CGGCGGACGA TGTCGAGTGG GCCGAGTGGA TAGCGTGGCA TCTGGAGGCT GCGGGCTATC GCGTCCTGTG TGAGGCGTGG GACGCCGTGG CCGGATCGAG CCGCGATCGC CTCATCGACG ACGCCGTCGG CAGGTCGGTG AGGACGCTCG CCGTGCTGTC GGCGGCATAT CTTCGCGCAC CTTCGGTGCA GAGCGAATGG CGCGCAGCGT GGCGCAGGGA TCCCGATGGG CTGACCCGCA GGCTGATTCC GGTCAGGATC GATGCCTGTG AACCGGAAGG TCTGCTTGGC GGCGTCGTTC CCATCGACCT GTTCGGACTG GACGAGGCCG GCGCGCTGGC GCGTCTGCAG ACCCGGATAG ACGGCGCCCG CGCGGGTCGG CAGAAGCCCG CTGGTGAGCC ACCCTTCCCC GCCCGCGGCG GGAACGGCCG CCCGGACGGG TCGCAGCCCA CGCCTCGTCC GCCGGCTCCG CAACCAGCTC GTCCAGGACA TGAAACACCA GCCCTGCTGC CCAGGGACAT CCGGGACTTC ACAGGTCGCA GGCGCGAGAT CGCCGAGCTG GACGCGATGC TCGGGCCGGG CACATCGACG ATGGTCATTT CCGCGGTGGA CGGGACGGCC GGCGTCGGGA AAACCACGCT CGCCGTCCAC TGGGGCCACC GGGTCAAGGA CCGGTTCCCG GACGGCCAGA TCTACCTCGA CCTGCGAGGG TACGCACCGA CACTGCCGAT GCAGCCGCGC CGGGCGCTCG GCCTGCTGCT CGGCGCCCTC GGCGTCACGG ATGCCGACAT CCCGATGACG CTGGAGGCGC GCTCGTTGCT GTACCGGCGG CTGACGGGAA GCCGCCGCAC GCTGCTGGTC CTGGACAACG CCCTCGACGT GGAACACGTT CGGCCGCTTC TGCCCGCCGG CACCTGCCTC GCGTTGATCA CCAGTCGGAG CAGACTGAGC GGGCTGGCGG TACGCGACGG AGCCCGCCTG ATCTCGCTGG ACGTCCTCAG CCCGGACGAG TCGACCGCAC TGCTGCGGCA GGTTCTGGGC GAGGTCCGCG TGGAACGCGA GCTCGCGGCG GCCGAGCGGC TCGCGCAGCT GTGTGGACAC CTTCCGCTGG CGCTGCGGAT CGCGGCGGTC CGACTGCTGA CGAGGCCTGG CTTCGACATC GCGGACGCCG TCTTCGAGCT GGCCGACGAA AACGCGCGGA TCGACGTGCT ATCGCGGGAC TCCGACGAGC ACGCCGCCGT TCGTAGCGTG TTCTCCTGGT CCTATCTGAG GCTGCGCCCG GTCGAGCAGC GAGTGTTCCG GCTCCTTGGT CTCCACCGGG GCCCATCGCT GAGCGACGAC GCCGTCGCGG CGCTTGTCGG CGTGCCCACG GCACAGGCCG GTGCGATGCT CCATACGCTC GTCGCGCTGC ATCTGATCGA GGAGGAACGC CCTCGCCGCT ATCGCATCCA TGACCTGCTC CGGCTGTACG CGGGCGAGCT GTGCACCCGG GACGAGCCCG CGCAGCAGCG GCGGGCTGCC GTCGAGCGCA TCCTCTACTG GTACGTGGCC ACCTGCACCG CCGCCGCGAC CCTCGTGGAG GGGCGGACGT CGGACAAGTC GGCGCCGCTC GCGCCACATG CCGTGGTGCG GCCGCTCGTG TTCGGCTCGG CCAGCCAAGC GCTCGCCTGG TTCGACGACG AGTACGCCAC GATGCTCGAC ATCGCCAACC ACGCGTACAG CAACGAACTC GACGACCTCT GCGGTCGGTT GGCGCTGGCG 
GCGTGGCCCT TCTTCCAGCG CCGCAGCCGT TGGGCGGACT GGATCGAGTT GCAGCAGGTC AGCCTGGCGG GCGCGCGGCG ATCCGGCGAC GAGCAGACCG AGGCCTGGCT GCTCGGCGGC CTCGGCGACG TTCTCGACGA CCAGGAGCGG TACGAAGAGG CCCTGGAATG CCATCAGCGG GCCATCGACA TCCATCGCCG GCTCGGGAAC CCGAAGGGTG AGGCCGTCGC GCGCAACAAC CTCGCGGTCA GCCTCGACAA CCTTGAGCGG TACCCGGAGG CGATCGAGCA CTACACCGCC GTGCTGGCGA TGTTCCGCGA CCTCGGCGAT CTCGCCAACG TCGGAATGGT CCTGAACAAT CTCGGCGCCG CGCACTTCAT GATTGACCAG TTCGATGCCG CGGAGCGTCA CTACCGGGAG GCTCTGGAGA TCCGCCAGGC CCTCGAAGAC GCCTTCGGCG AGGGCATGAC ACTGCACAAC CTCGCCGACG TCGCGGAGGC GCTGGACCGC CTCGATGAGG CACGTGACTG GTACGAGCGC TCCATTCCCC GGCACAGGGC GGCCGGGCAC CTGCGCGGCG AGGCACGGGC ACTGCACTTC CTGGGCCGGG TGAACCAGCG TCAGGGAGAC CTAGAGACCG CCCGCGCCCA CTGGCGCGCG GCACTCGACA TCTTCGAGCG CGTCGGCGAT CCGGAAGCTG ACGACCTGCT GGCGCTGCTG GCCGACACCA GGACGAGCAC GCCGGGCGCG TAG
|
Protein sequence | MGEEAGWDFF VSYAADDVEW AEWIAWHLEA AGYRVLCEAW DAVAGSSRDR LIDDAVGRSV RTLAVLSAAY LRAPSVQSEW RAAWRRDPDG LTRRLIPVRI DACEPEGLLG GVVPIDLFGL DEAGALARLQ TRIDGARAGR QKPAGEPPFP ARGGNGRPDG SQPTPRPPAP QPARPGHETP ALLPRDIRDF TGRRREIAEL DAMLGPGTST MVISAVDGTA GVGKTTLAVH WGHRVKDRFP DGQIYLDLRG YAPTLPMQPR RALGLLLGAL GVTDADIPMT LEARSLLYRR LTGSRRTLLV LDNALDVEHV RPLLPAGTCL ALITSRSRLS GLAVRDGARL ISLDVLSPDE STALLRQVLG EVRVERELAA AERLAQLCGH LPLALRIAAV RLLTRPGFDI ADAVFELADE NARIDVLSRD SDEHAAVRSV FSWSYLRLRP VEQRVFRLLG LHRGPSLSDD AVAALVGVPT AQAGAMLHTL VALHLIEEER PRRYRIHDLL RLYAGELCTR DEPAQQRRAA VERILYWYVA TCTAAATLVE GRTSDKSAPL APHAVVRPLV FGSASQALAW FDDEYATMLD IANHAYSNEL DDLCGRLALA AWPFFQRRSR WADWIELQQV SLAGARRSGD EQTEAWLLGG LGDVLDDQER YEEALECHQR AIDIHRRLGN PKGEAVARNN LAVSLDNLER YPEAIEHYTA VLAMFRDLGD LANVGMVLNN LGAAHFMIDQ FDAAERHYRE ALEIRQALED AFGEGMTLHN LADVAEALDR LDEARDWYER SIPRHRAAGH LRGEARALHF LGRVNQRQGD LETARAHWRA ALDIFERVGD PEADDLLALL ADTRTSTPGA
|