Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_2745 |
Symbol | |
ID | 5671136 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | + |
Start bp | 3248284 |
End bp | 3250809 |
Gene Length | 2526 bp |
Protein Length | 841 aa |
Translation table | 11 |
GC content | 74% |
IMG OID | 641241657 |
Product | Type IV secretory pathway VirD4 protein-like protein |
Protein accession | YP_001507077 |
Protein GI | 158314569 |
COG category | [U] Intracellular trafficking, secretion, and vesicular transport |
COG ID | [COG3505] Type IV secretory pathway, VirD4 components |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 3 |
Plasmid unclonability p-value | 0.0325821 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 15 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGAGATCG TGACCATGGA CCTGCTCGCC GCGGCGGCCA CCACGCCGTC GTCGCCACTC ACGACCTACC TGACCGACCC CGCCGGCTTC CTCCACCAGC TGCTCGGCCA CCTACGTGCC TGGGCCGTGG TCTGGGGACC CGTCGCCGGC CCGCTGGTCG CTCTCACCGC CGCCGGCCTG CTCACCCTGC GCCGGCGGCT ACGCCGCCGC TACCAGCAAC GACTCACCGC CGGCGCCCGC CTCGTGACCG TCCTGGCCCC GCCCACCGTC GACCCGGCGG GCGCGGGCGC GCTGTGGGCG AACCTGCTCG GCCTGCTGCG CCCGTCCTGG CGACGGCTGG TCGGCCAGCC GCACCTCGTG TGGGAGTACC TGTTCGACGC CGACGGGGTC CGCATCCAAA TCTGGGTCCC CGGTGTGGTG CCCGAGGGCT TCGTGGAGAG GGCCGTCGAG GCGGCCTGGC CCGGTGCCCA CACCCGCACC ACCCCTGCCC GAGCGCCGCT GCCCGTCCTG GCCCGGCCCG GCCGGCGGCT GCTGGCCGCA GGCGGGGAAC TCCGCCTGGG CCGGCCGGAA GCGCTCCCGA TCCGTACGGA TCATGACGTT GACCCGGTCC GTGCACTGCT CGCCGCGCCC GGCGGGCTGG CGCGCACCCA GCGGGCGGTC GTGCAGATCC TGGCCCGGCC GGTCACCGGC CGCCGCGTCG CCAAAGCCCG CCGGTCAGCC CGCCGCGTGC GTGCTGGCGG CTCGGCCACC CTGCTCGGCG GGCTGCTCGA CCTGCTCACC CCCCACACGG GCCGAACCCG GCGCACTCGG CGGACCCCCG CACCGACGAA GGTCGATCAT CAGACCTCGC TCGCGCTGTC GGCGGAGGAC CGCGCGATCG TCACGAAGGG TCGCGGCGCC CAGTTCGAGG TCCGTGTCCG CTACACCGTC GTCGCCGTTC TCGACGACAC CGCTGACGAG GACACCGCCG CGCGGGTCGG CGGGCAGCTG CGCGGCCGGG CACACGCGAT CGCCTCGGCG TTCGCGGCCT ACGGCGAGCA CAACTACTTC CAACGCGCCC GGGTGCGCCG CCCGCTGCCC GTCCTCGCCG CGCGCCACCT CGGCCGCGGG GACCTGCTGT CCGTCGCCGA GGTCGGCGTG CTCGCCCACC TGCCGGTCGA CGAGGCGACT CCCGGCCTGC AACGCGCCGG CGCGAAAGCC GTCGCCCCGC CACCCGGCGT CGCTGGCGCC GCGCCGAATG TCCGTCCACT CGGCCGTTCC GACGGTGGAC ATGCCCGCCC GGTCGGTCTG CGGGTTCCCG ACGCCCGCCA CCACCTACAC ATCCTCGGCG CCACCGGCTC CGGCAAGTCC GAACTGCTCG CCCGCATGAC CTTGGACGAC GTCGCCGCGC GCCGGGGCGT GGTCAACGTC GACCCGAAGG GCGACCAGGT CATCGACATC CTTGCCCGCT ACCCCCTCGA CGCCGTCGAC CGCCTCGTCC TGTTCGACGC CGAATCGTCG GGCCGGCCGC CGTGCCTCAA CCCGCTTGAC CAGCCCGACC GGACACGCGC CGTCGACAAC CTGGTCTCCA TCTTCTCCCG GGTCTACCAC GAGTCCTGGG GCCCACGGAC CGACGACATC TTCCGCGCCG GACTGCTCAC CCTCGCCGCC CAACCCGAGG TCCCTGTCCT GACCCAGCTA CCCCGGCTCC TGACCGACGG CGCCTACCGG CAGCGCCTCG TCGGCGAGAT CAAAAAAGGC GACGGCAACG ACATCCTGGC CGGCTTCTGG CAGTGGTACG AAGCACTCTC CGAACCCGCG CAGGCGCATG CCGTCGCCCC GCTGATGAAC AAACTCCGCG GGTTCCTACT ACGGCCGTTC GTGCGCGCCG CGATCGCCGC CGGCCCCTCG ACGGTGGACA TGGACACCGT GCTCAACGAC GGCGGGGTCT GCCTGGTCCG CATCGCCCAA GACGCCCTCG GGGTCGAGAC CGCCGCGCTC ATGGGCTCCA TCGTCGTGTC CGCCGTCTGG CAGGCCACCA CCCGGCGCGC CCGCATGCCC CAGGGAAAGC GACCCGACGC CAGCCTGTAC TTGGACGAAG CACACAACTT CCTCACGCTT CCGTACGCGC TGGAAGACAT GCTCGCCGCC GCCCGCGGCT ACCGACTCGG GATCACTCTC GCGCACCAGA ACCTCGCCCA GCTGCCCCGG CACCTGGAAG AAAGCATCGC CGCGAACGCC CGCAGCAAGA TCTACTTCAC GATGTCGCCC GCGGACGCGA AACGGCTCGT CCGCCACGTC GAGCCTCGCC TGTCTGAGCA CGACCTGGCC AACCTCGGCC GCTTCCATGC CGCCACCCGC CTCGTCGTCG TCGGCGAAGA GGCACCGGCG TTCACGCTGC GCACCGAGAA GCTCCCCGCC CCGGTACCGG GCCGCGCCGC GCAGATCCGC CGCGAGCTGC GCCGCCGCGC GCCCACCCCC ACACCGACGC CGCCGGACCC GGCGGCCCCG CAGCCGCAGG CCGACCCCCG CCGATCCGCC CGCTGA
|
Protein sequence | MEIVTMDLLA AAATTPSSPL TTYLTDPAGF LHQLLGHLRA WAVVWGPVAG PLVALTAAGL LTLRRRLRRR YQQRLTAGAR LVTVLAPPTV DPAGAGALWA NLLGLLRPSW RRLVGQPHLV WEYLFDADGV RIQIWVPGVV PEGFVERAVE AAWPGAHTRT TPARAPLPVL ARPGRRLLAA GGELRLGRPE ALPIRTDHDV DPVRALLAAP GGLARTQRAV VQILARPVTG RRVAKARRSA RRVRAGGSAT LLGGLLDLLT PHTGRTRRTR RTPAPTKVDH QTSLALSAED RAIVTKGRGA QFEVRVRYTV VAVLDDTADE DTAARVGGQL RGRAHAIASA FAAYGEHNYF QRARVRRPLP VLAARHLGRG DLLSVAEVGV LAHLPVDEAT PGLQRAGAKA VAPPPGVAGA APNVRPLGRS DGGHARPVGL RVPDARHHLH ILGATGSGKS ELLARMTLDD VAARRGVVNV DPKGDQVIDI LARYPLDAVD RLVLFDAESS GRPPCLNPLD QPDRTRAVDN LVSIFSRVYH ESWGPRTDDI FRAGLLTLAA QPEVPVLTQL PRLLTDGAYR QRLVGEIKKG DGNDILAGFW QWYEALSEPA QAHAVAPLMN KLRGFLLRPF VRAAIAAGPS TVDMDTVLND GGVCLVRIAQ DALGVETAAL MGSIVVSAVW QATTRRARMP QGKRPDASLY LDEAHNFLTL PYALEDMLAA ARGYRLGITL AHQNLAQLPR HLEESIAANA RSKIYFTMSP ADAKRLVRHV EPRLSEHDLA NLGRFHAATR LVVVGEEAPA FTLRTEKLPA PVPGRAAQIR RELRRRAPTP TPTPPDPAAP QPQADPRRSA R
|
| |