Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_6728 |
Symbol | |
ID | 5675041 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | + |
Start bp | 8180935 |
End bp | 8182908 |
Gene Length | 1974 bp |
Protein Length | 657 aa |
Translation table | 11 |
GC content | 74% |
IMG OID | 641245577 |
Product | Type IV secretory pathway VirB4 protein-like protein |
Protein accession | YP_001510968 |
Protein GI | 158318460 |
COG category | [U] Intracellular trafficking, secretion, and vesicular transport |
COG ID | [COG3451] Type IV secretory pathway, VirB4 components |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.726364 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 0.776114 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGCCGCC GCCGCAACCG CCTACACGTC GCCCCGCAGG GTGCGCCGTC CCGCAGCCGC CGGCCCCTGG TCCCCGGCCG CGGGCAAGGG GCGGGGCAGA TGCTGCCCGT CGAGGGCCCG GCGCTGTTCA CCCCGCCCGC GCTGGTGGTC GATGCCGGGC AGATCGAGGT CAGTGGGATC TGCGCGACCA CGATCACCGT GGTCGGCTAC CCGCGTGAAG TCGGACCCGG GTGGATGGAG CCGCTGCTGG CCTACCCGGG CCGCCTCGAT GTCGCCCTGC ACATCGACCC GACCCCACCC GCGGTGGCGG CGCTGCGACT ACGCCGCCAA CTCGGCCGGC TCGAATCCGG CCGCCGCGCC GACGCCGCGG CCGGCCGTCT CGCCGACCCC GAGCTCGACG CCGCCGCCCA AGACGCCGGG GAGCTGGCCC GTCAGGTCGC TCGTGGTGAG GCACGCCTGT TCCGGACCGG GCTGTACCTG ACCGTCTACG CCGATAGCCG CGAGGAGCTC GCCGAGGAAG CGGCGCGGGT CACCGCGCTG GCGCACAGTC TGCTGCTCAC GGTGCGCCGG GCCCGGTACC GGAGCGTGCA GGGCTGGGTG AGCACCCTGC CCCTGGGCCT GGACCTGCTC CAGATCCGCC GGGCGATGGA CACCCAGGCG CTGGCCGCGG GGATCCCGTT CACCACCCCT GACCTGCCCC TACCCGACGT GGAACGCCCC GGAGCCGCAC CCGTCGTCTA CGGCACGAAC CTGCACTCCG CCGGGATCGT GGCGCATGAC CGGTGGGCGC AGCCGAACTA CAACTCCGTC ACCACCGGCG CCTCCGGCGC CGGCAAGAGC TTCCTGATGA AACTCGACCT CCTGCGCTCC CTCTACCAGG GCGTCGAGGC CGCCATTGTT GATCCGGAGG ACGAGTACGG CCGGCTCGCC GCCGCTGTCG GGGGCACCCG CCTCGCGCTC GGCGAGCCTG GGGTGCACCT CAACCCCCTC GACCTGCCCG CCCACTCCCA TCACGACCCC GACCTCCTCA CCCGCCGCGC CCTGTTCTGC CACACCCTGA TCACCACCCT GCTCGCCGGC ACCGATGACG ACAGCGGGCT GGGAGCCGGG GGCCGGGCCG CGTTGGACGC GGCGATCCTG AGCGCCTACC GCGCTGCCGG GATCACCCAC GACCAGGCCA CCTGGACCCG GCCCGCCCCG CTGCTCGCCG ACATCGCCGC CGCCCTCACC GCCGCTGAGG ACCCGGCCGG CCCAGCACTC GCCGCACGTC TCGCCCCGTT CGTCACCGGC TCGCATGCTG GCCTGTTCGC GCACGCGACC ACCACCAGCC CGACCGGGCA CCTTGTCGTC TACTCCCTGC GGGCGCTGCC CGACGAGCTG AAAGCCGCCG GGACGCTGCT CACCCTGGAC GCGATCTGGC GGACCGTCGC CGACCCGGCC CGGCGACGGC GCCGTCTCGT GGTGGTGGAC GAGGGCTGGC TGCTCATGGC ACACCCCGCC GGCGCCCGCT TCCTGTTCCG CCTCGCGAAG GCCGCCCGGA AGCACTGGGC CGGTCTGGCC GTGGCGACCC AGGACTCCGC CGACCTCCTC AGCTCCGAGC TGGGCCGGGC GGTCGTCGCG AACGCGGCGA CGCAGATCCT GTTGCGCCAG GACCCGACCG TGATCGACGA CCTGCGTCGC GTGCACCGGC TCACCGATGG CGAAGCCACC CAGCTGCTCA CCGCCGGTCC CGGTGACGCC CTGCTCCTGA CTGGGACCGG GCAGCGCACC GCCCTGCACG CCCTCGCGTC CCCCGCCGAA TACGACCTGA TCACCACAGA TCCGGTCGAC ACCACCACCG CCACTCCCAC CGACACGGGC CCACTCGACC CGGCCTGGGC CGAGGACCCT GCCGTCGCGG CGACCGGGCC GGCCCGCGCT CCTCGCCGGC CGGCCGCCCG GCGGCCGGTG GACGACGACG CCGACCCGTT CTAA
|
Protein sequence | MSRRRNRLHV APQGAPSRSR RPLVPGRGQG AGQMLPVEGP ALFTPPALVV DAGQIEVSGI CATTITVVGY PREVGPGWME PLLAYPGRLD VALHIDPTPP AVAALRLRRQ LGRLESGRRA DAAAGRLADP ELDAAAQDAG ELARQVARGE ARLFRTGLYL TVYADSREEL AEEAARVTAL AHSLLLTVRR ARYRSVQGWV STLPLGLDLL QIRRAMDTQA LAAGIPFTTP DLPLPDVERP GAAPVVYGTN LHSAGIVAHD RWAQPNYNSV TTGASGAGKS FLMKLDLLRS LYQGVEAAIV DPEDEYGRLA AAVGGTRLAL GEPGVHLNPL DLPAHSHHDP DLLTRRALFC HTLITTLLAG TDDDSGLGAG GRAALDAAIL SAYRAAGITH DQATWTRPAP LLADIAAALT AAEDPAGPAL AARLAPFVTG SHAGLFAHAT TTSPTGHLVV YSLRALPDEL KAAGTLLTLD AIWRTVADPA RRRRRLVVVD EGWLLMAHPA GARFLFRLAK AARKHWAGLA VATQDSADLL SSELGRAVVA NAATQILLRQ DPTVIDDLRR VHRLTDGEAT QLLTAGPGDA LLLTGTGQRT ALHALASPAE YDLITTDPVD TTTATPTDTG PLDPAWAEDP AVAATGPARA PRRPAARRPV DDDADPF
|
| |