Gene Information |
Locus tag | Franean1_2744 |
Symbol | |
ID | 5671135 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | + |
Start bp | 3246372 |
End bp | 3248264 |
Gene Length | 1893 bp |
Protein Length | 630 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 641241656 |
Product | Type IV secretory pathway VirB4 protein-like protein |
Protein accession | YP_001507076 |
Protein GI | 158314568 |
COG category | [U] Intracellular trafficking, secretion, and vesicular transport |
COG ID | [COG3451] Type IV secretory pathway, VirB4 components |
TIGRFAM ID | |
Plasmid Coverage information |
Num covering plasmid clones | 3 |
Plasmid unclonability p-value | 0.00908166 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
Fosmid Coverage information |
Num covering fosmid clones | 15 |
Fosmid unclonability p-value | 1 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | ATGAGCAGAC GATCCCGACG CCGCACCTCG GCGCAGGCCC CCGGCCCGTC GGCAGGGCCG CTGGACGCCG CAGCGGCGGC CTTTGTCCCG GACGCGCTGT CGATCGCGCC CCGCCACCTG GACGTGGGCG GGGACTTCCT TGCCACGATG GCGATCACCG GCTATCCGCG TGAGGTCCAC GCGGGCTGGC TCGCCCCGCT GGTGACCTAC CCGGGCCGGG TGGATGTCGC GGTGCATGTC GAGCCGATCG ACCCGGTCAC CGCGGCGAAC CGGCTCCGCC GGCAGCTGGC GAAGCTGGAG TCCGGCCGCC AGCTCGGCGA TGAGAAGGGC CGGCTGGTCG ACCCACAGGT CGAGGCGGCG ACCGAGGACG CCTACGACCT GTCCGCCCGC GTCGCCCGCG GCGAAGGGAA ACTCTTCAGG CTTGGTCTCT ACCTGACCGT CCACGCGTCC AGCGAGAACG AGTTGGCTGA CGAGGTCGCG GCGGTGCGGG CGCTCGCAGC CAGCCTGTTG CTGGATGCCA AGCCGGTCAG CTACCGGTCG CTGCAAGGCT GGGTGAGCAC CCTGCCGCTG GGGTTGGACC AGGTGCGGAT GCGCCGCACC TTTGACACCA CCGCCCTCAG CGCGGCGTTC CCGTTCACCT CACCTGATCT GCCGCCGCCC GACCCGACCT CGCTGGCCCC GACCGGGGTG CTCTATGGGC TCAACGTCGC CAGCAACGGG CTGGTGCACT GGGACCGGTT CGGGGATGTC GACAACCACA ACGCGGTCAT CCTCGGCCGC TCAGGTGCCG GCAAGTCCTA TCTGGTCAAG CTCGAACTCC TGCGCTCGCT GTACCGGGGC ATCGAGGTCC ACGTCGTCGA CCCGGAAGAC GAATACGCCC GGCTCGCCGC CGCGGTCGGC GGGACCTACC TGCATCTCGG TGGCGACGGG GTACGGATCA ACCCGTTCGA CCTGCCTATT CAGACCACTC CCGACGGCCG GCGGACCGCG CCGCGCGACG CGCTGGTGCG GCGCAGCCTG TTCCTGCACA CCGTGATCTC CGTGCTGGTC GGAGAAATGA CGGCGGCCGA ACGGGCAGCC CTCGACGCGG CGATCACCGC CACCTACCAG GCGGCCGGAA TCAGCTCCGA CCCGCGCAGC TGGAACCGGC CAGCACCCCT GCTGACCGAC CTGGCCGACC TGGCCGCAAC CCTGGCCAGC TCCACCGGCC CGGCGGCGGC GCTCGCCGCC GGGCTGTACC CCTTCACCCA GGGGGCGTTC TCCGGCCTGT TCGACGGCCC CACCAGCGCG CCCGGCGACG GCCGCCTGGT CGTCTACTCG CTGCGCGACC TGCCCGACGA GCTCAAAGCC ATCGGCACCC TGCTCGTCCT GGACGCGATC TGGCGGCGGG TGTCCAACCC CGCCGACCGC CGGCCCCGCC TGGTCATTGT CGACGAGGCG TGGCTGCTCA TGCGCCAGCC CGCCGGTGCG GACTTCCTGT TCCGGATGGC GAAGTCCTCC CGGAAGTACT GGGCCGGGCT CACCGTCGCC ACCCAAGACA CGGCTGACGT GCTGGCCACC GATCTCGGCC GGGCGATCGT CACGAACGCC GCCACCCAGA TCTTGCTACG GCAGGCACCG CAGGCGATCG ATGAGATCAC CGCCGTATTT GACCTGTCCC AGGGCGAACG GCAGTTCCTT TTGGCCGCCG ACCGCGGACA GGGACTCCTC GCGGCCGGGG CACAGCGAGT CGCCTTCCAG TCCCTCGCCT CCGGGGTCGA ACACGCCCTG ATCACGACAA ACCCGGCCGA ACTCGCGGCA 
GACACCGATG GGGCCGACGA CGGCTTCTTC GATCTCGCCA TATCAGACGA CCCGATCGAT CCCGACGGTC AGATCTACCT CGACCCCGCC TGA
Protein sequence | MSRRSRRRTS AQAPGPSAGP LDAAAAAFVP DALSIAPRHL DVGGDFLATM AITGYPREVH AGWLAPLVTY PGRVDVAVHV EPIDPVTAAN RLRRQLAKLE SGRQLGDEKG RLVDPQVEAA TEDAYDLSAR VARGEGKLFR LGLYLTVHAS SENELADEVA AVRALAASLL LDAKPVSYRS LQGWVSTLPL GLDQVRMRRT FDTTALSAAF PFTSPDLPPP DPTSLAPTGV LYGLNVASNG LVHWDRFGDV DNHNAVILGR SGAGKSYLVK LELLRSLYRG IEVHVVDPED EYARLAAAVG GTYLHLGGDG VRINPFDLPI QTTPDGRRTA PRDALVRRSL FLHTVISVLV GEMTAAERAA LDAAITATYQ AAGISSDPRS WNRPAPLLTD LADLAATLAS STGPAAALAA GLYPFTQGAF SGLFDGPTSA PGDGRLVVYS LRDLPDELKA IGTLLVLDAI WRRVSNPADR RPRLVIVDEA WLLMRQPAGA DFLFRMAKSS RKYWAGLTVA TQDTADVLAT DLGRAIVTNA ATQILLRQAP QAIDEITAVF DLSQGERQFL LAADRGQGLL AAGAQRVAFQ SLASGVEHAL ITTNPAELAA DTDGADDGFF DLAISDDPID PDGQIYLDPA