Gene Information |
Locus tag | Franean1_2548 |
Symbol | |
ID | 5670942 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | + |
Start bp | 3028956 |
End bp | 3030845 |
Gene Length | 1890 bp |
Protein Length | 629 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 641241464 |
Product | Type IV secretory pathway VirB4 protein-like protein |
Protein accession | YP_001506884 |
Protein GI | 158314376 |
COG category | [U] Intracellular trafficking, secretion, and vesicular transport |
COG ID | [COG3451] Type IV secretory pathway, VirB4 components |
TIGRFAM ID | |
|
|
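The coordinate and length fields in the record above can be cross-checked against each other. The following is a minimal sketch, not part of the source record. It assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a stop codon that is not counted in the protein length (the record gives neither convention explicitly). The helper names `span_length` and `coding_length_aa` are hypothetical.

```python
# Values copied from the record above.
start_bp = 3028956
end_bp = 3030845
gene_length_bp = 1890
protein_length_aa = 629


def span_length(start: int, end: int) -> int:
    """Length of a coordinate range, assuming 1-based inclusive positions."""
    return end - start + 1


def coding_length_aa(gene_length: int) -> int:
    """Residues encoded by an unspliced CDS, assuming the last codon is a stop."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length // 3 - 1


computed_length = span_length(start_bp, end_bp)
computed_protein = coding_length_aa(computed_length)

# Compare the derived values with the ones reported in the record.
print(computed_length == gene_length_bp, computed_protein == protein_length_aa)
```

Under those assumptions, both derived values agree with the reported 1890 bp and 629 aa.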
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.0379852 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGCCGAC GAACCCGACG CCGCGCCTCC ATGCAGGCCC ACAGCCCGTC GACACAGCCG GTGAACGCCG CCGCGGCGGC GTTCGTCCCG GACGCACTCT CGATCGCCCC CCGCCATCTC GACGTCGGTG GGGACTTCCT CGCCACCATG GCCATCACCG GCTATCCCCG CGAAGTCCAC GCCGGCTGGC TCGCCCCGCT GCTGACCTAC CCTGGCCGGG TCGACGTCGC CGTGCACGTC GAGCCGATCG ACCCGGTCAC CGCCGCGAAC CGGCTCCGCC GGCAGCTGTC GAAGCTGGAG TCCGGCCGCC AGCTCGGCGA CGAGAAAGGC CGGCTGATCG ACCCGCAGGT GGAGGCGGCA ACCGAGGACG CCTACGACCT GTCCGCCCGC GTCGCCCGCG GCGAAGGCAA GCTCTTCAGG CTTGGTCTGT ACCTCACCGT CCACGCGGGC AGCGAAACCG AGCTCGCCGA CGAGGTCGCC GCTGTCCGTG CGCTGGCCGC CAGCCTGTTG TTGGACGCCA AACCGACCAG CTACCGGTCC CTGCAAGGCT GGGTCAGCAC CCTGCCCTTG GGCCTAGACC AGGTACGGAT GCGCCGCACC TTCGACACCG CAGCGTTGAG TGCTGCGTTC CCGTTCACCA GTCCGGACCT GCCGCCCGCC GACCCCACGT CGTTGGCGGC GACCGGGGTG CTCTACGGGC TCAACGTCGC CAGCAACGGG CTGGTGCACT TCGACCGCTT CGGCGACGTC GACAACCACA ACGCCGTGCT CTTCGGTCGT AGCGGCGCGG GGAAGAGCTA TCTGGCCAAG CTCGAACTGT TGCGCTCGCT GTACCGGGGC ATCGAGGTCC ACGTCGTCGA TCCCGAAGAC GAATACGCCC GACTCGCCAC CGCGGTCGGC GCGACCTATC TGCACCTTGG CGCCGACAAC GTGCGGGTCA ACCCGTTCGA CCTGCCGATC CAGACCACCC CCGACGGGCG GCGGACAGCA CCCCGTGACG CCCTGGTGCG CCGCAGCCTG TTCCTGCACA CCGTCGTCGC GGTGCTCGTC GGTCAGCTGT CCGCGGCTGA ACGGGCAGTC CTCGACGTCG CGATCACCGC CACCTACCAG ACGGCGGGGA TCAGCTCCGA CCCACGCACC TGGAGCCGAC CGGCACCGCT GCTGGCCGAC CTCGCCACGA CCCTGGCCGC CTCCGACGAC CCGGCGGCAG TGACCCTCGG TGCCCGGCTG CACCCGTACA CGGCAGGGGC GTTCTCCGGC CTGTTCGACG GCCCTACCAG TGCGCCTGGC GACGGCCACC TCGTCGTCTA CTCCCTGCGC GATCTGCCCG ACGAACTCAA AGCCATCGGC ACGCTGCTCG TCCTCGACGC CGTGTGGCGG CGGGTGTCCA ACCCCGCCGA CCGCCGACCC CGCATGGTCG TGGTCGACGA GGCGTGGCTG CTGATGCGCC AACCGGCCGG TGCGGACTTC CTGTTCCGGA TGGCCAAGAG CGCGCGGAAG TATTGGGCCG GGCTGACCGT CGCGACCCAG GACACCGCCG ACGTGCTCGC CACCGACCTG GGCAAGGCGA TCGTCACGAA CGCCGCCACC CAGATCCTGC TGCGCCAGGC ACCGCAGGCG ATCGACGAGA TCACCGCCGT GTTCGACCTG TCCCAGGGCG AACGGCAGTT CCTGCTCTCC GCCGACCGCG GACAGGGACT CCTCGCGGCG GGGGCACAGC GGGTCGCCTT CCAGGCCCTG GCCTCGCCCA GCGAGCACCG CCTGGTCACG ACCAACCCCG CCGAACTCGC CGCCGACCCC 
GACGAGGCCG GCGACGACGG CTTCTTCGAC CTCGCCGCGC CCGCTGGCCC GGCCGATGAC GACGGCCAGA TCTACCTCGA CGCCGCCTGA
|
Protein sequence | MSRRTRRRAS MQAHSPSTQP VNAAAAAFVP DALSIAPRHL DVGGDFLATM AITGYPREVH AGWLAPLLTY PGRVDVAVHV EPIDPVTAAN RLRRQLSKLE SGRQLGDEKG RLIDPQVEAA TEDAYDLSAR VARGEGKLFR LGLYLTVHAG SETELADEVA AVRALAASLL LDAKPTSYRS LQGWVSTLPL GLDQVRMRRT FDTAALSAAF PFTSPDLPPA DPTSLAATGV LYGLNVASNG LVHFDRFGDV DNHNAVLFGR SGAGKSYLAK LELLRSLYRG IEVHVVDPED EYARLATAVG ATYLHLGADN VRVNPFDLPI QTTPDGRRTA PRDALVRRSL FLHTVVAVLV GQLSAAERAV LDVAITATYQ TAGISSDPRT WSRPAPLLAD LATTLAASDD PAAVTLGARL HPYTAGAFSG LFDGPTSAPG DGHLVVYSLR DLPDELKAIG TLLVLDAVWR RVSNPADRRP RMVVVDEAWL LMRQPAGADF LFRMAKSARK YWAGLTVATQ DTADVLATDL GKAIVTNAAT QILLRQAPQA IDEITAVFDL SQGERQFLLS ADRGQGLLAA GAQRVAFQAL ASPSEHRLVT TNPAELAADP DEAGDDGFFD LAAPAGPADD DGQIYLDAA
|
| |
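The GC content field can in principle be recomputed from the gene sequence listed above. The sketch below is illustrative only: the record does not say how its 71% figure was computed or rounded, so the function `gc_percent` is a hypothetical helper that counts G and C bases as a share of all bases, after removing the spaces that group the sequence into blocks of ten.

```python
def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide string.

    Whitespace (such as the 10-base block separators used in the
    record) is ignored. Case is normalised to upper case.
    """
    cleaned = "".join(seq.split()).upper()
    if not cleaned:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in cleaned if base in "GC")
    return 100.0 * gc_count / len(cleaned)


# First two 10-base blocks of the gene sequence, used as a small example.
first_blocks = "ATGAGCCGAC GAACCCGACG"
print(round(gc_percent(first_blocks), 1))
```

Passing the full gene sequence string to `gc_percent` would give a value to compare against the reported GC content.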