Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_1381 |
Symbol | |
ID | 5669789 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | + |
Start bp | 1671183 |
End bp | 1673153 |
Gene Length | 1971 bp |
Protein Length | 656 aa |
Translation table | 11 |
GC content | 73% |
IMG OID | 641240307 |
Product | Type IV secretory pathway VirD4 protein-like protein |
Protein accession | YP_001505734 |
Protein GI | 158313226 |
COG category | [U] Intracellular trafficking, secretion, and vesicular transport |
COG ID | [COG3505] Type IV secretory pathway, VirD4 components |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 15 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGCGTGG AACGCGGCCG ATTCGGTGAT GCCGGCTTGG ACGCGGAGAT CGCAGTGGTC GCCGTGGCGC TCCTCGTCGC GTCGTCGGCG GCGGCAGTCT GGGCAGGCGC GCAGCTGGCC GCGTTGGCGT TCGGTGCGCA CCGCCCCCTC GACGTGGGCC TGGGCGACGG GCTGGCCGCG CTACCTCAGC TGATCGCCGA CCCTGCCCAT CCGGCGCACG CCTGGCCGGC GCGCGCCGCA GGCGACCTGC CGGGGCAGCT CGGGTACTGG GCGGCGACCG TGGTGCCGCT CGCGGTACTG GCCATGCTGA GCGGGCTCGC GGCCTTCCTG CTCGTCGGCG ACGGGGTGGG CGTGGCGCGC CGGCGCCGGA TGGGTATCGA TCCCGAAGCC CGGTTCGCCC GCCCGGCGGA TCTGGCACCG CTGTGGCTGG GCGGGCCGAC CCGCGGTCGG ATGATCCTGG GCCGGATCGG TGGCCCCCGG GGACGGCTCG TCGCCACCGA GGACACCAAC CGTCCACTGG ACGCGGCGGT GCCCAGATGG CGAGCACGAC GGGCAGCCCG TCGCCGCGGG CAGCGCGGCA GCGTGATCGT GCTAGGCCCG TCCCAGTGCG GCAAGACCGC CGCCCTGGCG ATCCCCGCCA TCCTGGAATG GGACGGCCCG CTGATCGCCC TGAGTGTGAA GAACGACCTG CTGGGTGCGA CGATCTCCCG CCGTCGGCAG GTCGGCGACG TCGCGGTGTT CGACCCGGCG GGCGTCACCG GCGAACTCGG CGCGCCCTGG TCCCCGCTGG GTGCGGCGCG CACCCTGGCC GGCGCGCGCC GCGCCGCCCG CTCGATCGCG AACGCCACCT CCTGGACGTC GGCCTCGTCG GGCGACATGG GCTTCTGGAC CGCGGCGGCC GAGGACCTCC TCGGCCAGCT GTTCTGGACC GCCGCCGTCG TCGACCTGGG CATGGACACC GTCGTGAGCT GGGTGGTGTC CATGGACAAG GAGACCGTCC GCGGCCTGCT GACCCCGCTG GCCTCCCACC GCGACCCGAC ACTCGCCGCG GACGGTACGC AGGTCCTCGC CGGCTTCCAG GGGATCTGGG CGAACGACCG CCGGCAGATC TCCTCCACCT ACCTGGTGGC CCGGCAGATG ATCCAACCCT GGCAGGAACC GGAGATCGCT GCCTCCGCCA CGGCGTCGCA TCTTGATCTT GAATGGCTCC TCGACACCGG CCCCGACGGG CAGGCTGCGA ACACGCTGTA CCTGAGCGCG GATCTCGACG ACGCAGAACG CCTGGCCCCC GTGCTCGGTG GCCTGCTCGA CGACCTGATG CGCCAGGCCT ACAGCCACGT CGGGCGAACC GGCGTCCCGT TGGACCCGCC GCTGCTGGTG GTCGTGGACG AAGCCGGGAA CTGGCCGATG CGCAACCTAC CGGGACGGAT CTCCACCTGT GCCGGCATCG GTATCCAGCT GGTGCTGGTG TATCAGAGCA AGGCGCAGAT CGACGCCGCC TACGGCCCGA AAGCCGACAT CGTCATCTCC AACGCGGTCA CCAAGGTCTT CTTCGCCGGC CAGTCCGACC GCTCCACGCT CGAGTACGCC GCCGGCCTGC TCGGCCAGGA GCATGTCGTC CAGACGTCCA CCAACGTCGA CAGCACTGGT CTGGTCGGCC CGTCCGGCCG GCGCGGGGTC TCGCGCAGCC CCACCCGGGT GGAGCTGCTG CCCTCCGCGC TGCTGCGGCA GGTCGCCCCC GGCCAGGCGC TGCTCGTCCA CAACACCCTT CCGCCCGCCC ACCTGTTCGG CCGCTACTGG TACCTGGACG AGGACCTGCA CGCACTCGCC ACCGGCCACC GGACCTCCCG ACGCGACCTG GTCCGCCAGG CCGCCCGCCA GCGGATCACT CCCGGGAACA GCCCTGACCG GCCGCCGCCC CCAACCGCCT CTGGGGATGA ACAATCGGCG GCTCCGTGGT GGCCCTGGTG A
|
Protein sequence | MSVERGRFGD AGLDAEIAVV AVALLVASSA AAVWAGAQLA ALAFGAHRPL DVGLGDGLAA LPQLIADPAH PAHAWPARAA GDLPGQLGYW AATVVPLAVL AMLSGLAAFL LVGDGVGVAR RRRMGIDPEA RFARPADLAP LWLGGPTRGR MILGRIGGPR GRLVATEDTN RPLDAAVPRW RARRAARRRG QRGSVIVLGP SQCGKTAALA IPAILEWDGP LIALSVKNDL LGATISRRRQ VGDVAVFDPA GVTGELGAPW SPLGAARTLA GARRAARSIA NATSWTSASS GDMGFWTAAA EDLLGQLFWT AAVVDLGMDT VVSWVVSMDK ETVRGLLTPL ASHRDPTLAA DGTQVLAGFQ GIWANDRRQI SSTYLVARQM IQPWQEPEIA ASATASHLDL EWLLDTGPDG QAANTLYLSA DLDDAERLAP VLGGLLDDLM RQAYSHVGRT GVPLDPPLLV VVDEAGNWPM RNLPGRISTC AGIGIQLVLV YQSKAQIDAA YGPKADIVIS NAVTKVFFAG QSDRSTLEYA AGLLGQEHVV QTSTNVDSTG LVGPSGRRGV SRSPTRVELL PSALLRQVAP GQALLVHNTL PPAHLFGRYW YLDEDLHALA TGHRTSRRDL VRQAARQRIT PGNSPDRPPP PTASGDEQSA APWWPW
|
| |