Gene Information |
Locus tag | Franean1_3822 |
Symbol | |
ID | 5672186 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | + |
Start bp | 4540063 |
End bp | 4541799 |
Gene Length | 1737 bp |
Protein Length | 578 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | 641242701 |
Product | Rhs element Vgr protein |
Protein accession | YP_001508121 |
Protein GI | 158315613 |
COG category | [S] Function unknown |
COG ID | [COG3501] Uncharacterized protein conserved in bacteria |
TIGRFAM ID | [TIGR01646] Rhs element Vgr protein |
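The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the coordinates are 1-based and inclusive, and that the unspliced CDS ends in a single stop codon. The function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with inclusive start and end coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the "Start bp" and "End bp" fields of this record.
length_bp = gene_length(4540063, 4541799)
print(length_bp, protein_length(length_bp))  # 1737 578
```

Under these assumptions the results agree with the listed Gene Length (1737 bp) and Protein Length (578 aa).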
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.112104 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 7 |
Fosmid unclonability p-value | 0.0443202 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGGCGACCC CCGAGACTTA CGGCGCGCTG CCCGTCCTCT ACCTGGACGG GAAGCCGGTG CCGCCCGCGA TCAAGGAGAC TATCCTGCGG GTCGTCGTGG ACAGCGACGT CGCGGCGCCG GACGCCTGCC GCGTGGTACT CAATGACCCC GGCCGGGACG TGCTCGCGGC GGCCGGGTTC GACTTCCGCC ACGCGCTGAA GGTCACCGCG CCGCCGAGGG CCACCCCCGA GGGCGGGGGC GCGGAGAAGG TCCTCTTCGA GGGCACCATC TACAGCCTCG GCTTCGGCTA CGACGAGCGG GGCGCGACCG CTGTCGTGGT GGCCTACGAC AGCTCATACG CTCTGTTCAA CGGCGTGCAC ACGGCCACAT ACCACAACGT CACCGACTCC GACCTGGTGA CGAAGATCGC CCGCGAGCTG AGCATAGACA CCGGCACGAT CAATCCGACG ACGGTCGTCC ACGAACACGT CGGCCAGGTC AACGAGACGC ACTGGGACTT CCTCACCCGA AGGGCCAGAG AGGTCGACCA CGTGCTCCGG GTACGGGACA ACAAGATGGA GTTCGTCCGT CCCACCGCCG CCGACGACGC GCCCAGGCCG GGCAACTTCG ACAGCCCGAG CCACCTGCAG CTCACCCCAG GTGGCGACCT GGACGTCTTC ACCGCCCGGG TGACCGCCGC GCAGCAGGTG TCCGAGGTCG AGGTCCGCGG CTGGGACGAC CGGGGCAAGC GCGAGCTGGT GGCCACCGCA CGGGCGAGCA CTCGCGCCGC GCAGATCAAG GACGACCCGG CCGACCTGGG GGCGGGCAAC TCCTCCGCCC GGTACGTCGC TCCCGCCCGG CCGCTCGCGA CGCAGGCCGA GTGCGACGCG ATGGTCGCCG CGGTCGCCGA GCGGATCGCC AGCACCTCGG TCGCCGCCGA GGGCGTGGCC CACGGAGATC CGCGCATCCT CGCCGGTGTG GCGCTGAGCG TGGGCCGGAC CGGGGGGAGC TTCGACGGCA AGCTCACCGT CTCCCACGCC GAGCACGTCT TCGACCACGC CAGCTACCGC ACCCGGTTCA CGGTGAGCGG GCCGCACGAC CGGTCCCTGC TCGGTCTGGC CTCGGCCGCC GGCGCCCGGC AGAGCAGCCC GCTGATCGCC GGTGTGGTGC CGGCCGTCGT CTCCAACATC AACGATCCGG AGTCCCGCTG CCGGGTGCGG GTGAAACTGC CCTGGCTGTC CGCGGACTAC GAGACCGACT GGGCCCGGGT CGCGATCGCC GGCGGCGGCC CGGACCGCGG GATGCTGGTG CTGCCCGAGG TCAACGACGA GGTGCTGGTC GCTTTCGAGC AGGGTGACCC GCGCCGCCCG TTCGTGCTCG CCGGCCTGTA CAACGGTGTG GACGCGCCGC CCTTCGGCGG CGGCGTCGAC ACCGCCGCCG GCACCGTCGT CCGGCGCGGC CTGCGCACCC GGAAGGGCCA CGAGATCGTG GTCAGCGACG CCGACGGCGA CGAGCACGTG GAGATCCGCA CCCGGGACGG CAAGGTGCGG ATCCGGCTCG ACCACGACCA GGGCGGGCTC ACCATCGAGA CCGACGCGGA CATCGACGTC CGGGCCAAGG GGAAACTGTC GCTCACCGCC GAGCAGGACC TGACGATCTC CGCGCGGGGT ACCGGGTCGA TCTCTGCTGA CGCCGGCCTC ACCCTGTCCA GCCGCGCGGA CGTCACAGTG CAGGGAAACC CGATCAAGCT CAACTGA
Protein sequence | MATPETYGAL PVLYLDGKPV PPAIKETILR VVVDSDVAAP DACRVVLNDP GRDVLAAAGF DFRHALKVTA PPRATPEGGG AEKVLFEGTI YSLGFGYDER GATAVVVAYD SSYALFNGVH TATYHNVTDS DLVTKIAREL SIDTGTINPT TVVHEHVGQV NETHWDFLTR RAREVDHVLR VRDNKMEFVR PTAADDAPRP GNFDSPSHLQ LTPGGDLDVF TARVTAAQQV SEVEVRGWDD RGKRELVATA RASTRAAQIK DDPADLGAGN SSARYVAPAR PLATQAECDA MVAAVAERIA STSVAAEGVA HGDPRILAGV ALSVGRTGGS FDGKLTVSHA EHVFDHASYR TRFTVSGPHD RSLLGLASAA GARQSSPLIA GVVPAVVSNI NDPESRCRVR VKLPWLSADY ETDWARVAIA GGGPDRGMLV LPEVNDEVLV AFEQGDPRRP FVLAGLYNGV DAPPFGGGVD TAAGTVVRRG LRTRKGHEIV VSDADGDEHV EIRTRDGKVR IRLDHDQGGL TIETDADIDV RAKGKLSLTA EQDLTISARG TGSISADAGL TLSSRADVTV QGNPIKLN
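Both sequences above are printed in space-separated blocks of ten characters. The following is a minimal sketch of how one might strip that grouping, compute a GC fraction, and translate the gene sequence with translation table 11. The helper names are hypothetical. The start-codon set is a simplified assumption (ATG, GTG, TTG only), and only the first three blocks of the gene sequence are used as input.

```python
BASES = "TCAG"
# Amino acids for translation table 11, with codons ordered TTT, TTC, TTA, TTG, TCT, ...
AMINO = ("FFLLSSSSYY**CC*W" "LLLLPPPPHHQQRRRR"
         "IIIMTTTTNNKKSSRR" "VVVVAAAADDEEGGGG")
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}
# Assumption: a simplified subset of the start codons allowed by table 11.
START_CODONS = {"ATG", "GTG", "TTG"}


def clean(seq: str) -> str:
    """Remove the block spacing and normalise to upper case."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    seq = clean(seq)
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate from the first base, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        codon = seq[pos:pos + 3]
        if pos == 0 and codon in START_CODONS:
            protein.append("M")  # an initiating codon is read as methionine
            continue
        amino = CODON_TABLE[codon]
        if amino == "*":
            break
        protein.append(amino)
    return "".join(protein)


first_blocks = "GTGGCGACCC CCGAGACTTA CGGCGCGCTG"
print(translate(first_blocks))  # MATPETYGAL
```

The output matches the first block of the listed protein sequence, including the initial GTG codon being read as M. If the record is internally consistent, running the same functions on the full gene sequence should reproduce the protein sequence and give a GC fraction close to the listed 72%.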