Gene Information |
Locus tag | Franean1_0118 |
Symbol | |
ID | 5668543 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | - |
Start bp | 140647 |
End bp | 141792 |
Gene Length | 1146 bp |
Protein Length | 381 aa |
Translation table | 11 |
GC content | 75% |
IMG OID | 641239046 |
Product | UspA domain-containing protein |
Protein accession | YP_001504491 |
Protein GI | 158311983 |
COG category | [T] Signal transduction mechanisms |
COG ID | [COG0589] Universal stress protein UspA and related nucleotide-binding proteins |
TIGRFAM ID | |
| |
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 15 |
Fosmid unclonability p-value | 1 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACCGACG GCGCCCTGAC CGGAACGCAG GCGGACCAGA CCCCGGAACG AACCGGCGCG CCGGCGATCG AGCGGCACAG CGGACCTACG GTGGGCACAG CAGCCGACAT AGTGGTCGGG ATCGACGGCT CCCCCGGCTC CGCGGCGGCG TTGACCTGGG CGGTCGCCGA GGCCAGCCGG CGCGGCCTGC GGGTGCGCGC CGTCCTCGGG TCCTGCGCCG ACGAGCAGCC CACCGCTGTA CGCAGGTCCG CCGACGCGAT CGCCGGGCCG CACGACGAGG CCACCTTGGC CTTCGCGGCC AGCCACCTCC TGCATGAGAC GATCGGCGCC GCCCCCATCC CCGCGGGACT CGAGGTCCTC GAGGAGGTGG TGGACGCTCC TGGCGCCGAG GCCCTGCTCA CCGCCGGCCG GGACGCCGCC ATGATTGTCG TCGGAGCACG CGGGCGCGGG CTCCTGCACC GCCTGCGGCT CGGATCGGTG AGTACGTCCG TGGCCGTCCA TTCCCCCGTG CCAGTGGTCG TCGCGCGACT CCCCCGTTCG GGGGATGCCG GCGAGCCCGA TGCGGACGGC CTCGCCGGTG CCGGGCCGGT CGCCGACGAG CGGCTGAGCC CCACGAGCGC ACCGAGGCAG GGCACGCCGC ACCGGCGGCC GGTGGTCGTC GGGGTCGACG GCTCACCCAA CTCGCTGGCC GCGCTGCGGT GGGCCGCGGT CACGGCGGCA CTGCGTGGGG CACCGCTGCA TGTCATCCAC AGTTGGCTCG CCGCGGTCCC CCTCCCGTTC GCCGAGACGT CCGGGGAGAT CGTGCAGGCG CTCGAAGGCC AGGCGCGGGC CGTGCTGGAC GAGTCCATCG AACAGGTCCT CGGCCCGATC CCCGGCGGCG AGCCAGGGGA GCCCGCCGAG CCGGGCGGTA CGGAGCCCGC CGTGCTCCGG CTCGCCGCTC CGGCACCCGG CTCCGGGGAG ATCGACGTCT ACCGTCAGCT GATACCCGCC TCTGCCACCC GGGCCCTCCT CGAGGCCAGC CACGACGCCG ACCTGCTGGT CGTCGGAGCC CGGGGCAAGG GCGGATTCGC CGAGCTCCTC CTGGGCTCGG TCAGCCACCA GACGATGCTG CACTCCGCCG CCCCCGTAGC GATCATCCGG GCCTGA
|
Protein sequence | MTDGALTGTQ ADQTPERTGA PAIERHSGPT VGTAADIVVG IDGSPGSAAA LTWAVAEASR RGLRVRAVLG SCADEQPTAV RRSADAIAGP HDEATLAFAA SHLLHETIGA APIPAGLEVL EEVVDAPGAE ALLTAGRDAA MIVVGARGRG LLHRLRLGSV STSVAVHSPV PVVVARLPRS GDAGEPDADG LAGAGPVADE RLSPTSAPRQ GTPHRRPVVV GVDGSPNSLA ALRWAAVTAA LRGAPLHVIH SWLAAVPLPF AETSGEIVQA LEGQARAVLD ESIEQVLGPI PGGEPGEPAE PGGTEPAVLR LAAPAPGSGE IDVYRQLIPA SATRALLEAS HDADLLVVGA RGKGGFAELL LGSVSHQTML HSAAPVAIIR A
|
| |
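The length fields in this record can be cross-checked against each other. The following is a minimal sketch, not part of the source database: the function names are hypothetical, and it assumes 1-based inclusive coordinates and a CDS that ends with a single stop codon, both of which are consistent with the values listed above.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates (an assumption)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS, assuming one terminal stop codon (an assumption)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


def gc_percent(seq: str) -> float:
    """GC percentage of a sequence; whitespace between 10-base blocks is ignored."""
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for base in bases if base in "GC")
    return 100.0 * gc / len(bases)


if __name__ == "__main__":
    length = gene_length(140647, 141792)   # Start bp, End bp from the record
    print(length, protein_length(length))  # compare with Gene Length and Protein Length
```

Passing the full "Gene sequence" value to `gc_percent` gives a figure to compare against the listed GC content; the function strips the spaces that separate the 10-base blocks.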