Gene Information |
Locus tag | Franean1_1180 |
Symbol | |
ID | 5669593 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | + |
Start bp | 1403454 |
End bp | 1404539 |
Gene Length | 1086 bp |
Protein Length | 361 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | 641240112 |
Product | NusA antitermination factor |
Protein accession | YP_001505540 |
Protein GI | 158313032 |
COG category | [K] Transcription |
COG ID | [COG0195] Transcription elongation factor |
TIGRFAM ID | [TIGR01953] transcription termination factor NusA |
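The coordinate, gene length, and protein length fields above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive (an assumption, though it agrees with the reported 1086 bp) and that the CDS is unspliced and ends in a single stop codon. The function names are illustrative and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Residues encoded by an unspliced CDS, not counting the terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Values taken from the record above.
length = gene_length_bp(1403454, 1404539)
print(length)                     # 1086
print(protein_length_aa(length))  # 361
```

Both results agree with the Gene Length and Protein Length fields of this record.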
|
|
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.00193855 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 9 |
Fosmid unclonability p-value | 0.133993 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGAAGCTCG ACGTCGCTGC GCTGCGGGGA ATCGAGCGCG AGAAGGAGAT CGCCTTCGAC ACGCTCGTCG AGGCGATCGA GACCGCGTTG TTGACCGCCT ACAAGCACAC CACCGGATCC GCCGACGACG CTCGGGTGGT GATCGACCGG ACCAGCGGCG AGGTCGCGGT CTTTGCGCGG GAGAGCGGCC CGGACGGGAC GTCCCGCGAG TGGGACGACA CCCCGGCGGA CTTCGGTCGG ATCGCCACGA TGACCGCCAA GCAGGTCATC ATGCAGCGCC TGCGCGAGGC CCAGCAGGAG GTCACCTACG GCCAGTACGC CGACCGCGAG CACGAGGTGG TCTCGGGCGT GGTCCAACAT CACGAGCAGC GGGCCGGTTC CCGCGTCGTG CTGGTCGATC TCGGCACGGT GGAGGGCGTG CTGCCCCCGG CCGAGCAGGT TCCCGGCGAG CGCCTGGAGC ACGGTGACCG GATCAAGTGC TATGTGGTGC ACGTGGCCCG CGGGATGCAC GGCCCGACGG TCACCCTCTC GCGGACCCAT CCCGAGCTGG TGAAGGGCCT GTTCCGGCTG GAGGTGCCCG AGGTCGCCGA CGGCACGGTC GAACTCGCCG CGATCGCCCG CGAGGCCGGT CACCGCACGA AGATCGCGGT GCGTTCGAAG GCGGCCGGGG TGAACCCGAA GGGCGCCTGC ATCGGCCCGA TGGGCAGCCG GGTGCGCGCC GTGATGGCGG AGCTGCACGG CGAGAAGATC GACATCGTCG ACTGGTCGGC GGATCCCGCG TCCTTCGTGG GCAGCGCGCT CTCGCCGGCC AGGGTGTCCC GGGTGGAGGT CACCGACCTG GCGAGTCGTT CGGCACGGGT GGTCGTTCCC GACTACCAGC TCTCGCTCGC GATCGGCCGG GAGGGGCAGA ACGCCCGGCT GGCCGCCCGG CTCACCGGAT GGCGGATCGA CATCCACTCC GACACCGAGG GTAGCGAGCC GCGCGCGGAG CGGCCGGCCG GGGAGGCCCC CCGTCGTCCG GGGACGGGGA CCGGGCCCCG GAGGTCCTCC GCAACGGGCG GCCACTCTCG GGGCGCAACG GGATAG
|
Protein sequence | MKLDVAALRG IEREKEIAFD TLVEAIETAL LTAYKHTTGS ADDARVVIDR TSGEVAVFAR ESGPDGTSRE WDDTPADFGR IATMTAKQVI MQRLREAQQE VTYGQYADRE HEVVSGVVQH HEQRAGSRVV LVDLGTVEGV LPPAEQVPGE RLEHGDRIKC YVVHVARGMH GPTVTLSRTH PELVKGLFRL EVPEVADGTV ELAAIAREAG HRTKIAVRSK AAGVNPKGAC IGPMGSRVRA VMAELHGEKI DIVDWSADPA SFVGSALSPA RVSRVEVTDL ASRSARVVVP DYQLSLAIGR EGQNARLAAR LTGWRIDIHS DTEGSEPRAE RPAGEAPRRP GTGTGPRRSS ATGGHSRGAT G
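The relationship between the gene sequence and the protein sequence can be reproduced with a short translation routine. The following is a minimal sketch, not the pipeline used to build this record: it assumes NCBI translation table 11 (same codon assignments as the standard code, with alternative start codons such as GTG read as methionine at the first position), and the names `translate_table11` and `gc_fraction` are hypothetical helpers introduced here for illustration.

```python
from itertools import product

# Codon assignments shared by the standard code and translation table 11,
# listed in the conventional TCAG order (third base varies fastest).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Start codons permitted by table 11; each is read as Met at position 1.
START_CODONS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}


def clean(seq: str) -> str:
    """Remove the whitespace used to group bases in blocks of ten and uppercase."""
    return "".join(seq.split()).upper()


def translate_table11(seq: str) -> str:
    """Translate a CDS from its first base, stopping at the first stop codon."""
    dna = clean(seq)
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        codon = dna[i:i + 3]
        if i == 0 and codon in START_CODONS:
            protein.append("M")  # initiator codon is always methionine
            continue
        aa = CODON_TABLE[codon]
        if aa == "*":  # stop codon terminates translation
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    dna = clean(seq)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First three ten-base blocks of the gene sequence above.
print(translate_table11("GTGAAGCTCG ACGTCGCTGC GCTGCGGGGA"))  # MKLDVAALRG
```

Passing the full gene sequence to `translate_table11` should yield the protein sequence listed in the record, and `gc_fraction` on the same input can be compared against the reported GC content.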