Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_2157 |
Symbol | |
ID | 5670557 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | - |
Start bp | 2585966 |
End bp | 2587519 |
Gene Length | 1554 bp |
Protein Length | 517 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 641241078 |
Product | hypothetical protein |
Protein accession | YP_001506499 |
Protein GI | 158313991 |
COG category | [S] Function unknown |
COG ID | [COG2006] Uncharacterized conserved protein |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.0332919 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 20 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACTGAAC GCCAGGAAAT AGACCGTGGA GTGCCGTACA ACATCCGAGA CAAAACCCAT CAGGTGGCTT TTACCCGCAC CGCGGCCGCG GCCTACCCAT CGAACGCCCC TTTCTCTCCG GACACGTCCT ATCCGGAGTA CGACCTGGGC CACGTCAACG ACGAACCTAA TCCGGTGTAC GCTGCCGTGC GCACGGTGCT CCGCCTGGCC GGGCTCGATC CGACGGCGGT CGACACGTCC TCCTGGAACC CACTCGGCGA TCTGGTCGCC TCAGGTGGCA CCGTGGTGGT CAAACCCAAC CTCGTACGCG AATCCCATCC CCGGGCTTCC GCAGGATGGA AGTGGGTGCT CACCCACGGC TCCGTGATTC GGGCAGTAGC CGACTACGCT TTCCTGGCGG TAGGCCGGTC AGGACGGGTC GTCGTGGCGG ACGCACCCCA GACCGATTCG TCATTTGCCG CGATTTCCAC CGTTCTCGGG CTCGACCAGT TGAGCCGGTT CTATCTGGAC CGCGGATTGC AGTTCGAGCT CGTCGACCTG CGCCAGGAGG AATGGACAAC GCGTGGCGAC GTCGTCGTGG CGCGCCACCG ACTCACCGGG GACCCAGCTG GTGCCGTCGC CTTCGACCTC GGACACTCGA GCGAGTTCGT CGATCACGGA GGATCGGGCC GCTACTACGG TGCCGACTAC GACTCACGGG TGGTCAATGA ACACCATTCC GGAGGCCGCC ACGAGTACCT GCTCTCCGGG ACGGTTATGA ACGCAGATCT CATCATCAAC ATACCCAAGC TTAAAAGCCA CAAGAAAGCC GGGATCACAC TCGGCATGAA GAACCTGGTC GGTGTGAACG CGGACAAGAA CTGGTTGCCT CATCATACTG AAGGCTGGCC CGGAAACAAC GGCGACGAGC ATCCTCGAGC CGACACGCGA CACCGCATTG AACGGAAGGC CGTGGCGGGC CTACGTCGAG CCGCGCTGGC CTGGCCTCGA GTCGGCGGAC ATGTCATGCG TCTCGCCCGG CAGGGCGGCA CACATGTCTT CGGTGACGGC GACACGGCCA TCCGCAGTGG CAACTGGTGG GGCAACGACA CGGTCTGGCG AATGTCCCTC GACCTCAACA AGATCGTCAT GTACGGCCGG GCGGACGGAA CGCTCTCCTC TCAGCCCACC GCACGCCGTC ATGTGGTGCT GGTGGACGGT GTCATTGCGG GACACCGAAA CGGACCGCTG AACCCCGACG CGATCCCTGG TCGGCTTTTG GCCTTCGGGC GCACGCCGGC CGCAGTGGAC GCCGCCACCA CCTACCTGTT CGGCTTCGAT CCGGACCGGA TTCCGACTGT TCGACAGGCT TTCATATGCC GCCACCTCCC CCTGGCAGCA GGGGACTGGC GCGACATCGA ACTGGTCGGA GACGACGAGA ATTGGTGTGG TCCGCTCAGT TCCCTGGCCG CGGGGGTGAC CTTGCTCGCC GAGCCGCACT TCGCGTGGAA GGGCCGGGTG GAACTCGTTC CAGCCCACGA TCATGCGGGA AACCCACGGG TAGGTACGAC ATGA
|
Protein sequence | MTERQEIDRG VPYNIRDKTH QVAFTRTAAA AYPSNAPFSP DTSYPEYDLG HVNDEPNPVY AAVRTVLRLA GLDPTAVDTS SWNPLGDLVA SGGTVVVKPN LVRESHPRAS AGWKWVLTHG SVIRAVADYA FLAVGRSGRV VVADAPQTDS SFAAISTVLG LDQLSRFYLD RGLQFELVDL RQEEWTTRGD VVVARHRLTG DPAGAVAFDL GHSSEFVDHG GSGRYYGADY DSRVVNEHHS GGRHEYLLSG TVMNADLIIN IPKLKSHKKA GITLGMKNLV GVNADKNWLP HHTEGWPGNN GDEHPRADTR HRIERKAVAG LRRAALAWPR VGGHVMRLAR QGGTHVFGDG DTAIRSGNWW GNDTVWRMSL DLNKIVMYGR ADGTLSSQPT ARRHVVLVDG VIAGHRNGPL NPDAIPGRLL AFGRTPAAVD AATTYLFGFD PDRIPTVRQA FICRHLPLAA GDWRDIELVG DDENWCGPLS SLAAGVTLLA EPHFAWKGRV ELVPAHDHAG NPRVGTT
|
| |