Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_4831 |
Symbol | |
ID | 5673172 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | - |
Start bp | 5769777 |
End bp | 5771063 |
Gene Length | 1287 bp |
Protein Length | 428 aa |
Translation table | 11 |
GC content | 75% |
IMG OID | 641243687 |
Product | extracellular HAF |
Protein accession | YP_001509103 |
Protein GI | 158316595 |
COG category | [S] Function unknown |
COG ID | [COG5563] Predicted integral membrane proteins containing uncharacterized repeats |
TIGRFAM ID | [TIGR02913] probable extracellular repeat, HAF family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.0943414 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 6 |
Fosmid unclonability p-value | 0.0197145 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTGGGCGT ATCCGTTCAG GGCTGAACCA CCGGGGCGGC GGCCCCGCCG GCGGGCGCGG CTCGTCCGCG CCACCGCGGC GGCCGTCGCG GCCGGTCTGC TCGGCAGCGC CGTGACCGCC TCGCCGGCCC TCGCGGCCGC ACCGCCCGAG ACGGCCGCAC CGCCCGAGAC TGCGGCGGCC TCCGAGACAG CCGCGGCGGC CGAGACGGCC GGTTCCCCGC GGGTGACCGT CACCGACGTG GGAGCGCCGG ACCGGCTGGC CACGAACCTG TTCCCCGAGG ACCTCAACGA CGCAGGCGTC GTCACGGGCT ACGGCCTGCT CGGCGGCCCG CAGCCCTTCC AGGCGTTCAC CTGGGCCGAC GGCACGCTCA CCCTGCTGAA CGCGCCGAGC GCCGACCCGG GGGCGTTCAG CTTCCCGGTC GCGCTGAACA ACCACGGCCA GGTCGTCGGC TTCACCACCG TCGGCGGGAC GGCGCACTCG CTGCGCTGGG ACGGCGCGGA CCCGACCGAC ATCTCCGCCG CCGGCGGGAA CAGCCACCCG CTCGCCGTCA ACGACGCCGG CCAGGTTCTG CTGACCGAGG GCGGCGCCGC GGCGCTGTGG ACCGCCGGCA GCCGGGTCGC GGTCGCGCCG TTCCCGGTCA CGAACGCCGT GGGCCTCAAC GGTTCGGGGC AGGTGTTCGG CACCGGCCGG GCCGCGGGCG CCGACGCCAC CGACCGCGCC TTCGTCTGGA CGCCGACCGC GACCACCGAC ATCGGTCCGT TCGGGCTCAC CACCACCACG ACCGACCTCA ACGACAGCGG CCAGCTGATC GGGTACGGCG CCTTGGCGCA GAGCCCCAAC CGCACCCATT CCTTCGTCTG GACCCCCGCG CGGGCCGGCC GGCCGGCTGC CCTCACCGAC CTGGGCACCC TGGCCAACCT GGAGACCGAG GCGCGCGACA TCAGCAACTC CGGCCACATC GTCGGGCGCA GCGGCACCCG CTCCGGCTGG CACGCCGTGC GCTGGCAGGG CGGCCGTCTC GTCGACCTCG GTGTCCTGCC CGGCGGGACG TCCAGCGAGG CGCTCGCCGT CAACGAGACC GGACAGGCTG CCGGCTGGGG CATCGCTGGC GACGGGCGGC CACATGCCAT CCTCTGGAAA AGGGACCGCC CGATCGACCT CGGGGTGCCC GCCGGTTTCA CCCAGAGCTT CGCAATAGAC ATCAACGCCG CTGGCAGGGT GCTCGGCTAC GCGATCGACG AGACGGGCGG CGTCCACAGC TTCGTCTGGA CGGTGACCGG CGGATGA
|
Protein sequence | MWAYPFRAEP PGRRPRRRAR LVRATAAAVA AGLLGSAVTA SPALAAAPPE TAAPPETAAA SETAAAAETA GSPRVTVTDV GAPDRLATNL FPEDLNDAGV VTGYGLLGGP QPFQAFTWAD GTLTLLNAPS ADPGAFSFPV ALNNHGQVVG FTTVGGTAHS LRWDGADPTD ISAAGGNSHP LAVNDAGQVL LTEGGAAALW TAGSRVAVAP FPVTNAVGLN GSGQVFGTGR AAGADATDRA FVWTPTATTD IGPFGLTTTT TDLNDSGQLI GYGALAQSPN RTHSFVWTPA RAGRPAALTD LGTLANLETE ARDISNSGHI VGRSGTRSGW HAVRWQGGRL VDLGVLPGGT SSEALAVNET GQAAGWGIAG DGRPHAILWK RDRPIDLGVP AGFTQSFAID INAAGRVLGY AIDETGGVHS FVWTVTGG
|
| |