Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_6689 |
Symbol | |
ID | 5675002 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | + |
Start bp | 8124439 |
End bp | 8125608 |
Gene Length | 1170 bp |
Protein Length | 389 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 641245537 |
Product | extracellular HAF |
Protein accession | YP_001510929 |
Protein GI | 158318421 |
COG category | [S] Function unknown |
COG ID | [COG5563] Predicted integral membrane proteins containing uncharacterized repeats |
TIGRFAM ID | [TIGR02913] probable extracellular repeat, HAF family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 18 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGTAAGTG ACCGTCAACC AGTCGCGCGC TTCGCGGTGG TGGCCGCCGC TTCCTCGGGG CCGGTCAACC TGAGCAACCC GACGTCAGCG GACGCCTCGG CCACCGCGGC GCGACCACCG TGGCAGGCCC GCCCGGCCGC GACCGCCGTC ACCGGGCCAC CGGGCGTCCC AGGCCTCCTG GTGTGTGGGA ACGCCAGTGG CCTGCTGGCC GGCCTCTGGC AGAGCGGCCG GGACCGGGTC GACGGCTTCC TGTGGCAGGC CGACACGGTT CGCCGCCTTC CCGGGAGCAG CCGTCCGATC GCCATGAGCT CTCTGGGCGT CGTGGTCGGC GAGATGATTG GCAGGCAGGG CCCGCAGGCA TTCCGCTGGA GCTACGGCAT CCACTCCGCG CTTCCCTACC TGGGCGGCAC GGACATCCCG GGAGGGCATT TCAGTATCGC CGTGAGCGTC AACGGCCTGG GCCTGGTCGC GGGCAGCAGC AGCACCGACA GCGGCGATCG GCATGCCTGC CTGTGGCGGG AACACATCAG CGAGGACCTC GGCACCCTTG GCGGCCCGGA GAGCTCCGCC GCCGCCGTCA ACGACCTCGG CCTGGTGGTC GGTACGAGTC AGACGCGCAG CGGGGCCGAC CACGCGTTCG TGTGGTCCGA CCGCCGTATG CATGACCTCG GCACCCTCGG TGGCACCTGG AGCAGGGCGT CCGGCGTGAA CAACGCCGGG CTGGTCATCG GGAGCAGCGG CACGGCGGAC AACCGGACCC ACGCGTTCTG CTGGCAGAAC GGAATAATGA CAGATCTCGG GACACTCGTT GGAGATGAGC ACAGCGAGGC CGTCGCGGTG AACAATGCCG GCCAGGTGCT CGTGCGCAGC TACGGCGGGG GAGTTCGGGC ATTCCTCTGG TCCGGCGGCG ACCTCACCGA GATCCACGGT TTCGGCGAGG GTTGGGTCGA CCCGGCCGGG CTCAACGACT CCGGCGTGGT CTGCGGAACG GTCGAGCTCC CGACCGGGGA CGGGCATGCG TTCCGCTGGC GGCACGGAGT GGTCACCGAT CTCGGGACCC TCGGCGGGAG GCAGAGCTCG GCCAGTCACG TGACGGCCAG TGGAGTCGTT CTCGGTGAGG CGGTGAGCGT CTCGAGCCCC GTCCCCCACG CCGCCTTCTG GACACCGTAG
|
Protein sequence | MVSDRQPVAR FAVVAAASSG PVNLSNPTSA DASATAARPP WQARPAATAV TGPPGVPGLL VCGNASGLLA GLWQSGRDRV DGFLWQADTV RRLPGSSRPI AMSSLGVVVG EMIGRQGPQA FRWSYGIHSA LPYLGGTDIP GGHFSIAVSV NGLGLVAGSS STDSGDRHAC LWREHISEDL GTLGGPESSA AAVNDLGLVV GTSQTRSGAD HAFVWSDRRM HDLGTLGGTW SRASGVNNAG LVIGSSGTAD NRTHAFCWQN GIMTDLGTLV GDEHSEAVAV NNAGQVLVRS YGGGVRAFLW SGGDLTEIHG FGEGWVDPAG LNDSGVVCGT VELPTGDGHA FRWRHGVVTD LGTLGGRQSS ASHVTASGVV LGEAVSVSSP VPHAAFWTP
|
| |