Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_0529 |
Symbol | |
ID | 5668947 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | - |
Start bp | 615599 |
End bp | 618124 |
Gene Length | 2526 bp |
Protein Length | 841 aa |
Translation table | 11 |
GC content | 76% |
IMG OID | 641239457 |
Product | hypothetical protein |
Protein accession | YP_001504895 |
Protein GI | 158312387 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG4591] ABC-type transport system, involved in lipoprotein release, permease component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.628545 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 16 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGATCCGGC TGGTCTGGCA CACCGTCCGC GCCCGCAAGG GCAGCCTGGT GGGCACATTC GTCGCGCTGA CGCTCGGCGT GGCGCTGCTG GCCGCGATGG CGCTGACCCT CGTGAGCAGC GTCGGGGGCG GCGGGGGCCG CCCGACCTGG TACGTCGATG CCGACGTGGT CGTGGCGGGT GGCGGCATTG TCAGTGTCAC CACCGGTTCA GGCGAGGACC GGGAGACGGC CTCGCTGCGT ACCGCCACCG CGCGCGGGCT GCCGAGCGGC CTGCGCGACC GTCTGGCCGG CCTGGACGCT TCCCTGGTGC TGGACTTCGC GGCGTACGCG ACGGCGCCCG GCGCGCCCGG TGACACCGTC CGCCCGTGGT CGGCCGCCGC CCTGCACGCC TACGGGTGGG TCGCGGGCGG GCCGCCCCGT GGTGCTCGGG ACGTCGTGCT CAGCGCCCCG ACTGCCCACC GTCCCGGAGC CGAGATCACC GTCGTCACCG GGCGCGGCGT GGAGCGCTTC GTGGTCAGCG GTGTGCTGCG CACCGACGCC CCGGCGGCGC TGTACACCGC CGACGCCGTT GCCGAGGAGC TCGCCGACGG CCGCGTCGCG GCGGTCGCGC TGAACGCGCC GGGCGCCGGC CCAGCCGGCG CCAGCACGCC CGCCGTCATG TCGTTGGCCG ACGCCGCCCG CGCCGTCGTC GGCGACACCG CGCTCGAGGA CGGCTCCGTT CAGGTGCTCA CCGGCGACGA CCGGCGCCGA GCCGAGCCGG ACCCCGACGC CGAGCGGCGG ACGGAGGCGG TCGCCCTGCT CGCCGCCACC ACGGGGCTCG CCGGCTTCGT GTCGATCTTC GTGGTCTCCG GGACGTTCGC CTACGCCGTG ACGGCCCGGC GCCGTGAGTT CGGCCTGTTG CGGGCCGCCG GCGCCACGCC CCGCCAGGTC TTTCGGATCG TGCTCGGCGA GGCCCTCACC GTCGGTGTGC TCGCCTCGCT GGCCGGCGGT GCCCTGGGAG CCGCGATCGC CCCGGACTTC GCCGCGCGCC TGGCCCGCAC CGGGTTCGTG CCCAGCGACT TCACCGCCCG GTTCGTCTTC TGGCCGGTGG CCGCCGCGTT CGGGACCGGG CTGGGCGTGG CCCTGCTCGG TGCCTGGGTG GCGGCGCGGC GAGCGGGGCG GGTGCGCCCG GTCGAGGCGC TGCGCGACGC GGCGGTGGAC CGTCGGCCGA TGACGACGGC GCGCGCGCTG GCCGGCCTGC TCGCTCTCGG CTGCTGCGTG CCGCTGGTCG CGGTGCTGGT GGCGAACCCG AGCGCGGACG CCGTCGCGCT CATCATGATC ACGGCGCTGT TCCTGATCGT CGCCTGCGCG ATGTTCGCTC CGCTCGTGGT GCCGCCGCTG GTGTCGCTGC TCAGTGCCCC GCTGTCGGGG TCGTCGGGGG CGGTGGGCCT GCTCGCCGGC CATGGCGCGC GGGCGGCCGT ACTGCGCACC GCGGCGACCG CGGCGCCGAT CCTGGTGACC GTCGGGATTG CCGGCTCCAC CCTGACCGGG CTCGGCACCC TGCAGGCGGC GACGCAGAAC GCGGCACGGG AGCGGATCAC GGCCGAGGCA CTCACCATGC CCGTCACCGG GAAGGGGCTG CCCGACGCGA GCGTCGCCGC CCTGCGCGAG GTCCCCGGGG TGAGTGCCGC CGTTCCCGTC ACCGAGAGCC GGGTCTACGT TCGGGACGGC GACGAGCCGG AGGGCTGGAC CGGCTACTAC GCGTCGGGCG CGGACCTCGC CGCCGTACTC GACGTCCCCC TGGTCGCGGG ATCGCTGGCC GACCTCGCCG GCACCGACAC GGTCGCCGTT CCGGAGGGTC GCTGGGAGCT CGGCGAGACG GCCGAGCTGT GGCTCGGTGA CTCGACGCCG GCACGGCTGC GGGTGGTGGC CGTCTTCGAG AGGCAGCTCG ACCTCTCGGA GACCGTCCTG CTGCCGTCGC GGCTGCGCGA CCGCCACGCC CTCCCGGGGG CCGACGTCGT CTACCTGCGG CTGGCACCGG ACGCGTCGTT GGAACAGGTA CGCGCGGCTG CCGCCGCCGG TGCCGGAACG GTGGTCGACA CCGGGAGCTA CCTCTCGGCG GCGGGCGAGG AGGAGGCACG GGTGAACCGG CAGGCGTCGG TCGCCATGCT CGGCCTGTCG CTCGTCTACA CCGGCATCGC GATCGCCAAC ACCCTCGTGA TGGCGACCCG GGACCGCGCA CGGGAGTTCG CGACCATCCG GCTCGCCGGT GCCACCCGCC GTCAGGTGCT GTGGGTGGTC GGCACCGAGG CGGTGCTGGT GACCTGCATC GGGGTGCTGC TGGCCGCGGT CGTCACGGCG GTCACGGCAC TCGGTGCCCG CCACGGCCTG GCCGACATCG CGCCGTCCGT GCCGCTGGCC GTGCCCTGGG CTCCGCTCGC CGGGATCGTC CTGGCCTGCC TGGTCACGGC CGTGCTGGCC AGCGTGATCC CGGCCGCGCT GCTGCTGCGT CGCCGCCCCG CCGAGCTGGC CGGCGTCCGC GAGTAG
|
Protein sequence | MIRLVWHTVR ARKGSLVGTF VALTLGVALL AAMALTLVSS VGGGGGRPTW YVDADVVVAG GGIVSVTTGS GEDRETASLR TATARGLPSG LRDRLAGLDA SLVLDFAAYA TAPGAPGDTV RPWSAAALHA YGWVAGGPPR GARDVVLSAP TAHRPGAEIT VVTGRGVERF VVSGVLRTDA PAALYTADAV AEELADGRVA AVALNAPGAG PAGASTPAVM SLADAARAVV GDTALEDGSV QVLTGDDRRR AEPDPDAERR TEAVALLAAT TGLAGFVSIF VVSGTFAYAV TARRREFGLL RAAGATPRQV FRIVLGEALT VGVLASLAGG ALGAAIAPDF AARLARTGFV PSDFTARFVF WPVAAAFGTG LGVALLGAWV AARRAGRVRP VEALRDAAVD RRPMTTARAL AGLLALGCCV PLVAVLVANP SADAVALIMI TALFLIVACA MFAPLVVPPL VSLLSAPLSG SSGAVGLLAG HGARAAVLRT AATAAPILVT VGIAGSTLTG LGTLQAATQN AARERITAEA LTMPVTGKGL PDASVAALRE VPGVSAAVPV TESRVYVRDG DEPEGWTGYY ASGADLAAVL DVPLVAGSLA DLAGTDTVAV PEGRWELGET AELWLGDSTP ARLRVVAVFE RQLDLSETVL LPSRLRDRHA LPGADVVYLR LAPDASLEQV RAAAAAGAGT VVDTGSYLSA AGEEEARVNR QASVAMLGLS LVYTGIAIAN TLVMATRDRA REFATIRLAG ATRRQVLWVV GTEAVLVTCI GVLLAAVVTA VTALGARHGL ADIAPSVPLA VPWAPLAGIV LACLVTAVLA SVIPAALLLR RRPAELAGVR E
|
| |