Gene Information |
Locus tag | Franean1_3701 |
Symbol | |
ID | 5672067 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | + |
Start bp | 4382428 |
End bp | 4383645 |
Gene Length | 1218 bp |
Protein Length | 405 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 641242584 |
Product | hypothetical protein |
Protein accession | YP_001508004 |
Protein GI | 158315496 |
COG category | |
COG ID | |
TIGRFAM ID | |
| |
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 22 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGCGAGA AGCCCACCGA GTCGAAGATC GAGCTCGACC TCTCCGACGT CGACCACCGG GTCGGCCTGC CGATCGGCGG GGGGCAGCTG TGGGACCCGT GCACCGCGAC GGACATCCGC CGCTGGGTGA TGGCGATGGA CTACCCCAAC CCGCTGCACT GGGACGAGGA GTTCGCGCGT GAGTCCCGGT ACGGCGGCCT GATCGCCCCG CAGTCGATCG CGGTCGGCCT GGACTACGGC CACGGCTGCG CGCCCGCCTG CGTCGGACGC ATCCCCGGCA GCCACCTGAT CTTCGGCGGC GAGGAGTGGT GGTTCTACGG CAGCCCCATC CGGGTCGGCG ACAAGTTGGT GCAGGAGCGG CGTTTCCACG ACTACAAGGT CGCGGAGACG AAGTTCGCCG GGCCGACCAT GTTCTCCCGC GGGGACACCG CCCACCGCAA CCAGCACGGC GCCCTGGTGG CCCGCGAGCG CTCCACCGCC ATCCGCTACC TCGCCGCCGA GGCGGAGAAG CGCGGCATGT ACGAGAACCA GGTCGGCGCG GTGAAGCGCT GGACGAGCGC CGAGCTGGCC GAGATCGAGA AGCTCCGCGA CAGCTGGCTC CACTCGAACC GCACCGGCCT CTCGCCCGCC TTCGAGGACG TCAGCGTCGG TGACACCCTG CCGCGGCGGG TGATCGGCCC GCACAGCATC GCCAGCTTCA CCACCGAGTA CCGCGCCTTC ATCTTCAACA TCTGGGGGAC GTTCCACTGG ACGGCGCCCC CCGGCATCGA GGACCCCTGG GTCTACCAGG ACCCGGGCTG GGTGGAGGGC TTCGGCTTCG ACGAGGAGGG CGCCCGGATC GACCCGCGCC TGCGCGACGG GCTCTACGTC GGGCCGTCGC GCGGTCACAT CGACAGTGAC AAGGCCAGCG AGGTCGGCAT GGCCCGCGCC TACGGCTACG GCGCCACGAT GGGCGCCTGG TGCACCGACT ACCTCTCCTA CTGGGCCGGC CACGACGGCA TGGTGCGGCA CTCCAAGGCC AGTTTCCGCC TCCCCGCCTT CGAGGGCGAC GTCACCTACT TCGACGGCGA GGTGGTCGGC AAGGAGGAGG GCTCGGTGTG GGGCGTGCCG CTGGTCCAGG TGAAGCTGCG GCTCACCAAC CAGGACGGCG GCGTGCTGGT GGACTGCACC GCCGAGGTCG AGCTGCCGTA CCGGCGCGAC CGCGTGGCCG GGAGCTGA
|
Protein sequence | MSEKPTESKI ELDLSDVDHR VGLPIGGGQL WDPCTATDIR RWVMAMDYPN PLHWDEEFAR ESRYGGLIAP QSIAVGLDYG HGCAPACVGR IPGSHLIFGG EEWWFYGSPI RVGDKLVQER RFHDYKVAET KFAGPTMFSR GDTAHRNQHG ALVARERSTA IRYLAAEAEK RGMYENQVGA VKRWTSAELA EIEKLRDSWL HSNRTGLSPA FEDVSVGDTL PRRVIGPHSI ASFTTEYRAF IFNIWGTFHW TAPPGIEDPW VYQDPGWVEG FGFDEEGARI DPRLRDGLYV GPSRGHIDSD KASEVGMARA YGYGATMGAW CTDYLSYWAG HDGMVRHSKA SFRLPAFEGD VTYFDGEVVG KEEGSVWGVP LVQVKLRLTN QDGGVLVDCT AEVELPYRRD RVAGS
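The coordinate and length fields in this record can be cross-checked against each other. The sketch below is a minimal illustration, not part of the source database. It assumes 1-based inclusive coordinates and a CDS that ends in a single stop codon (the gene sequence above ends in TGA). The function names `span_length`, `expected_protein_length`, and `gc_fraction` are hypothetical helpers introduced here. `gc_fraction` can be applied to the gene sequence as displayed, since it ignores the spaces between the 10-base blocks.

```python
# Values copied from the record above; the helper names are illustrative only.
START_BP = 4382428
END_BP = 4383645
GENE_LENGTH_BP = 1218
PROTEIN_LENGTH_AA = 405


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Codon count minus one, assuming the CDS ends in a single stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


def gc_fraction(sequence: str) -> float:
    """Fraction of G and C bases, ignoring whitespace between blocks."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    return (bases.count("G") + bases.count("C")) / len(bases)


if __name__ == "__main__":
    # Both checks should agree with the Gene Length and Protein Length fields.
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
    # GC fraction of the first two 10-base blocks of the gene sequence.
    print(gc_fraction("ATGAGCGAGA AGCCCACCGA"))
```

Passing the full gene sequence to `gc_fraction` gives a value that can be compared with the GC content field, which the record reports rounded to a whole percent.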