Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_2549 |
Symbol | |
ID | 5670943 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | + |
Start bp | 3030879 |
End bp | 3033377 |
Gene Length | 2499 bp |
Protein Length | 832 aa |
Translation table | 11 |
GC content | 73% |
IMG OID | 641241465 |
Product | hypothetical protein |
Protein accession | YP_001506885 |
Protein GI | 158314377 |
COG category | [U] Intracellular trafficking, secretion, and vesicular transport |
COG ID | [COG3505] Type IV secretory pathway, VirD4 components |
TIGRFAM ID | |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.207629 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 15 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGACCTGC TCGTCGCGGC GACCGCCACG CCGTCGTCGC CGCGCACCAC CTACCTGACC GACCCCGCCG GCTTTCTCCA CCAGATCCTC GGCCAGCTCC AGGGGTGGGC AACGGTCTGG GGACCCGTCG CGGGCCCGCT GCTCACCCTT ACCGCCGCCG GGCTGCTCAC TCTGCGCCGG CGGATGCGCC GCCGCTACCA GCAGCGGCTG ACCGCCGGCG CCCGCCTCGT GACCGTGTTG GCCCCGCCCA CCGTCGACCC GGCGGGCGCG ACCGCGCTGT GGTCGAACCT GCTCGGCCTG CTCCGCCCCG GCTGGCGGCG CCTGTTCGGC CAGCCGCACC TGCTGTGGGA GTACCAGTTC ACCGCCGACG GGGTCCGTAT CCAGATCTGG GTGCCCGGCG TCGTCCCCGA TGGGTTCGTT GAACGCGCCA TCGAGGCGGC CTGGCCTGGC GCTCACACCC GCACCACGCC CGCCCGCGCG CCGCTGCCCG TCGTCGCCCG GCCGGGTCGG CGGCTGCTCG CTGCCGGCGG CGAACTGCGC CTGGCCCGCC CCGAAGCACT CCCGATCCGG ATCGACCACG ACGCCGACCC GATCCGCGCC CTGCTCGGCG CGCCCGGCAG CCTCGCCCGC AACCAACGGG CGGCCGTGCA GATCCTGGCC CGGCCGGTCA CCGGCCGCCG CGTCGCGAAA TCCCGCCGGG CCGCCCGCCG GCTGCGCTCC GGCGGCTCCG CGCACCTGAT CGGCGGGCTG CTGGACCTAC TCACTCCCCA CACCGGCCGC ACCCGGCGGC GCCACCGGAC GGCGACCACC ACGGTGAAGG TCGACCCGCA GACGTCGCTG GCCCTGTCCG CGGAAGACCG CGCCATCGTC ACCAAGCAGC GAGGGGCCCA GTACGAGGTC CGCGTCCGCT ACGCCATCGC CGCGATCCTC GACGAGCACA CCGACGACAC CACCGCCGCC CAGGTGGCGA GCCAGCTACG CGGCCGGGCG CACGCCATCG CGTCGGCGTT CTCCGCCTAC GGCGAGCACA ACTACTACCG CCGCGTCCGG CTCCGCCGCC CGCTGCCCGT CCTCGCCACC CGGCAGTTCG GCCGTGGCGA CCTGCTCTCG GTCGCCGAAC TCGGCGCGCT GGCCCACCTG CCGGTCGACG AGGCGACCCC TGGCCTGCAA CGCGCCGGTG CGAAGGCGGT CGCCCCGCCG CCCGGCGTCG CCGGCCACGG TCCGAACGTC CGCCCGATTG GGCGCACCGA CGCCGGTCAC GCGCGTCCGG TTGGTCTGCG GGTCCCGGAC GCCCGTCATC ACCTGCACGT CCTCGGCGCG ACCGGCGCCG GCAAGTCCGA ACTGCTCGCC CGCATGACCC TCGACGACGT CGCCGCGCAC CGAGGCGTGG TCAATGTCGA CCCGAAGGGC GACCAGATCA TCGACATTCT CGCCCGCTAC CCCCTCGACG CCCTCGACCG CCTCGTCCTG TTCGACGCTG AGTCGTCCAG CCGGCCGCCG TGCCTGAACC CGCTGGATCA GCCCGACCGG GGACGGGCTG TCGACAACCT CGTCTCGATC TTCAGTCGGG TGTATGCCGA CTCGTGGGGG CCGCGCACCG AGGACATCTT CCGCGCCGGC CTGCTCACCC TCGCCGCCCA ACCCGGTGTC CCCGTCCTGA CTGACCTACC GAAACTGCTC ACCGACACCG CCTATAGGCA CCGGGCGCTC GGTGAGATCG ACGACGACAT CCTGGCCGGC TTCTGGACCT GGTACGAAGC TCTCTCCGAC GCCGCCCGCG GTCACGTCGT CGCCCCGCTC ATGAACAAAC TCCGGGGCTT TCTGCTGCGG CCGTTCGTGC GGGCCGCGAT CGCCGCCGGC CCCTCCACCG TCGACATGGA CAACGTGTTG AACGACGGCG GGGTGTGCCT GGTCCGCATC GCCCAGGACG CCCTCGGTGT CGAGACCGCG GCACTGATGG GCTCCATCGT CGTCTCCGCC GTCTGGCAGT CCACCACCCG CCGCGCCCGC CTACCCCAGG GGAAAAGGCC CGACGCCAGC CTGTACTTGG ACGAGGCACA CCATTTCCTC ACGCTTCCGT ACGCGTTGGA AGACATGCTC GCCGCCGCCC GTGGCTACCG GCTCGGGATC ACCCTGGCGC ACCAGAACCT TGTCCAGCTG CCTCGGCATC TGGAAGAAAG CATCGGCGCC AACGCCCGCT CGAAGATCTA TTTCACGGTG AGCCCGGCGG ACGCCAAGCG CCTCGCCCGG CACACCGAGC CCCGGCTCGC CGAGCACGAC CTGGCCAACC TCGGCGTGTT CCACGCCGCC GCCCGCCTCG TCGTCGGCGG AGAAGAAGCC CCCGCCTTCA CCCTCGTCAC GGAGAAACTG CCCCCGCCAG TACCCGGCCG CGCCGTCCAG ATCCGCCGGG CCTTGCGCCG CCGCGCCGCC ACCCCCGCCG CGCCGGCTCC CACCGGTCCG GGTCCGCGGC CCACCTCCGA CCCGCGTCGC GTCGCCTGA
|
Protein sequence | MDLLVAATAT PSSPRTTYLT DPAGFLHQIL GQLQGWATVW GPVAGPLLTL TAAGLLTLRR RMRRRYQQRL TAGARLVTVL APPTVDPAGA TALWSNLLGL LRPGWRRLFG QPHLLWEYQF TADGVRIQIW VPGVVPDGFV ERAIEAAWPG AHTRTTPARA PLPVVARPGR RLLAAGGELR LARPEALPIR IDHDADPIRA LLGAPGSLAR NQRAAVQILA RPVTGRRVAK SRRAARRLRS GGSAHLIGGL LDLLTPHTGR TRRRHRTATT TVKVDPQTSL ALSAEDRAIV TKQRGAQYEV RVRYAIAAIL DEHTDDTTAA QVASQLRGRA HAIASAFSAY GEHNYYRRVR LRRPLPVLAT RQFGRGDLLS VAELGALAHL PVDEATPGLQ RAGAKAVAPP PGVAGHGPNV RPIGRTDAGH ARPVGLRVPD ARHHLHVLGA TGAGKSELLA RMTLDDVAAH RGVVNVDPKG DQIIDILARY PLDALDRLVL FDAESSSRPP CLNPLDQPDR GRAVDNLVSI FSRVYADSWG PRTEDIFRAG LLTLAAQPGV PVLTDLPKLL TDTAYRHRAL GEIDDDILAG FWTWYEALSD AARGHVVAPL MNKLRGFLLR PFVRAAIAAG PSTVDMDNVL NDGGVCLVRI AQDALGVETA ALMGSIVVSA VWQSTTRRAR LPQGKRPDAS LYLDEAHHFL TLPYALEDML AAARGYRLGI TLAHQNLVQL PRHLEESIGA NARSKIYFTV SPADAKRLAR HTEPRLAEHD LANLGVFHAA ARLVVGGEEA PAFTLVTEKL PPPVPGRAVQ IRRALRRRAA TPAAPAPTGP GPRPTSDPRR VA
|
| |