Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_2668 |
Symbol | |
ID | 5671061 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | - |
Start bp | 3155971 |
End bp | 3158907 |
Gene Length | 2937 bp |
Protein Length | 978 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 641241583 |
Product | hypothetical protein |
Protein accession | YP_001507003 |
Protein GI | 158314495 |
COG category | |
COG ID | |
TIGRFAM ID | [TIGR03607] patatin-related protein |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 15 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGAAGGTGA AGCGCCGTGG CGGCCCTCGC CAGCCTGGGG CCGAGGATCT TGAGGACGTT CGGATCGCGG TCGTGCTCAA CGGCGGCGTG AGCCTCGCCG TGTGGATGGC CGGTGTCGTC AACGAGATCA ACACGCTGGC GCAGGCCCGC TCCGCCACAC CGCTCTCGGC GGCGCCTCCG GCCTGGGCCG CCGGCACGGT GTACGGCTGT CTCCTCGACC TCGTTCACGC GACGGCACGG GCGGACGTGA TCTCCGGCAC CTCGGCGGGC GGCATAAACG GCGCGTTCCT GGCCTACGCC CAGGCATACG GCGCCGATCT GGCCGGCCTG GGTGCCAAGT GGGCCGAGCT CGGTTCCTTC GAGGAGCTGC TGCGCGACTC GACGGGACCG GACCAGCAGT CCTTTCTCCT CGGCGACGAC TACTTTCTGC CCGAGCTGGC ACGCGCGTTC ACCGAACTGG TGCCGGAGGA CAGCCGGGAG CAGTCGTACG TGCCGCCCGC CGAGCGGCCG GTCGACCTCG TCATCAACAC GACGCTGATG CACGGCCGGT CGACGCGGCA CGTCGACGAC TTCGGCACGG AATTCGAGGA ATCCAGCCAC AGCGGCGAGC TTCGATTCAC CCGCACGGCG GACACTCCGC CGTCGGACGA TGTGTTCAGG GGCGCCGCCA TCGCGCGTCA GCTGGCGCTG GCCAGCCGGA GCACGGCGTC GTTTCCCGTC GCCTTCGAAC CGAGTTTCCT GCCGGTCGGG GAGGACGACG GGGACGCCTA CCACCCGGAC ATGGGCGCCG CCGTACAGCG GCCCACGGCC GCGCCGGGCC CCCGGAGCCG GTACGTCCTC GATGGCGGCA TCCTGCTGAA CCAACCGGTG CGCCCCGCGC TGGCGGCGAT CTACGCCCAG CCGGCAGACC GGCAGGTTCG GCGGCTGCTG ACGCACGTCA ACCCGGACCC GCGCTCCGCC GTCCCCGGCG AGATCGCGGG CCTTCTCGGC CGGGGCCGAG CCGGCGACGA GGGGCTCGAC AGGGCCGGGG GGCACGACAG CGCTGGCGGG CATGACAGCA CCGAGGTGCC CGACAGCGCC GGCAGGCCGC CGACCCTCGC TGCCGTGCTG GCAACGCTCG CCGTCCTGCC GTATGCGCAG TCAGTCGAGG TCGAGCTCCA GCAGCTCCAG GAAACAAACG ATCGCGTCCG GCGACACAGA TCGGGACGCG CGCACCTGAT CGGTTCCCTG GATGAGAAGC TCGCCGGCCA ACTCATGGAC CAGTACCGGA CGATGCGGTC CTGCCAGGAG GCCGACCGCA TCGGGGCCGT GTTCGCGGCG GCCCTCGACC GACAAAGGCT GTGGTGGACG CGCGACGAAC TCGCGGGGGC CCTGCTGGAT GTGGACCTCG ACGGGATCCC CTGCGTTCCG CGCGCCCTGC CCCGTCGGAA CGCCGATCCC AAGCGGTGGG CCTGGGGAAT CGGGCCGGTT GAACGCATCG GTGTGCTCGC CGTCGACCTG TTCAAACGCG CGATGTGGCT TGCCCCGCCA GCGGATCACG CGCTTCGCGG CCGGTTGCGT GGATACCGCG GGCAGCTGCA CGGCGAGCTG GCGACGCTGC GCGCGATCCG CCGCGACGAC GACGATTTCT GGCGAAGCTG GTCGGCCCGG CCGCCGGCGC AGCCCGGGGC GGACGAGGAC CCGCGGGATT CGGCGGATGC CGGCGACGGC TCGGACCCGG TCGAACCGGC GGCACCAGGG GGCTCCGCGG AACGACGGGC GCGGCTGCGT ACCTGGCTGC AGGACGCGCT CGCATCCTGG CCGCTCCCGC CCGGGGCGGC GGACATCCCC CGGGACACGT TGCCGGTACG GCTGGGCACG GTGGCGGAGC GGATAGTTCG CCTGCTGATC GATGCGGCGG ACGACCTCTG GCGGCTCGGG GCGCCGGACG CGGGTGACGC GGTAGCCACC GTTCCTCACG TGGCCGTTGA GACCGACCTG CTGAAGAAAC TGCTGCGCGC GCTCCTGTCG GATGACCTCC GGAGTTCCCC GGCAGCACTG CGGCGGCGGT TGTTGGCGGT GGAGGTCTGT CAACTCGCGA TGGCCGGCGC ACCGCGGGAC GCGGAGCAGG AGGTGATCTT CCAGCAGATC AGCGGCTTCA CCCCGAACTC GTTCGGCGGA CCGGCGACGC CGGACAAGAT CGCCGGCATC AGGCTGTTCT GGTTCGGCGG TTTCCTGAAA AGATCATGGC GGGTCAACGA CTGGATCTGG GGCCGGTTGG ACGGCGCGAC GCGCATGGTT CAGGCGGTCC TCGACCCGGC GAGGCTGCGC CAGCTCGGCC GCTCCGCTCA GGAGACCCAC GACGCGCTGC ATCGGATCGC TGTCGGTGGG GCCTACCGGG ACGACCTCTC CGGGTATTTC CGCGACGCGC GGGACGAGAT CTTCGCCGAA CTCGCCTTCC TGGACTCGGA GGATCCCGGC GCGCAGGCAG GCTCCCTGAC GGCGACGACG CGCGCGGTCA CGCGGCGGCT CCACGCCGAC ATCCTCGCCG AGGAGATGCC GCACCTCGTG GAGGCGCTGC GGCACGACAT CGCCAAGGCC TCCGGCGGGA GGCCCGTCGG CTGTCGCTTC CTGGAGAAGC AGGCCGCCAC GCTCCGGCCG GCGGCCCCCC AGCCGTCGCT GGACGACCTG TTCGCCCAGT TTCTGGAGGC GAACGCGGAG ATGATCGGTG AGCCTCTCAG GCACGCCGTT CCGCGGGGTC TCCTGGGACG CTCGACAACA CTGGCCCTCG GGCTCATCGG GGAGCTGGCG AAGATCTCGT TGTCCAACGC CGTGGAACTG GTCGTCGAGG AACTCCCGCT CGGGCTCAGG TGGCCCGCTC ACGTGGTGAC CGCGCCGGGC CGGTGGGCCA TCCGAAAGAT CTCGGGGGTG TTCGAACCTT GGGTGCCATC TCCGTGA
|
Protein sequence | MKVKRRGGPR QPGAEDLEDV RIAVVLNGGV SLAVWMAGVV NEINTLAQAR SATPLSAAPP AWAAGTVYGC LLDLVHATAR ADVISGTSAG GINGAFLAYA QAYGADLAGL GAKWAELGSF EELLRDSTGP DQQSFLLGDD YFLPELARAF TELVPEDSRE QSYVPPAERP VDLVINTTLM HGRSTRHVDD FGTEFEESSH SGELRFTRTA DTPPSDDVFR GAAIARQLAL ASRSTASFPV AFEPSFLPVG EDDGDAYHPD MGAAVQRPTA APGPRSRYVL DGGILLNQPV RPALAAIYAQ PADRQVRRLL THVNPDPRSA VPGEIAGLLG RGRAGDEGLD RAGGHDSAGG HDSTEVPDSA GRPPTLAAVL ATLAVLPYAQ SVEVELQQLQ ETNDRVRRHR SGRAHLIGSL DEKLAGQLMD QYRTMRSCQE ADRIGAVFAA ALDRQRLWWT RDELAGALLD VDLDGIPCVP RALPRRNADP KRWAWGIGPV ERIGVLAVDL FKRAMWLAPP ADHALRGRLR GYRGQLHGEL ATLRAIRRDD DDFWRSWSAR PPAQPGADED PRDSADAGDG SDPVEPAAPG GSAERRARLR TWLQDALASW PLPPGAADIP RDTLPVRLGT VAERIVRLLI DAADDLWRLG APDAGDAVAT VPHVAVETDL LKKLLRALLS DDLRSSPAAL RRRLLAVEVC QLAMAGAPRD AEQEVIFQQI SGFTPNSFGG PATPDKIAGI RLFWFGGFLK RSWRVNDWIW GRLDGATRMV QAVLDPARLR QLGRSAQETH DALHRIAVGG AYRDDLSGYF RDARDEIFAE LAFLDSEDPG AQAGSLTATT RAVTRRLHAD ILAEEMPHLV EALRHDIAKA SGGRPVGCRF LEKQAATLRP AAPQPSLDDL FAQFLEANAE MIGEPLRHAV PRGLLGRSTT LALGLIGELA KISLSNAVEL VVEELPLGLR WPAHVVTAPG RWAIRKISGV FEPWVPSP
|
| |