Gene Information |
Locus tag | Franean1_1484 |
Symbol | |
ID | 5669888 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | - |
Start bp | 1773858 |
End bp | 1777292 |
Gene Length | 3435 bp |
Protein Length | 1144 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 641240404 |
Product | hypothetical protein |
Protein accession | YP_001505830 |
Protein GI | 158313322 |
COG category | [R] General function prediction only |
COG ID | [COG0433] Predicted ATPase |
TIGRFAM ID | |
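
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The following is a minimal Python sketch, assuming that Start bp and End bp are 1-based inclusive coordinates and that Protein Length excludes the terminal stop codon. The function names are hypothetical and are not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def expected_protein_length(length_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming one terminal stop codon."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1


# Values taken from the Start bp and End bp fields of this record.
length = gene_length(1773858, 1777292)
print(length)                           # 3435
print(expected_protein_length(length))  # 1144
```

Both results agree with the Gene Length (3435 bp) and Protein Length (1144 aa) fields listed for this locus.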
|
|
Plasmid Coverage Information |
Num covering plasmid clones | 1 |
Plasmid unclonability p-value | 0.0469742 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage Information |
Num covering fosmid clones | 15 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGCGCCG AACTGGAGGC GCTCGCCGCC ATCAAGCTGG ATTGGGTACA CGGGCTCGAG GACGTCTGGC GCGACTCGCC GGTGCACGTC GACGGCGTCC ACGTCGAGCC GCGCCGGCGC ATTCTCGCGG CTCTGGACCG GCTCGGCGAG CGGAAGCCGC CAGGCATCGT CGTCCAGGGT CAGGCCGGTG CGGGGAAGAC ACACCTGCTC GGCTGGGTAC GTCAGCAGGT CGAACGGCGG GGCGGTTACT TCTTCCTGAT CGGCCCGGTG AGTGGGGACG CGTTCGCCGA GAGCGTCGTC TCGGCGATGC TCACCGGCCT CTTCCGGACC GTCGACGGCG AGGCGGAGAT CCGGCGCTTC CTTCTGGCGC TGTGCGCGCA GTTCCGCGTG CCGGCGGAGC TGACCTCCGC GGTCGTCGGG GACCAGCCTC TGGAGCCGAC TCATCTCAAG GACCTCGTGG TCGCGTTCTT CGACCACGAC CGGGCGCTGG CTCGCGAATG CCAGCACACC CTGCGCGCCC TGGTCCTGTA CAGCTCGGAC GACTATGACG CCCAAGCGGT CGGCGAGACC TATCTGCGGT CGATGGACGA GCAGGAGCCC GGCGAACGGG CCCGCCGCGG CATGCAGGCG CGGACCCGCC CGCCGATGGA CATCGCCCGC GATCTGTCCC GGCTGCTCGC GCTGACCGGA CCGAGCGTCA TCGCGGTTGA TCAGGTCGAT GGAGTCGTCG CCGAGCTCAC GGCACGCGAC GAGGCAGCGG ACAGCGGCCT GGGCGGCTTC GGCGGGTCGG GTGCGGTGCG GCTGAACATG CTGGCGGACG GCCTGCTCGT CCTGCACGAG CGCATGGAAC GGACGCTGTG CCTGCTCGCC TGTCTGCCGG TCAGCTGGAT TCAGTTCCGT GAACGAGCGG TGGCCTCCAT GCGTGACCGG TTCTCGGCGA TCATGCCACT CAACCGGCTG GACGACCCGG AGATCACGGC CAGGATCATC GAACGCCGGT TCCAGGAGGA CTTCGACCGC ATCGGCTTCG CCCCGCACTA TCCGACCTGG CCGGTAGCCC GGTCAGCCTT CGACCGCGCG CCCCAGTACA CCCCGCGCGA GCTGTTGCAG CGCATCGACC GGCATATCCA GGAGTGCTTG TCCACGGGCG TCGTCCGTGA ACTGTTCGAC TTCGCCGCCG CGGACGAGAA TGACATCGTC GATGTTCAGG ACAACGGTGA CCACGACATC ACCGACATCA CCGTTAGCAC GAATTTCACC GATGGCACCG ACCTGGACGG CGAGCGCCTG GATCGGCTGG ATCGGCTGGG GCGCGAGGAC CGGAGCGGCG GTGGACGCGG CCGAGCGGCT GACGAGGTCG GCGCGACCAG TCCTGAGGTG CTCAGCGCGC TCGACGCCCA GTTCGCCGAG CTACGAGCAG GCGCGCCGCG CGTCGCGGCA GTCAACGAGG ACATCCTGGT GCCCGTGCTG CTGACCGCCG GACTGTCCGC GTGGATCATC GAGAACGGCG GGGCGCACGA CTACAGCCTC GAACCGATGC CGAGCACACG CCCAGCGCTG CACGCCCGGC TGACACGCAC TGTCGACGCC GACACCGGCG AGCAGCTGCA CTGGTGCTTC CGGGCAATCA CAGCCAGCCA CGGCAACGCG ATTCTGCCGC GTATCACCGC GGCCTGCACC GCCGCCGCCA TGGTCGCCGG CAACCCGCGG CGACGGCTGT TCCTGCTACG GAACGATCCG TGGAGTCCAG GGAAGAAGGT GCAGCAGGCC CTGGCCGCGT TCAGCGCCGC CGGCGGCCGT 
ACCCTGCGGT TGCACGCGGG CGACCTGGAC ACTTTCATGG CGTTGCGGAC CCTGCTCGAC CAGCGGCCAG CCGGGCTGCG CGCGTGGCTG GCCTCCCGGC GCCCCGCCAG CGGCACCTGG CTTCTGCGGG AGGCACTCGG CGACGCGGCC GGCACCGACC CGACCGGCTC CGCCGCCCTG GCGCCCCGCG CCCCCACCAT GACGCCCGGT CCCGGCCGCC TGGGCAGCGC TGGTGACCCT GACCCTGACC CTGATTTTGA CTTTGGCCCT GAGGCCGAGC CCGGCCCCGA ATCACCGCCA CAGGTCGCGC CGGGGCCACG CTCCGCGGCA CCCGCCGGTG TTGTTCGTTC GTCATTCGCT TCGGCGGACG ACCTGGGTGG CGGTGTCGCG GCCCCGTCCG CGCTCACGCT CGGCACGGCG GTACGGGGAG ACGCGCCGGC CCGGATCGAT CTGGCCGTGC TGCGCCGCCA TGTCACGATC TTCGCCGGAT CGGGCTCGGG CAAGACGGTG CTGATCCGCC GGCTGGTCGA GGAGTGCGCG CTTCGCGGGG TGTCCTCGAT CGTGCTTGAT CCGAACAACG ACCTGGCCCG CCTCGGTGAC GCCTGGCCAA GCCCGCCGGC CGGCTGGAGA GACGGCGACG CCCAGTTCGC CGCCGAGTAC CTGGCCGCCA CGGACGTGGT GATCTGGACG CCGCGTCGCG ACGGCGGGCG GTCGCTGACC TTCCAGCCGC TGCCGGACTT CGCCGGGCTG CGCGACGACC CGGACGAGTT CGAGGCGGCG ATCGACGCGG CCGTCGCGAC GCTGGCCGTC CGAGCGAACG TCACCGGGAA CGCGAACCGA GCCAGCCTCG GACGGGCAGT TCTGACCGCG GCGCTACGCC ATTTCGCCCG CGGTAACCGG ACTGGTCTGA CCGCGTTCAT CGAGCTGCTG GCCGAACTGC CGGACGGGGT GGCCGACGAC CTCGACCCGG CGGGGAAACG CTCTGTCGAG CTGGCGCAGA CGCTACGAGC GGCGATGGTC AACGATCCGT TGTTCGGCGG AAAGGGCGAG CCGGTCGACC CTGGGACGCT GCTTGTCCCC GCGCCGGGGC AGCGTGCCCG AGTTTCGGTG ATCAGCCTCG TGGGGCTGCA ACATGATGAG CAGCGCCAGA GCTTCGTCAA CCAGCTCCAG CTCGCCCTGT TCTCCTGGGT GAAGCGCAAT CCGGCCGGAA ATCGGCCCCT CGGCGGGTTG TTCGTCATGG ACGAGGCGCA GACGTTCGCC CCGTCCGGCC CGTCCACCGC CTGCACCACG AGCACCCTGG CGCTCGCCTC GCAGGCCCGC AAGTACGGCC TTGGGCTGGT CTTCGCCACC CAGGCGCCGA AGAACCTGCA CAACGGAATC CCCGGCAACG CCACCACACA GATGTTTGGC CGGCTCAATG CCCCGGTCCA GATCGAAGCG GCCCGGATGA TGGCCCGCGC CTGCGGCGGC GACGCCCCGG ACATCGGCCG GCTCGCCGTC GGCGAGTTCT ACGCCACGAG CGAGACCCTG CCATTCATGA AGATCCGCAC ACCGATGTGC CTGAGCCACC ATCCGCCTTC GCCGCTGACC GCCGAGGAGG TCATCACCCG CGCCCGCCCG GACTTGACCG CATGA
|
Protein sequence | MSAELEALAA IKLDWVHGLE DVWRDSPVHV DGVHVEPRRR ILAALDRLGE RKPPGIVVQG QAGAGKTHLL GWVRQQVERR GGYFFLIGPV SGDAFAESVV SAMLTGLFRT VDGEAEIRRF LLALCAQFRV PAELTSAVVG DQPLEPTHLK DLVVAFFDHD RALARECQHT LRALVLYSSD DYDAQAVGET YLRSMDEQEP GERARRGMQA RTRPPMDIAR DLSRLLALTG PSVIAVDQVD GVVAELTARD EAADSGLGGF GGSGAVRLNM LADGLLVLHE RMERTLCLLA CLPVSWIQFR ERAVASMRDR FSAIMPLNRL DDPEITARII ERRFQEDFDR IGFAPHYPTW PVARSAFDRA PQYTPRELLQ RIDRHIQECL STGVVRELFD FAAADENDIV DVQDNGDHDI TDITVSTNFT DGTDLDGERL DRLDRLGRED RSGGGRGRAA DEVGATSPEV LSALDAQFAE LRAGAPRVAA VNEDILVPVL LTAGLSAWII ENGGAHDYSL EPMPSTRPAL HARLTRTVDA DTGEQLHWCF RAITASHGNA ILPRITAACT AAAMVAGNPR RRLFLLRNDP WSPGKKVQQA LAAFSAAGGR TLRLHAGDLD TFMALRTLLD QRPAGLRAWL ASRRPASGTW LLREALGDAA GTDPTGSAAL APRAPTMTPG PGRLGSAGDP DPDPDFDFGP EAEPGPESPP QVAPGPRSAA PAGVVRSSFA SADDLGGGVA APSALTLGTA VRGDAPARID LAVLRRHVTI FAGSGSGKTV LIRRLVEECA LRGVSSIVLD PNNDLARLGD AWPSPPAGWR DGDAQFAAEY LAATDVVIWT PRRDGGRSLT FQPLPDFAGL RDDPDEFEAA IDAAVATLAV RANVTGNANR ASLGRAVLTA ALRHFARGNR TGLTAFIELL AELPDGVADD LDPAGKRSVE LAQTLRAAMV NDPLFGGKGE PVDPGTLLVP APGQRARVSV ISLVGLQHDE QRQSFVNQLQ LALFSWVKRN PAGNRPLGGL FVMDEAQTFA PSGPSTACTT STLALASQAR KYGLGLVFAT QAPKNLHNGI PGNATTQMFG RLNAPVQIEA ARMMARACGG DAPDIGRLAV GEFYATSETL PFMKIRTPMC LSHHPPSPLT AEEVITRARP DLTA
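
The sequences above are displayed in space-separated blocks of ten characters, and the gene sequence as listed starts with ATG and ends with TGA, so it appears to be given in coding orientation. The following is a rough Python sketch of how such a field could be cleaned, its GC fraction computed, and its translation compared with the protein field. The helper names are hypothetical. The codon table is built from the standard codon assignments, which translation table 11 shares for elongation; alternative start codons are not special-cased here, which is a simplification.

```python
from itertools import product

# Standard codon assignments in TCAG order (third base varies fastest).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the display whitespace and normalise to upper case."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the cleaned sequence."""
    seq = clean(seq)
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE.get(seq[i:i + 3], "X")  # X marks an unknown codon
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First two display blocks of the gene sequence field.
print(translate("ATGAGCGCCG AACTGGAGGC"))  # MSAELE
# Last display blocks of the gene sequence field.
print(translate("GACTTGACCG CATGA"))       # DLTA
```

The two short examples reproduce the first six and last four residues of the protein sequence field. Passing the full gene sequence to `gc_fraction` would give a value to compare against the reported GC content.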