Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_1855 |
Symbol | |
ID | 5670257 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | - |
Start bp | 2225925 |
End bp | 2228408 |
Gene Length | 2484 bp |
Protein Length | 827 aa |
Translation table | 11 |
GC content | 75% |
IMG OID | 641240776 |
Product | hypothetical protein |
Protein accession | YP_001506199 |
Protein GI | 158313691 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.152269 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 4 |
Fosmid unclonability p-value | 0.00336297 |
Fosmid Hitchhiker | No |
Fosmid clonability | decreased coverage |
| |
Sequence |
Gene sequence | GTGATCCACA TACACAATCG GGGGCCCAAA GGGGACCATT TGGACAATTC CGCGGAACGA TCGTGGACCA TGGGTGCCGT GACGGCACCT GGACAGCTCC ACGCCTCCGG ACCCCGCGGC GCACTGGAGG CCCCTGGCCG ATCCCGACCC GGCGACGGCG ACGCCTGGAC CACGCCCGCC GTGAGCCCGG CCACCGGCGA CGACGAGAGC GGCTCCGGCC ACCGCGCCGC CCTGGGGCCG GCCGCGGGCC CGTCCGAGCT CGAGCCCGGC TTCGAGGCCG AGCCCCGCCC CGGGCTGAAC CGGCGCGGGC GCTTCCGCCG GCGGTCAGTT ACCGGACGGC GCTCCCCGCT CACCCGGCAC TCCGTGTTCG GAGTTTTCCT GCTCGCCGGC GTCGTGCTGC GGCTCGTCAC CACCTACGCC TACCGGCCCG TCTTCGAGTT CAACGGCGAC TCGTACGCGT ACATCCGGCT TACCCGGCTG TCAGAACCGG ACCCGATGCG CCCGGCCGGC TACCCGGCGT TCCTGCGCGT ACTGACCGAG ACCGGAGCCG ACCTGTGGGT CGTCCCCCTG GTACAGCACG TGCTCGGGAT CCTGCTCGCG ACGGCCCTCT ACGTGCTGCT CCTCCACCGC AGGGTGGCCC CGCCGATCGC CGCGCTCGCC ACCGCACCGA TCCTGCTCGA CGCCTACCAG ATCGTGATCG AGCACTTCGT GATGGCCGAG ACGCTCTTCG CCGTCCTCCT CGTCGCCGCG GTCGTCGCCC TGATGTGGTC ACGCCGGCCG TCTGTATGGG CGTTCGCCCT GGCCGGCCTG CTCCTCGGCG CGTCGGGCCT GGTGCGCACG ATCGGCGTGG CTATCGGCCT GCTGGCCTTC GGCTACGTGG TGCTGCGCCG GGTCGGCTGG CTGCGGGTCG GGGTCTTCGC CGTCTTCCTG GCCGCGCCGC TGATCGCGTA CGCGTCCTGG TTCCAGTCCG CGCACGGGAA GATCGGGCTC ACCGGGGGCG ACGCCGCCTG GCTGTACGGC CGGGTCGCCC CGATCGCCGA CTGCGGCCAG CTCGACCTCG AGCCCAGCCA GCTCTCGCTG TGCTCCCCGC ACCCGGTCGG CGAGCGGCCC GACCCGAGCT ACTACGTGTG GGACCGCAAC AGCCCGAGCA ATCAGCTCGA CGTCCCCGTC GACGAACGCG ACCGGCTGCT GAACGACTTC TCGCGGCAGG TCATCACCCG GCAGCCCGTT GACTACCTGC GCATGGTCGG CTCCGACATC GCGCACTACT TCGAGCCGGG ACGCCGCGTC GGGCCCCGGG ACTGGCCGGA CGCCACCTGG CGCTTCCCGA CCGCGGACGA GCCCCGCTAC CTGCACAACG ACGAGCCGCT GCTGGGCCTG CACGGGGAGG CCGACCGGCC GGATCGGACC GTGATCGAGC CGTGGGCCGA CTATCTGCGG GCCTACCAGA GCCGGGGTTT CACCCCCGGC CCGGCGCTCG CCGTGGCCGG TGTCCTCGGC CTGCTCTCCT GCCTGGCCGC GCTGCCGCGG GTCGTCCCGG CGGGCCTGCG CGGGGGTGAC GGGCGTGCCC GCTGGCACGA CCTGACCGCG GAACGCCGGC GCACCGGCGC CGACTGCCTG TTCCTGGTCG CGACCGGGGC AACGATGATC ATCGTGCCGG CGGCCACCGT CTGCTTCGAC TACCGGTATC TGCTGCCAGC GCTGTTCCTG CTGCCGCCGG CAGCCGCGCT CGCCGTCCAC CAGGGCCACC TGCTGGTCGT CGCGTGGCGT GAGCGGCGCG AGGCCGCCGA GACTCCCTGG CGTACGCCTC CCGGCCCGAC CGGCCTCGGC GCGGCCGACC CTGACACCGA CGGTGACCCG ACCGGTGCTG ACACGCTCAG CGCCACCGAG ACCGGCCCTG ACGGCGGCGG CGCGACCGGT TTCGGCGCGC CAGGCGATCG CGCCGATCCG GACGCCCCGT TCAGCCTCGG CGGTCTCGGC GCGCCGCCGG GCCGCGTGGC CCCGGCAGGC CACGGCGATC ATCGCGAAGG CAGGCCCACG AGGCCGGCCA CGCCGGCCAG ACCCATCCCG CCGACGACAC CGAACCCGCC GGCGCCGATT CCGCCGACGG ATCCGATTCC GTCAGCGGCG TCGGCCAGGC CGGCCCCTCT CCGGAGCGGA CCGAACCCGG GCGCGGCGGC GGCACGGCGG CAGAACCCGT CCGGCACACC GCCGCTGCCG AAACGTGCGC CCGGGGTCAC GCTCGAGGCG AGGAACCGCC GATCGCGTCC CGCCGCAGCC GGCGGGGCCG CCCCTGAGAC GCCGTCCGGA CGAAGCACGC CGCAACGCCG GCCCGTGAGC CTCGGCCCGG ACGCCCGGAT GCCCACCCCG GGCAGCCGAC CCGCGGCGCG CGGCCCGGCC GCGGCGAACG CCGGGGACGA CCCGACGGCG CCGCCCACGA AGGTCGAGCC CGACGCCGGT GACGATCCGA CGTTCCCCGG TTGA
|
Protein sequence | MIHIHNRGPK GDHLDNSAER SWTMGAVTAP GQLHASGPRG ALEAPGRSRP GDGDAWTTPA VSPATGDDES GSGHRAALGP AAGPSELEPG FEAEPRPGLN RRGRFRRRSV TGRRSPLTRH SVFGVFLLAG VVLRLVTTYA YRPVFEFNGD SYAYIRLTRL SEPDPMRPAG YPAFLRVLTE TGADLWVVPL VQHVLGILLA TALYVLLLHR RVAPPIAALA TAPILLDAYQ IVIEHFVMAE TLFAVLLVAA VVALMWSRRP SVWAFALAGL LLGASGLVRT IGVAIGLLAF GYVVLRRVGW LRVGVFAVFL AAPLIAYASW FQSAHGKIGL TGGDAAWLYG RVAPIADCGQ LDLEPSQLSL CSPHPVGERP DPSYYVWDRN SPSNQLDVPV DERDRLLNDF SRQVITRQPV DYLRMVGSDI AHYFEPGRRV GPRDWPDATW RFPTADEPRY LHNDEPLLGL HGEADRPDRT VIEPWADYLR AYQSRGFTPG PALAVAGVLG LLSCLAALPR VVPAGLRGGD GRARWHDLTA ERRRTGADCL FLVATGATMI IVPAATVCFD YRYLLPALFL LPPAAALAVH QGHLLVVAWR ERREAAETPW RTPPGPTGLG AADPDTDGDP TGADTLSATE TGPDGGGATG FGAPGDRADP DAPFSLGGLG APPGRVAPAG HGDHREGRPT RPATPARPIP PTTPNPPAPI PPTDPIPSAA SARPAPLRSG PNPGAAAARR QNPSGTPPLP KRAPGVTLEA RNRRSRPAAA GGAAPETPSG RSTPQRRPVS LGPDARMPTP GSRPAARGPA AANAGDDPTA PPTKVEPDAG DDPTFPG
|
| |