Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_2728 |
Symbol | |
ID | 5671119 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | - |
Start bp | 3225870 |
End bp | 3229160 |
Gene Length | 3291 bp |
Protein Length | 1096 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 641241640 |
Product | hypothetical protein |
Protein accession | YP_001507060 |
Protein GI | 158314552 |
COG category | [S] Function unknown |
COG ID | [COG4995] Uncharacterized protein conserved in bacteria |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTTCGGTG GCGAATCGCC TACGGCCGTG CTGACTGATG AGTCCGCAAC CGAGGTACCC GGGCTGTTCG ACGAGTTGTA CGCGACGCCG AAGCGACTGG AGGACCTCGG CGCGCCCCTC AGACGGTTCG CAGAGGCCAC GGGAACGGAC GGCGACGGGA TGCGCCAGGC TTTCGCCTTC TTCCTGACGG CTCACGGCTA CGCCCCTCAT CTCGTCCCGG AGTGCCTCCG CGCATTTTTT GCGGAGATAG CGGCCCATAC GGACGTGTCA GCCGGGCAGG GCGATCTGAT GCAGGCTGAG GCGCTCCTCG CGCGCGCTGA GCGAGCCGAT GCTGGCGGGG TGATCGGCCT GTTGGACTCG GCGGTCCGGA TGTTGCGCCG GGCCGTCGCC AACTTCGGCA TTGCTGACTC CGGTCGGTTC AACGCGGTCG GCCTGCTCTG CCTTGCGCTT CGATGGCGCT ACCGCCTGAC GGGTCGGGGG GAGGACCTGT CCGAAGGTAT CGACCTCCTG CGTGCGTCGA TCTCGGAGTG GCCGGACGAT CCTCGCCGTG GGGCGCTCCA CGCCGAATTG AGCGTCGCGC TGCGGCTGCG CGCCGAACGG ATCGGCGACC GAGCCGACCG GGACGAGGCC GTCGAGCAGG GCCGTCTGGC GGTCGGCAAG GCGCAGGACC ATCAGTGGGT CGTCGCGGAC GCGCTCGCCG CGTTGGGAGC CGCGCTGCTG ACACGGGCCG CGTGGATCCT CAGTCCTGAC GATGTAGAGG AAGCGGTAGC GGTCGCGCGG CAGGGAATAG TAGTAGCAGC CCCCGACCTT AATCGACTGG GCTACGCGTT AGAACGGCTA AGCGTGGCAC TACATACGAG ATTTGATACA TTCGGTGACG TCGCGGACCT GGTCGAGAGC GTGGAGGTCG CCCGCCGCTG CGTCGAGGCG ACCGAAGACG CGTCGCTGGA CGCGGTGGGC CGACGGTCAC AGCTGTGCTT CTGCCTCCTG GACCTCGCGC AGACGACCGG GTCAACAGAA CTGCTCGACG AAGCGATCGC CCAGGGTCGC GCGGGCATAA ATGCGCTCAC GGCGTTCCAT CCACGTGCCG GGCTACTGTG GGTTAATCTT AGCGACTGCT ACCGCACCCA CTATGAGGTC ACGCGTTCGG TGACCGCCGT CGAGGCAGCT GTCCATGCAG CGCAGGAAGC GCTCGCGGTA ACTTCGGCCG GCCATCCAGA CCATCCCAAA GCGCTGTTCG CGCTGGCCAG CGCGCTGTGG TGGCAGTTCG AGCGAATGGG CGACACCGTA GATCTAGACG CGGCGCTCGA CTATGCCCGG CAGGCACTGG ACGCGACGGC ATATGAGCGG CTCGACTATC CTAAGATTGC TGGGTTGCTC GGACAGCTGG CTGCGGCCGC GGCAGGTCCC GCCAGCATCG ATGCGTTCGC CGAAGCAGTC GCCGCGGGAA GATCGGCAGT CGCGGGCATC CCTGACGGAC ATCCTGACCT TCCGCTCTAC CAGTCGAGCC TGGCCGCGGT TCTCCGCCGG CGAGCGGATG CGATCGGACA GCTGTCGGAT CTCGACGAGT CGATCTCCCT GGACCGAGCC GCAGCGGATC AGGTTCCCCC TGGCCGCCCC GATCGCGCAC GGCTCCTCTG CAATCTCGGA AACACCCTGC GCCTGCGTTT CCGAGTCACC GGGGACAACG AAGATCTTCG GGATTCCCTG CACTTCTTCA GGACGGCCTT GAACACGGGG ACGGCTCTGC CGTCCACCCG CCTGCAGGCA GCCAGCAGCC TCGGACGACT GGCCGGCGAG ACGGGACAAT GGGCCATCGC CGCCGACGGC TACGCCACGG CCATTGGCCT GCTGCCGGTG GTGTCCGCGC CAACCCTCCT TCGAACCGAC CAGGAATTCG TCCTCAGCCG GCTACCCGGT CTCGCCGCCG ATGCGGCGGC GTGTTTCCTT CGCGCCGGCA TGCCGGAACA GGCTGTCCAG CGTTGGGACG AGGGCCGCGG GGTCCTGCTA GCTCAGGCTC TCGGTATGCG AAGCGACCTC TCCCGACTCG CGGACCATCA TCCCGAGGTC GGCCGACGCG TCGGCGCTGC CTGGGGCCGT GCCCTTCGCA ACCAGGAACC GCGGCCATCG ACCGACGCGG CCAGTGACCT CAAAGAGCTG ATCCAGGAGG TCCGCGGCCT GCCTGGATGG GAGGGCTTTC TCGCGCCGCC GACGCCGGTC GACCTCGACA TGGCAGCCCG TGGTGGCCCA GTCGTCGTGG TCAACGTCTC CCAGTTCGGC TCAGACGCAA TGATCATCAC CGCTGCCGGT ATCAACACCG TGCCGCTCCC AGGACTCACG CCTATTGCTG TGCAAGACCA GGTCATCAGC TTCCTGCAAG CCTCGAATGG CGCCACCGCC GATCCCGACT GCGAAATGCA ACAGACGCTC TCGTGGCTGT GGAACGCGAT CGCCAGTCCA GTACTCGACA GGCTCGGTGT CAGCGCGGTG CCCGATGCCG GTGAACCGTG GCCTCGGCTG TGGTGGTGCG TCTCCGGGCT GCTGTCGTTC CTGCCGCTGC ACGCAGCCGG TCACCACCAC ACAAGGTTCG ATCCTGAGCC ACAGACCGTC ATCGACCGGG CCGTGTCCTC GTACGCACCG ACCATCACCG CACTCATCGG CGCGCGAAAC AAACGCACCG CAGCGCCTAG CGTTGGCAAC GCTCCGGACG TCCTCGTCGT CGCTCAACCA CAGACCCCCG GCCAATGCAG CCTGCCCGGC GTACTCACGG AAGTCGCGGA CCTTAAAACC CGTCTGCCAG GCCGGATCAC CGAGCTGACC GGAGCGCACG CTACTCGGGA AACTGTGCTC CGAGCCCTTA CTTCCTCTCA CTGGGCCCAC TTCGCCTGCC ACGGGACCAG CGACCTCGCC AACCCGTCGT CCAGCAGCCT GCTCCTGCAC GACTATCAGA CCTCACCACT AACAGTGATC GACCTCATGC AGCTCCAACT CGATCAAGCG GAACTAGCAT TCCTCTCCGC CTGCGAGACA GCCCGCCCCG GCGCACAGCT CAGCGACGAA GCCATTCACC TCACCGCCGC CTGCCAGCTC GCCGGCTTCC GCGACGTCAT CGGGACGCTC TGGCCAATCA ACGACCTGGC CGCCGTCGAA TTCGCCGATG CCTTCTACAG CGCCCTCCTG AACGAGTCCA CCGATGTCGC AGCCGCCGCT CATGCCGCAA CACGGCACAC CCGCGAGACA TGGATCGACC AGCCCTCTCT CTGGGCTGCC CACGTGCACG TCGGCACCTG A
|
Protein sequence | MFGGESPTAV LTDESATEVP GLFDELYATP KRLEDLGAPL RRFAEATGTD GDGMRQAFAF FLTAHGYAPH LVPECLRAFF AEIAAHTDVS AGQGDLMQAE ALLARAERAD AGGVIGLLDS AVRMLRRAVA NFGIADSGRF NAVGLLCLAL RWRYRLTGRG EDLSEGIDLL RASISEWPDD PRRGALHAEL SVALRLRAER IGDRADRDEA VEQGRLAVGK AQDHQWVVAD ALAALGAALL TRAAWILSPD DVEEAVAVAR QGIVVAAPDL NRLGYALERL SVALHTRFDT FGDVADLVES VEVARRCVEA TEDASLDAVG RRSQLCFCLL DLAQTTGSTE LLDEAIAQGR AGINALTAFH PRAGLLWVNL SDCYRTHYEV TRSVTAVEAA VHAAQEALAV TSAGHPDHPK ALFALASALW WQFERMGDTV DLDAALDYAR QALDATAYER LDYPKIAGLL GQLAAAAAGP ASIDAFAEAV AAGRSAVAGI PDGHPDLPLY QSSLAAVLRR RADAIGQLSD LDESISLDRA AADQVPPGRP DRARLLCNLG NTLRLRFRVT GDNEDLRDSL HFFRTALNTG TALPSTRLQA ASSLGRLAGE TGQWAIAADG YATAIGLLPV VSAPTLLRTD QEFVLSRLPG LAADAAACFL RAGMPEQAVQ RWDEGRGVLL AQALGMRSDL SRLADHHPEV GRRVGAAWGR ALRNQEPRPS TDAASDLKEL IQEVRGLPGW EGFLAPPTPV DLDMAARGGP VVVVNVSQFG SDAMIITAAG INTVPLPGLT PIAVQDQVIS FLQASNGATA DPDCEMQQTL SWLWNAIASP VLDRLGVSAV PDAGEPWPRL WWCVSGLLSF LPLHAAGHHH TRFDPEPQTV IDRAVSSYAP TITALIGARN KRTAAPSVGN APDVLVVAQP QTPGQCSLPG VLTEVADLKT RLPGRITELT GAHATRETVL RALTSSHWAH FACHGTSDLA NPSSSSLLLH DYQTSPLTVI DLMQLQLDQA ELAFLSACET ARPGAQLSDE AIHLTAACQL AGFRDVIGTL WPINDLAAVE FADAFYSALL NESTDVAAAA HAATRHTRET WIDQPSLWAA HVHVGT
|
| |