Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_2552 |
Symbol | |
ID | 5670946 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | + |
Start bp | 3035815 |
End bp | 3039129 |
Gene Length | 3315 bp |
Protein Length | 1104 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 641241468 |
Product | hypothetical protein |
Protein accession | YP_001506888 |
Protein GI | 158314380 |
COG category | [S] Function unknown |
COG ID | [COG4995] Uncharacterized protein conserved in bacteria |
TIGRFAM ID | |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | TTGGAACAGT TCCGAGCGCG ACTCGACAGA CGTAGCCCCG TCTTAACCGC CCGTCTGCAG ATCACGGCGA TGCGCGCTGA AGAACTTATG CGGAACACGG TGCGTACCGG TGACCGTGCC GGGCTGACCG CGGCGATCGA CCTGCATCGG CAGGCGCTGG ACATCTGTCC TTCCGACCAT CCGGCGCAGG CCGCGATACT GGCCAATCTG GCCGCCGCAC TGTATGTCCG GTTCGACTGG TCAGGCAGCA GGGCAGATCT GGATGAGGCC GTCGTCGCAG GCCGCGCGGC GCTGGCCGCC CATCGACCCG GCGATCCCAA CCTGGCCGGA TGGCTGTCCA ACCTCAGCGC GGCGTTGCGT ACCCGGTCCG AGCGGTGGGG CAGCAGGGCG GATCTGAACG AGGCCATCAG CGTGGGCCGC CACGCGGTGG CTGCCAGCCC ACCCGACCAT CCGGAAACGA TCAAACGACT GGGCAACCTG GCCGCCGCGC TGCGTATCCG GTCCGAGTGG ACGGGCAGCG AGGCGGATCT GGACGAGGCC GTGAGCGCAG GCCGCGAAGC GCTGGCCGTC AGCTCACACC ACCCGGAAAG AACCACGATG CTGGCTAACC ATGTGGCCGC GCTACGTCTC CGGTCCGAGC GGACGGGCAG CCAAGCGGAC CTGGATGAGG CCATCAGAGC CGGCCGGGCG GCGCTGGCCG CCTGGGCATC TGGCGACCCT GCGAAGGTCG CTCTGGTAAC GAACCTGGGC ACGGCGCTAC AGGCTCGGTT CGAACTGTTG GGCAACCGGG CGGATCTGGA CGAGGCCATC GGCGCCGCCC GCGACGCGGT GACGGCGAGT CCACCCGACC ATCTGGACCG TGCCACGGCG GTGGCGAATC TTGGTCTTGC ACTCCGTATC CGATCGCAGC GGACGGGCAG CGAGGCAGAT CTGGATGAGG CCGTGAGCAC CGGCCGGGCG GCGCTGGCAG CCAGCCCACC TGACCATCCC CACCGGGCCG GATGGCTGTC CAACCTCGCC GCCGCGCTGC TTATCCGGTC CGAGTGGACG GGAAGCCAGG CGGATCTGGA CGACGCCGTC AGCGCCGGCC GCGACGCAGT GGCCAGCAGC CCACCCGACC ACCCGGACAG AGGCACCCGG CTGGCGAACC TCAGCCGCGC CCTGCTGACC CGGTTCGAAC TGTTGGACAG CCAGGCGGAT CTGGACGAGG CGGTCACCGC CGGCCGCGAC GCAGTGGCCA GCAGCCCACC CGACCACCCG GGCATGGCCA GCTACCTGTC CAACCTCAGC CGCGCCCTGC TGACCCGGTT CGAACTGTTG GACAGCCAGG CGGATCTGGA CGAGGCGGTC ACCGCCGGCC GCGACGCGGT GGCCACCAGC CCACCCACTC ATCCCTACCG CGCCGCGTGG CTTTCCAATC TGGGCATCGC CCTGCTGACC CGGTTCGAAC GATCACCCAG CCAGGCCGAT CTAGACGAGG CGGTCATGGC GTGCCGCGGG GCGCTGGACG CCAGCCCGCC CGATCATCCC TACCGCGCCG CATGGCTTTC TAACCTAGGC ACCGCCCTAA GCCTCCGGTT CGAGCAGTAT GGCGATCAGG CGGATCTGGA CGAGGCGATT GCCGCGAGTC GGGCCGCCGT GGCCGTCGAG GTGGCGTCAC CTCGGCTTCG CGCGCGGGCC GCACGCGGTT GGGGACACAC CGCTGCCGAC GGGATGCGGT GGGACGAGGC CGTCGCGGGT TTCGCGGCAG CCGTTGACCT CCTGGGACGG GTGGCACCGC GTAGCCTCGC CCGCGGCGAC CAGGAACATC TACTCGCCGA GCTAGGCGGC CTGGGATCGC AAGCGGCGGC ATGCTGCGTG CACGCCGGCC TCCCGGAACG CGCGATCGAA CTCTTCGAGC AGGGCCGTGG GGTCCTGCTG GGTCAGGCAT TGGACACCCG CACCGACCTG ACCGCGCTCG CTGAGCAGTT TCCGGACCTA GCCAGACGGT TCATCGCTCT GCGTGACGAC CTCGACAGGG TCGATGGCTC GGGCACTCGA CCGGTGACGA TACCTCCCGG TTGGGACCAT AGCGTCGACA TAACACACGG CGACGTGGAG CGACGCCGGC AGCTAGCCGA CGCATTCGAC CAGACGATCG GCGAGATCCG GGACCTGCCG GCCTTCGCGC GTTTCCTGCG TCCACCACCG GTGCAGGACC TGACGGCGGC TGCGGCGAGC GGTCCGGTCG TCGTCGTCAA CGTCTCCCGG TTCGGCTCGC ACGCGCTGAT CCTGACCACC GGAGGTGTCC TGGAGCCGGT GCCGCTGGTG GCTCTAACTA CCGAGGCCGT TTACGACCAG GCGGACGGAT TCCTCGCAGC CCTCGACGAG GTGTCCTCGC CGGACGGAGG CGCCGGAGGT TGGGGTGCCG CGCAACGGCG GCTGACGGAC ACGCTCGGCT GGCTGTGGGA CGCCGCCACC GGCCCGGTCC TGGACCGGTT AGGCATCACC GGACCGCCCC AGAAGGACGA GGAGTGGCCG CGGCTGTGGT GGTGTGTGTC CGGACTGCTG TCGTTCCTGC CATTGCACGC GGCCGGCCAC CACGCCACCC GCTTCGATCC CGCCCCGAAA ACGGTGGTTG ACCGAGTGGT GTCGTCCTAC ACCCCGACCA TCCGGGCGCT GATCTACGCC CGCCGGACCC ACTCAGGCGA CAGGGACGTC GACCGAGGAC GGCTCGGTTC GGACAGCCGG CTGGTGGTGG CGATGCCGCA CACCCCTGAG GCCGCTGATC TTCCCGGTGC GGATGCGGAG GTCGCCATGC TCCAGCAGCG TTTCCCGGAC CGGACCAGCA CGCTTGTCGG ACCTCAGGCC ACCCGCGAGG CGGTGCTTGC CGCGCTGCCC ACGGCGGGGT GGGCGCATCT CGCCTGCCAC GGGTCGAGCG ACCCCAGTCA CCCTTCTGCC AGTCGGCTGC TCCTCCAAGA CCACCGGCAG CAGCCGTTGA CCGTGGTCGA CGTGGCCCGG CTGCGTCTGG ACGATGCTCA GCTGGCGTTC CTGTCGGCCT GCTCGACAGC CCGCCCAGGT AACCGGCTGG CCGACGAAGC GATCCACCTC GCCTCGGCGT TCCAGCTAGC CGGCTACCGA CACGTGATCG GCACCCTGTC GCCGATCAAT GATCGGCACG CCGCGACCCT CGCCCGCGAT ATCTACACCG CCCTCGATGA CGCCGACGGC GTCATCGACG CAGCCGCCGC GCTGCATGCC GCGACCCGCC GGCTGCGCAA CCGATGGGCA CACATGCCGT CGGTGTGGGC GTCGCACATC CACAGCGGCG CCTGA
|
Protein sequence | MEQFRARLDR RSPVLTARLQ ITAMRAEELM RNTVRTGDRA GLTAAIDLHR QALDICPSDH PAQAAILANL AAALYVRFDW SGSRADLDEA VVAGRAALAA HRPGDPNLAG WLSNLSAALR TRSERWGSRA DLNEAISVGR HAVAASPPDH PETIKRLGNL AAALRIRSEW TGSEADLDEA VSAGREALAV SSHHPERTTM LANHVAALRL RSERTGSQAD LDEAIRAGRA ALAAWASGDP AKVALVTNLG TALQARFELL GNRADLDEAI GAARDAVTAS PPDHLDRATA VANLGLALRI RSQRTGSEAD LDEAVSTGRA ALAASPPDHP HRAGWLSNLA AALLIRSEWT GSQADLDDAV SAGRDAVASS PPDHPDRGTR LANLSRALLT RFELLDSQAD LDEAVTAGRD AVASSPPDHP GMASYLSNLS RALLTRFELL DSQADLDEAV TAGRDAVATS PPTHPYRAAW LSNLGIALLT RFERSPSQAD LDEAVMACRG ALDASPPDHP YRAAWLSNLG TALSLRFEQY GDQADLDEAI AASRAAVAVE VASPRLRARA ARGWGHTAAD GMRWDEAVAG FAAAVDLLGR VAPRSLARGD QEHLLAELGG LGSQAAACCV HAGLPERAIE LFEQGRGVLL GQALDTRTDL TALAEQFPDL ARRFIALRDD LDRVDGSGTR PVTIPPGWDH SVDITHGDVE RRRQLADAFD QTIGEIRDLP AFARFLRPPP VQDLTAAAAS GPVVVVNVSR FGSHALILTT GGVLEPVPLV ALTTEAVYDQ ADGFLAALDE VSSPDGGAGG WGAAQRRLTD TLGWLWDAAT GPVLDRLGIT GPPQKDEEWP RLWWCVSGLL SFLPLHAAGH HATRFDPAPK TVVDRVVSSY TPTIRALIYA RRTHSGDRDV DRGRLGSDSR LVVAMPHTPE AADLPGADAE VAMLQQRFPD RTSTLVGPQA TREAVLAALP TAGWAHLACH GSSDPSHPSA SRLLLQDHRQ QPLTVVDVAR LRLDDAQLAF LSACSTARPG NRLADEAIHL ASAFQLAGYR HVIGTLSPIN DRHAATLARD IYTALDDADG VIDAAAALHA ATRRLRNRWA HMPSVWASHI HSGA
|
| |