Gene Information |
Locus tag | Franean1_0658 |
Symbol | |
ID | 5669075 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | - |
Start bp | 770370 |
End bp | 772220 |
Gene Length | 1851 bp |
Protein Length | 616 aa |
Translation table | 11 |
GC content | 76% |
IMG OID | 641239585 |
Product | IucA/IucC family protein |
Protein accession | YP_001505023 |
Protein GI | 158312515 |
COG category | [Q] Secondary metabolites biosynthesis, transport and catabolism |
COG ID | [COG4264] Siderophore synthetase component |
TIGRFAM ID | |
| |
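The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming that Start bp and End bp are 1-based inclusive positions on the replicon and that the final codon of the CDS is a stop codon that is not counted in the protein length. The helper names `gene_length_bp` and `protein_length_aa` are hypothetical and are not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the last codon is a stop."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Values taken from the record for Franean1_0658.
length = gene_length_bp(770370, 772220)
print(length)                     # 1851
print(protein_length_aa(length))  # 616
```

Both results agree with the Gene Length (1851 bp) and Protein Length (616 aa) fields in the record.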
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.577175 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 0.913499 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | TTGACCACCG GGCTTCCTAG AGCCGCCGCG GGGCGGTGGC GCGCTGCCGG GCTGGCGCTG CTCACCCGGC TGATCGCGGA GCTCGCCTAC GAGGAGCTGC TCGTTCCGCG GGCCGAGCCG GCGACCCCGC CGCGCACCCC GCCGCCGGTC GGGCGCGGCG CGCCGGCGCC GTACCGGATC GAGTGCGGCC CCGTCACGTA CACGTTCACC GCCCGGCGCG GCACGTTCGG CACCTGGTGG CCCGACCCGG CGACGCTGCG CCGCGACGGC GAGCCGGCCT GGGACCCGGC CCGGTTCCTG CTCGACACCC GGCAGGGCCT GGGCTGGTCC GGGGACGTCC TGACCGATGT CGTGCGCGAG GTGACGGCCA CCCAGCGCGC CGACGCCGAG ATCCTGCGGA CGGCCCTGCC GGCCCGCGCG CTCGCCGACC TCTCCCACCT TGAGCTGGAG GGGCACCAGA CCGGCCATCC CTGCATGATC GCGAACAAGG GGCGGCTCGG GTTCGACGCG GCCGACGCGG CGCGGTACGC CCCGGAGTCG CGCCGCCCGT TCCGGCTGCG CTGGGTGGCC GCCCACGCCG AGCTGGCGCG GCTCGTCACC GGCCCCGGGC TGGGCGCCAC CGGGCTGCGC GAGGCCGAGC TCTCCGCCGC CACCAGAGCC GAGTTCGGCG CGGTCCTGCG CGCCGCGCTC GAGCCCACCG CCGGAGCGGA CGCCGCCGCG TTGGCCGAGC GGTACGTGTG GCTGCCGGTG CACCCGTGGC AGTGGGAGAA CGTGGTCGCG CCGATGTTCG CCGGCCCGCT CGCCACCGGG CAGCTGGTCG ACCTCGGCGA GGCGCCCGAC CGCTACCTGC CGTTGCAGTC CGTGCGCACG GTGGCGAACA TCGACACCCC GGGCCGGCGC GACGTCAAGC TCGCGCTGAT GATCCGGAAC ACGCTGGTGT GGCGCGGGAT GTCCGCCGCG GACGCCACCG CCGGACCGGC CGTCTCGGCG TGGCTGGTGT CGCTCGCCCA CGGTGATCCG GTGCTGCGGG CCACCGGCGT CGTCGTGCTG CCCGAGATCG CCGGAGCCAC CGTCGCGCAT CCCGCCTTCG ACGCGGTGCC GGACGCCCCC TACCGGCTGC ACGAGCTGCT CGGGGTGCTC TGGCGGGAGC CGGTCGCGTC CTTCCTGGCC CCCGACGAGC GGGCCCGGAC GATGGCCTGC CTGCTCACCG TCGGCGCGGA CGGCGAGTCG CTGGCCGCCG AGCTCGTCCG CCGCTCCGGG CTGGAACCGG CGCGGTGGCT CGCCGCGCTG CTGGACGCGC TGCTCCCGCC GCTGCTGCAC TACCTGTACG TGTACGGGGT GGCGTTCACC CCGCACGGCG AGAACGTGAT CTGCGTGTTC GACGCCGGTG GGATCCCGCG GCGGATCGCG GTGAAGGACT TCGGCGCGGA CATCGACCTC GTCGAGGGCG AGTTCCCCGA ACGGGCGGCG ACGGACGGCG GCGCCGGCGC GCTGTGCCGC CACTGGCCGG GCCGGATGCT CGCCCACTCG GTGCTCTCGG CAGTGTTCGC CGGCCACTTC CGCTACTTCT CGGTGATCGC CGCCGACCAT CTCGGCGTGC CCGAGGAGGA GTTCTGGTCG CTGGTGCGCG GCGCCGTGGA GGACTACCAG GAGGCCCACC CGGAGTACGC CGAGCGGTTC GCCGCGGTCG ACCTGCTGAC CCCGTCCTTC GAGCGGGTCT GCCTCAACCG GGAGCAGTTC GCCGGTGCCG GCTTCCACGA CCGCTCCGGG CGGGACGCCC AGTTCGACGT CCTGCACGGC 
ACGGTGGCGA ACCCGCTGAT GCTCGCCCCA CCGCGGCGGA GCCACGGGTG A
|
Protein sequence | MTTGLPRAAA GRWRAAGLAL LTRLIAELAY EELLVPRAEP ATPPRTPPPV GRGAPAPYRI ECGPVTYTFT ARRGTFGTWW PDPATLRRDG EPAWDPARFL LDTRQGLGWS GDVLTDVVRE VTATQRADAE ILRTALPARA LADLSHLELE GHQTGHPCMI ANKGRLGFDA ADAARYAPES RRPFRLRWVA AHAELARLVT GPGLGATGLR EAELSAATRA EFGAVLRAAL EPTAGADAAA LAERYVWLPV HPWQWENVVA PMFAGPLATG QLVDLGEAPD RYLPLQSVRT VANIDTPGRR DVKLALMIRN TLVWRGMSAA DATAGPAVSA WLVSLAHGDP VLRATGVVVL PEIAGATVAH PAFDAVPDAP YRLHELLGVL WREPVASFLA PDERARTMAC LLTVGADGES LAAELVRRSG LEPARWLAAL LDALLPPLLH YLYVYGVAFT PHGENVICVF DAGGIPRRIA VKDFGADIDL VEGEFPERAA TDGGAGALCR HWPGRMLAHS VLSAVFAGHF RYFSVIAADH LGVPEEEFWS LVRGAVEDYQ EAHPEYAERF AAVDLLTPSF ERVCLNREQF AGAGFHDRSG RDAQFDVLHG TVANPLMLAP PRRSHG
|
| |
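The gene and protein sequences above are printed in space-separated blocks of ten characters, and the record lists translation table 11. The sketch below shows one way to strip that formatting and translate the gene sequence, assuming the sequence is given 5' to 3' on the coding strand (it begins with TTG and ends with TGA, which is consistent with that). The codon table and alternative start codons are those of NCBI translation table 11. The function names `clean_sequence` and `translate_cds` are hypothetical, and an initial alternative start codon such as TTG is reported as M.

```python
from itertools import product

# NCBI translation table 11, codons ordered with bases T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}
# Start codons permitted by table 11.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def clean_sequence(seq: str) -> str:
    """Remove the block spacing and line breaks used in the record."""
    return "".join(seq.split()).upper()


def translate_cds(seq: str) -> str:
    """Translate an in-frame sequence, stopping at the first stop codon."""
    seq = clean_sequence(seq)
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = []
    for i in range(0, len(seq), 3):
        codon = seq[i:i + 3]
        if i == 0 and codon in START_CODONS:
            protein.append("M")  # initiator codon is read as methionine
            continue
        aa = CODON_TABLE[codon]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence from the record.
print(translate_cds("TTGACCACCG GGCTTCCTAG AGCCGCCGCG"))  # MTTGLPRAAA
```

The output matches the first block of the protein sequence in the record. Translating the final 51 bases of the gene sequence in the same way gives TVANPLMLAPPRRSHG, which matches the end of the listed protein.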