Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_1241 |
Symbol | |
ID | 5669654 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | + |
Start bp | 1491422 |
End bp | 1495720 |
Gene Length | 4299 bp |
Protein Length | 1432 aa |
Translation table | 11 |
GC content | 78% |
IMG OID | 641240173 |
Product | putative PAS/PAC sensor protein |
Protein accession | YP_001505601 |
Protein GI | 158313093 |
COG category | [K] Transcription [T] Signal transduction mechanisms |
COG ID | [COG2208] Serine phosphatase RsbU, regulator of sigma subunit |
TIGRFAM ID | [TIGR00229] PAS domain S-box |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 3 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 4 |
Fosmid unclonability p-value | 0.00402812 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGACCACGC CGTACTCCGC GCAGCCCGCC GGCCGGCCGT CCGCCGGCCC TGACGTGCAG GCCCATGACG CCACTGGCGA CGGGCAGTCC GGGACGGGTG GTCCGGACCC GACGGGTCCC GGCGGTCGCG ACGGTCTCAA CGGTTCCGGC GGTCTCAACG GTTCCGGCGG TTCCGGCGGT TTCGGCGGTC TGAGCGGTCC GGGTGGTCTC GGCGGCCCGG GTGGTCTTGG CGGGTCGGGG GACGTCGGCG GGCCGGGTGC GCTTGGCGGC CTGGGCGGGC TCGGCGCTCC CGGTTCCGAG TGGGCGGGTG GGAACAGCTG GGCCGGCGGG CCGGTCGGCT GGCCTTCTTA CGCTGACGTC ACCGGCGGGG CGATGCCCGC CAGCCCGCGG GGGACGCCCT CCGAATGGGA GCCCCTCGTG GGGGACGGCG CCGCGCCGGC GCCGGCGTCC TCGATCGGCC TGCCGCCGCC CCCGCCCGGC TCGCCGCCGC CGGGCGAGCC GCCGTCCGCC GGGTCCGGGG GAGTGGGACA ACTACCCGCC GCAGGCGTTC CCGCCGGTTC ATCGCCGGAC ATCGTGCCGC TGGCCGGGCT GATCCCGGGG CCGCCGCTGT CGGCGTCACC GCCCGCGGGC GAGAACACGC CGGCGCCGCC GGTGCCCCCG CCGGGGGAGT CCGCTCCGCC GGCACCACCG CCGACCGTGG CCGCGCCGTC GTCCGAGTTC CCGCCATCGG GGTCCACTCC TGCAGGGCAG GCCGCTGCCG GGCAGGAACC GCCCGGACAG GAGCTGTCCG GGCCCGCGGT GGACGGGCCT GCGCCGTCCG GGCGGGTTCC GTCCGCGCAC GGGCAGGCCG GGCCTGTGGC GTCCGCGTCG CCCGGTGGCG AGCGCGGTTT CCCCGGCGAC TTCGGGCCCG GTTTCGCGGT CACACCCGGC TTCCCACCCG GGCAACCGCC GCACGGGTCG GCGTCGTCTG ACTTTGGTAC GCCCTGGGGG CCGCCGGCCC TCGAACCCGG CCCCGGGGCC GGCCTGGGCG GTTCGGGCGG AGTCGGTGCC ACCGGGGCGG GCCACGACGG GCCGGTCCCG GCCGGGCCGT CCCACCCGGG CGGTGGCGGG GCGTCCGAGG GCGCGGATCT CGCCGGGCTG GCGCTGCGGG AGGCCCTGTT CGACCAGGCC CCCGTCGGCC TCGCCCTGTA CGACGCCGGC GGCCGCTACA TCCGGGTGAA CGACGTCCTC GCGCGGCTCA ACGGCCGCCC GGCCGCCGAG CACCTGGGCC GCACTATGTC CGAGCTCCTC GGGGAGATCG GCCAGGAGAT GGACGGCCTG CTGTCCCGTG TCCTGCGCAC GGGCGAGGCG GTGACCGACC TCGAGATCGG AGTCGCCACC GGCGGCGCCG GCCCGAACCA GACCTGGTCG GCGAGCTGGT ACCCGGCCAC CGACCGGCTC GGTGCCCGCG CCGGCGCGGT CCTCGTCGCC CTCGACGCGA CCCGCGCGAA GACCGCCGAG CGGGACCACG CCCGCGCCGT CGGCCGGGAG CGGGCGCTGG GCGAGGCCAC CGCGGCGGAC GTCTTCCACG CCGGCGAGGA CGGCGCGCTC GACACCGACC TGCCCCGCTG GCGCGCTGCC ACCGGCCAGA ACGGCGCGCA GGCCGCCGGT TTCGGCTGGC TGGACGCGAT CCATCTCGAT GACCGGGAGC GGGTCGCCCG GGCCTGGCAC GGCGCGATCG AGCGCCGCGA GCCGTTCGAC GCCGAGTTCG GTGTCCCGGA CGTCACCGGG ACACACCGGA CGATCACCGC CCGGCTGGTG CCCGTGGTGG ACGGGCCGCA GGTCGAGTGG ATCGGTGTGC TGACCGACGT CACCGAGGCC CGTTACGCCC GCGACCCGCG ACCCGCGCAG GACGGCGTGC CGGACGAGGC AACCTGGCGC CGTGAGCAGT CGTGGCGGCT GACCTCCGCC CTCGGCCGGG CGGTGACCGT CGACGACGTC GTCGCGGCGG TGCTCGACAG CGGCGGGCGG GCCGCGCGCG CCGTGGGCCG CGGCGTGGCA CTGATCGACG AGAGCGACGA CCGGCTCCGG TTCCGCTCCC TGGCGGGCAC CTCGGACCAG TACGCCGAGC GCTGGTCGGA GGTCGGGCTC GGCGCCATCC ACCCGCTCGC CGAGGTCATC CGCGGCCGGC GGCCGCTGTT CCTGCGTAGC CGCGACGAGC TCGGCGGGCG CTGGCCCGTT GCCGACCTGC TCGGCGCGGT CGAGTCGGGG CCGGAGCACG CCTGGGCGCT GCTGCCGCTC GCGACGACCG ACGTGCCCTT CGGCGTCCTC CACCTCGGGT TCCCCTCGCC GCGCGAGTTC GACACCGACG ACCAGGCGTT CCTGATGGCG ATGGCCGAGC AGTGCGCGCA GGCTCTCGAG CGCGCGACCC TGTTCGAGCG GCTGGCCGCC GACACCGCGC GCAGCCGGCG GGACCGCGAC GAGGCCGAGG CCGAGCTGAC GGCGGCGCTG GCGGCGGCGG ACGTCCAGCA GTCGGCGGCG GGCGCCGCGC GGCGCAGGCT GGAGCAGCTC GCCCGTGCGG GCGAGGCGGT GGCCTCGGCC ACCAGCCCGG AGCGTGCCCT GTCGGCGCTG GCCGCGGCGG TTGCCGGGGA GATCGCCGAC GTCTGCGTCA TCCACCTGCT GCCGCCCGCG GACGAGACCG AGACGTCAGC AGCGCCCGGG TCGCCCGGAT CGCCGGGGTT GCCTGGGTCG CGTCCGGCGG TGTTCGTGGC CCGTGACGGA CTCACCGCCG CGGCGCCGCC CGGTGCTCCC GCGCTCGTCC TGCCGGGGGC GGGCCCGCTC GCCGCGGTCG CCGCCGGCGC CGCGCCCGTG CTCCTGCCCG CCAGCCCCGC CGCCCCGTTC CTGGACGGCG CCCTGACCCC CGAGCTGGCC CGCTGGGTAC GGGAGGCCGA GGGGCACAGC GCCGCGGCGG TGCCGATCAC GTTCCGGGGT CGCCCGGCCG GCGTGCTGAC GGCGATCGCC GCCGGTGAGC GCCCCGCCTT CACCACGGAC GACCTGCCCT TCCTGACCGA GGTCGCGGCC CGCACAGCGC CCGCCCTGGA ACGCGCCGAG GCCGGTGGCC GCGACGGTGG CGACGCGCTG GCGCTCCAGA AGGCGCTGCT GCCGCGCACC CTGGCCCCGG CGGGCCTCGA CCTGGCCACC CGGTACCTGC CCGCGGGCGG GGACGACCAG GTCGGCGGCG ACTGGTTCGA CGTCATCGAC CTCGGCGCCG GGCGGGTGGC GCTGGTCATC GGCGATGTGA CGGGACGCGG TGTGCGCGCT GCCGCTCTCA TGGGGCAGCT CCGCAGCGCA GTGCGCACCT GCGCCCGGCT GGACCTGCCA CCGGCCGAGG TCCTCATCCA ACTCGACGGG CTCGTCGCCG ACCTCGGCGA GGACCTGATC GCCACCTGCA TCTACGCCGT CGTGGAGACC GACACCGGAC TGCTGACCCT CGCCAGCGCC GGGCACCCGC CGCCCCTGGT GGTCGCCCCG GACGGCCTGG TCTCCCGGCT CTACATGGCG GCTGCGACCC CGCTCGGCCT CGCCTGCGAC GCGATGACCG AGTACACCGT CTCGCTCGGG CCCGGCTCGC TGCTGGCCCT GTTCACCGAC GGACTCGTGC GCGGGCGCGG GCTGGACATC GACGCCGGGG TCTCGAACCT GGCCGCCGTC CTGGCCCGCC CGGCCGAGGA GTGGGACGGC CGGCTCGACG ACCTGGCCCG GGCCGCCTGC GCGGCGCGGG TCGGCGCGGG CGGCCCGCCG CCCGGCGACG ACGTCGCGCT GCTGCTGGCC CGGCTGCCCG TGCCGGACCC GCTGGCCGAG CCGCTGGACG TCTCCGCCGA TCCCTCTGTG GGGCTCAGCC AGGTCCGCGC GCAGGTGCGG GTGGCGCTGG AGAACGCCCT GACAGAGCCG CCGGTGATCG ACACCGTGGT GCTCGTGCTC TCCGAGCTCG TCGGCAACGC GCTGGTGCAC GGGCGGGCCC CGCTGTCGGT CCGGGTACGG CGGATGGCGG CGTCGAGCCC GGCGGGTGCC GGTGCGGCGG GCGGCGGTTC GGCAGGTAGC GGTTCGGCAG GTAGCGGTTC GGCAGGTAAT GGTGCCCGGC GCATCCTGGT CGAGGTGGGT GACGCCGGCG GCCGGATGCC CCGCCGCCGC CGGGCCGGTC CCGACGACGA GACCGGCCGT GGGCTCGATC TGGTCGGCCG GCTCGCGCTG CGCTGGGGCG TGCGGCCCGT CGGCGACGGC AAGCTGGTGT GGGCGGAGAT CGACCCGAGC CGGGTCTGA
|
Protein sequence | MTTPYSAQPA GRPSAGPDVQ AHDATGDGQS GTGGPDPTGP GGRDGLNGSG GLNGSGGSGG FGGLSGPGGL GGPGGLGGSG DVGGPGALGG LGGLGAPGSE WAGGNSWAGG PVGWPSYADV TGGAMPASPR GTPSEWEPLV GDGAAPAPAS SIGLPPPPPG SPPPGEPPSA GSGGVGQLPA AGVPAGSSPD IVPLAGLIPG PPLSASPPAG ENTPAPPVPP PGESAPPAPP PTVAAPSSEF PPSGSTPAGQ AAAGQEPPGQ ELSGPAVDGP APSGRVPSAH GQAGPVASAS PGGERGFPGD FGPGFAVTPG FPPGQPPHGS ASSDFGTPWG PPALEPGPGA GLGGSGGVGA TGAGHDGPVP AGPSHPGGGG ASEGADLAGL ALREALFDQA PVGLALYDAG GRYIRVNDVL ARLNGRPAAE HLGRTMSELL GEIGQEMDGL LSRVLRTGEA VTDLEIGVAT GGAGPNQTWS ASWYPATDRL GARAGAVLVA LDATRAKTAE RDHARAVGRE RALGEATAAD VFHAGEDGAL DTDLPRWRAA TGQNGAQAAG FGWLDAIHLD DRERVARAWH GAIERREPFD AEFGVPDVTG THRTITARLV PVVDGPQVEW IGVLTDVTEA RYARDPRPAQ DGVPDEATWR REQSWRLTSA LGRAVTVDDV VAAVLDSGGR AARAVGRGVA LIDESDDRLR FRSLAGTSDQ YAERWSEVGL GAIHPLAEVI RGRRPLFLRS RDELGGRWPV ADLLGAVESG PEHAWALLPL ATTDVPFGVL HLGFPSPREF DTDDQAFLMA MAEQCAQALE RATLFERLAA DTARSRRDRD EAEAELTAAL AAADVQQSAA GAARRRLEQL ARAGEAVASA TSPERALSAL AAAVAGEIAD VCVIHLLPPA DETETSAAPG SPGSPGLPGS RPAVFVARDG LTAAAPPGAP ALVLPGAGPL AAVAAGAAPV LLPASPAAPF LDGALTPELA RWVREAEGHS AAAVPITFRG RPAGVLTAIA AGERPAFTTD DLPFLTEVAA RTAPALERAE AGGRDGGDAL ALQKALLPRT LAPAGLDLAT RYLPAGGDDQ VGGDWFDVID LGAGRVALVI GDVTGRGVRA AALMGQLRSA VRTCARLDLP PAEVLIQLDG LVADLGEDLI ATCIYAVVET DTGLLTLASA GHPPPLVVAP DGLVSRLYMA AATPLGLACD AMTEYTVSLG PGSLLALFTD GLVRGRGLDI DAGVSNLAAV LARPAEEWDG RLDDLARAAC AARVGAGGPP PGDDVALLLA RLPVPDPLAE PLDVSADPSV GLSQVRAQVR VALENALTEP PVIDTVVLVL SELVGNALVH GRAPLSVRVR RMAASSPAGA GAAGGGSAGS GSAGSGSAGN GARRILVEVG DAGGRMPRRR RAGPDDETGR GLDLVGRLAL RWGVRPVGDG KLVWAEIDPS RV
|
| |