Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Francci3_3501 |
Symbol | |
ID | 3905235 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. CcI3 |
Kingdom | Bacteria |
Replicon accession | NC_007777 |
Strand | - |
Start bp | 4174895 |
End bp | 4178572 |
Gene Length | 3678 bp |
Protein Length | 1225 aa |
Translation table | 11 |
GC content | 75% |
IMG OID | 637880823 |
Product | putative PAS/PAC sensor protein |
Protein accession | YP_482583 |
Protein GI | 86742183 |
COG category | [K] Transcription [T] Signal transduction mechanisms |
COG ID | [COG2208] Serine phosphatase RsbU, regulator of sigma subunit |
TIGRFAM ID | [TIGR00229] PAS domain S-box |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.480314 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 10 |
Fosmid unclonability p-value | 0.404449 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGAGCCCCC CGGGATCCGC CTCCCCGTCA TCGGGCGTTG TTCCGTCCGG CCAGGGGGAT GTCTCCCCGG GACCGAATCC GGTCACCCGA TCCCAAGGGG GACCGTCGGC TGGCCCACAC ACCAACCCCG CCCCACACAC CAACCCCGCC CCACGCGCGC ATCCCGGCCC GAACCAGGTC CCCGGCCCGA AGCACGCTCC CGGCCCGGAG ACCGGCCCCG GTCAAGACCG GGGAGCCGAC CCCGAGCTAC CCGGACTACC CGAGGGCTGG CCCATGGTCG CCCGGCCGGC CGCTTCGGCC GTGACCGCCG AGCCGGTCAC GCGGGACGTC ATCGACGGCA CCGAGCCGCG CCGCGCGCCG GAGGCGCCAG GGATGCGCCG GTCCGATCCG GCGGGGCTGC CGGGGCGTGT CCGGTCCGCT TTCCCCCTGC CCGGACACGC CCCGGGAACC GACCCCGCGG CCACGTTCGT GATGCTGGAC GCGCTGTTCG CGTGGGCCCC GGTCGGTCTC GCCCTGCTCG ACCGGGCCGG GCGGTTCCTG CGGGTCAACG ACACACTTGC CCGGTTCGAC CGCCGGCCGG TCCACGAGCA CCTCGGCCGT ACCGTCTCCG AGCTCCTCGG CGACACCGGC CAGGAACTCG ACGCTCTGCT GGCACGGGTA CTGCGCACCG GCGAGCCGGT GGTGGACTTG GAGGTCATGG TCGCCACCGA TGGTTCCGGG CCGCCGCAGA CGTGGCTCGC CAGCTGGTAT CCGGTGAACG ACCCTCAGGT CGGGCTGGTG GGCGTGGCGT TCGTCGCGAT CGACGCCAGC GGGACGCGGG CGGTCGAGGG GGAGCGGGCG CGGGCCGACG CACGTTACCT CGGCCTCGTG GACGCCGCCG CGGTGGACGT GTTCCACGCC GAAGGCGACG GGGCACTCGA TGCCGATCTG CCCCGCCTGC GGGCCTTCAC CGGGCGGCAT CCGGCGGAGC TGGCCGGGTT CGGCTGGCTC GGCGTGGTCC ACCCCGACGA CCGGGAGCGG GTCGGGCGCG CCTGGCACGG GGCGATCGAG CACGGTGAGA CCTTCGAGGC CGAGTTCCGG ATCTCCGGTG GCGGAGGCGA CCGCACCGCC ATGCGGGTGG TCGAGGCCCG CATCGTTCCC ATGCCGGCGG CCGGCCGGCC GAACAGCCGG CCCAGCGAGT GGCTCGGGGT GATCCGCGAC CTCACCGAGG TACGCGCCGC CGAGGCCGAC CGGGCCACGG CCGACCAGCG GGCCCGGATC GCGACGGAAC GGGCCGAACA GACCGCGACG TTGGCCGTGG CGCTCGCCCG GACCCTGACC GTGGACGACG TCGTCGCCAC CGTCCTCGAC GTCGGGGGGC GGATGGCCGG GGCCGCCGGC CGGGGCGTCG CGCTCGTGGA CGAGGCGCAC GACCGGCTGC TCTTCCACGC CCCGCCGGGT CCCGCCGACG GCCTCGCCCG CTGGTCGGAG GTGGCGCTGG GGGCGGTACA TCCGGTGGCG GAGGTCATGC GGGGCGGTCG TGCGCTGTTT CTCGTCGACC GCGACGAGCT GCTCGCCCGC TGGCCGGTGC CGGAGGTCGC CGACGTCGCC GCCGCGGCGG GTGAGCATGC CTGGGCGATG CTGCCGCTGG CCGTCGGGGG CGGCGCCCCC TTCGGCGTGG TGACCTTCGG GTTCCGCCAG GCGCGGGAGT TCACCGCCGC TGATCAGGCG TCCCTCATCG CGATCGCCGA CGCCTGCGCG CAGGCCCTGG AACGGGCCAC CGGCTACGAA CAGCTCGCCG CCGACGCCGC GCGCGGTCAT CGGACCCTGG CTGCGACGCG CGAGGCGCAG GCCGCCCTGG CACTCGCCGA TCGACGCCTT CAGCTGCTGG GGCGAGCTAC CGGGATCGTG GCCGCGGCCG TGGAGCCTCC CGTCGCCCTG CGCTCCCTGG CCGAGTTGAT CGTCTCGGAG GTCGCCGACC TGTGTGTTGT CCAGCTCGTC ACTGGCACAC CCGCTCCCGT CCCGGCGTCC TGGAGCGCTG CCGCGGTGTC CGTCCGGGCC GGGGACGCGG CCGGGGATAA GGCCGGGGAC GCGGCCGGGG ATAAGGCCGG GGCCGAGGAG ACGGCGGGGG ACCGGGCGGC AGAGCCGGTG CCCGAGCTGC GTCCACTCGT CGTCCTGGCC CGTGACGGGC TTGGCACGGT GCCTCCGTTC GCCTCGGGGG CCGGCGCGGC GACGTCGCCG GCGAGTCCGT TCGCCCGGGC CGCCCGCCGG GGCGAACGGC TGATCGTCGC ACTGACAGCG GGCGAGTGGG ATCCGCCGGC CGACGCCGAG CGGTGGATCC GCCAGGTGGG GGCCCACACG ATGGCTGTCG TACCCGTGGT ACGGGTCGGC CACGTCGTCG CCGTACTGAG TGTGACCGCC GTTGCGGATC GACCTCCGTT CACCGAAGCG GATCTGCTCT TGCTGACCGA ACTCGCCGCC CGGGTGGGGG TCGTCCTGGA CCGGATCGAT CGGGGGGCCG CCGAGCGCAG CAATGCGCTG GCACTGCGCG AGGCGTTGCG CGGTTCTCCA CCCGCCGTCC CGTCCGGGCT CGAGGTGGCC ACCCGTTACC TGCCCGGCGG GGTCGATGAC GACGCCGGTA GCGACTGGTT CGACGTGATT GACCTGGGAG CCGGGCGAGT CGCCCTGATG ATCGGCAACG TGATGGGCCG AGGGATCCGG GCGACCGCGG TGATGGGGCA GCTGCGCGCG GCGGCCCGCA CCTGCGCCCG TCTCGATCTT CCCCCGGCCG AGGTGCTGAC GCTGCTGGAC GGCATCGTCG CGGACCTGCC CGGGGAGGAG ATCGCCACCT GCATCTACGC GGTCGTTGAG ATCGACAGTG GGGTGCTGAC GCTGGCGAGT GCCGGGCACC CGCCGCCCCT GGTCGTCGCG CCGGACGGGT TGGTCTCCAG GCTCTACATG GCGGTGGGAT CTCCACTCGG GGTGGCCCGG TCGGACGTGA CCGAGTACAC GGTGCGACTG GGACGGGGAT ATCTGATCGC CCTGTTCACC GACGGGCTCG TCCGGGGACG TGCGCGCGAC CTCGACGCCG GGGTCTCGCA GCTCGCGGCC GCGCTCGCCC GCGCCAGCGA CAGGTTCACC GCGAATCTGG ACGACCTGGT CACCACGGCG TGTGCCGGTC TCGGTCCCGC CGTCGCCCCC GGCCCAGTGG GTTCCGGGGC GGCCGACGGG GCGGCCGACG AGGTGGCGGC CGATGACGTC GCGCTTCTGT TCGCCCGGTT GCCCGTTGAA CCGACGGCCG CGGCGGCCCT CCTGGACGTC ACCTTCGACG GTGCGGCGAG CCTGCGCGCC GTGCGGGCTC AGGCCAGGCT CGCGCTGGAG AACGCGCCGC TGGCCTCGGA AGTCGTCGAC ACCATCGTTC TGGTGCTGTC GGAGCTGGCG AGCAACGCGG TGCGGCACGG TCGCCCACCA CTGTCGGTGC GGCTGCGGCT GCTGGGCGAC CGGGCCGTCG TCGAGGTCGC CGACGGTGGC GGCCGGGTGC TGCGGCGGCG CCACGCCGCG GCCGAGGATG AGGCCGGCCG CGGGCTCGGC CTGGTCTCCC AGCTCGCCGT TCGGCATGGC GTCCGTCCGG TCCCCGACGG GAAGGCCGTG TGGGCCGAGA TCGACCTGAC CGGGACGACC CCGCCCGAAC CGGACTGA
|
Protein sequence | MSPPGSASPS SGVVPSGQGD VSPGPNPVTR SQGGPSAGPH TNPAPHTNPA PRAHPGPNQV PGPKHAPGPE TGPGQDRGAD PELPGLPEGW PMVARPAASA VTAEPVTRDV IDGTEPRRAP EAPGMRRSDP AGLPGRVRSA FPLPGHAPGT DPAATFVMLD ALFAWAPVGL ALLDRAGRFL RVNDTLARFD RRPVHEHLGR TVSELLGDTG QELDALLARV LRTGEPVVDL EVMVATDGSG PPQTWLASWY PVNDPQVGLV GVAFVAIDAS GTRAVEGERA RADARYLGLV DAAAVDVFHA EGDGALDADL PRLRAFTGRH PAELAGFGWL GVVHPDDRER VGRAWHGAIE HGETFEAEFR ISGGGGDRTA MRVVEARIVP MPAAGRPNSR PSEWLGVIRD LTEVRAAEAD RATADQRARI ATERAEQTAT LAVALARTLT VDDVVATVLD VGGRMAGAAG RGVALVDEAH DRLLFHAPPG PADGLARWSE VALGAVHPVA EVMRGGRALF LVDRDELLAR WPVPEVADVA AAAGEHAWAM LPLAVGGGAP FGVVTFGFRQ AREFTAADQA SLIAIADACA QALERATGYE QLAADAARGH RTLAATREAQ AALALADRRL QLLGRATGIV AAAVEPPVAL RSLAELIVSE VADLCVVQLV TGTPAPVPAS WSAAAVSVRA GDAAGDKAGD AAGDKAGAEE TAGDRAAEPV PELRPLVVLA RDGLGTVPPF ASGAGAATSP ASPFARAARR GERLIVALTA GEWDPPADAE RWIRQVGAHT MAVVPVVRVG HVVAVLSVTA VADRPPFTEA DLLLLTELAA RVGVVLDRID RGAAERSNAL ALREALRGSP PAVPSGLEVA TRYLPGGVDD DAGSDWFDVI DLGAGRVALM IGNVMGRGIR ATAVMGQLRA AARTCARLDL PPAEVLTLLD GIVADLPGEE IATCIYAVVE IDSGVLTLAS AGHPPPLVVA PDGLVSRLYM AVGSPLGVAR SDVTEYTVRL GRGYLIALFT DGLVRGRARD LDAGVSQLAA ALARASDRFT ANLDDLVTTA CAGLGPAVAP GPVGSGAADG AADEVAADDV ALLFARLPVE PTAAAALLDV TFDGAASLRA VRAQARLALE NAPLASEVVD TIVLVLSELA SNAVRHGRPP LSVRLRLLGD RAVVEVADGG GRVLRRRHAA AEDEAGRGLG LVSQLAVRHG VRPVPDGKAV WAEIDLTGTT PPEPD
|
| |