Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rsph17029_4068 |
Symbol | |
ID | 4894980 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodobacter sphaeroides ATCC 17029 |
Kingdom | Bacteria |
Replicon accession | NC_009040 |
Strand | - |
Start bp | 70 |
End bp | 4755 |
Gene Length | 4686 bp |
Protein Length | 1561 aa |
Translation table | 11 |
GC content | 75% |
IMG OID | 640110470 |
Product | large exoprotein involved in heme utilization or adhesion |
Protein accession | YP_001041782 |
Protein GI | 126464806 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 82 |
Fosmid unclonability p-value | 0.516852 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACCGACT TCAGCAGCAA GACCACAGCA GAGATCGCGG CCCTGACGAG CGCCGAGGTC GCCAGCATGT CGAGCCAGGA TCTGGCCGCC CTCTCCACCG CCCAGATCGC GGCGCTCACC GCCCAGCAGA TCGGCTGGGT CAAGGCGGCG TCGCTGAAGG GGCTGGGGGA TGCGCAGGTG GTGGCGCTGA CGACGGCGCA GGCGGCGGCG CTCGGCTCGG CGCAGCTGGC CGCGCTGACG ACGGCGCAGG TGGCGGCGAT GGAGACGGCC GATCTCGCGG CGCTCTCGGC CACGGGGGTG GCGGGGCTGA CTTCGGCGCA GCTCGGGGGG CTCTCGACCG GGCAGGTGGC GGCCCTCACC ACGGCGCAGG TCGCTGCCCT GTCCAGCGTG GCGGTCAAGG GTCTGGGCTC GGTCCAGGCC TCGGGTCTCA CGACGGCCCA GGTGGCCGCC CTGTCGACCG CCCAGCTCAA GGCCTTCTCG ACCGCGGGCA TGACGGGGCT CGGCACGGCG CAGATCGTGG CGCTCTCGAG CGCGCAGGCG GCGGTGCTCG GCTCGGCACA GGTCGCGGCA CTCACGACGG CGCAGGCGGC GGCGATGGAG ACGGCCGATC TCGCGGCCCT CACCAGCGTG GCCGTGAAGG GGCTGAGCTC GACCCAGGTG GGCGCGCTGA CCACGGCGCA GGTGGCGGCG CTGACCACGG GACAGCTCGG CGCGCTCTCG ACCGGGGCGC TGAAGGGCCT GACCACGGCG CAGGTGGTTG CCCTGACCAC GGCGCAGGCG GCCGGGCTCG GCTCGGCGCA GGTGGCGGGC CTGTCGAGCA CGCAGATCGC GGCGCTGGAG ACGGCGGATC TGGCCGCCCT CTCCACGGCG GGGCTGAAGG GTCTGGGCTC GGCGCAGGCG GGGGGCCTGA CCACGGCGCA GGTGGCGGCG CTCACGACGG CTCAGGTGGG CCAGCTCTCG AGTGCCGCGC TGAAAGGGCT CGGGACGGCG CAGGTGGTGG CACTGACGAC CGCGCAGGCG GCGGCGCTCG GCACGGCGCA GGTGGGCGCG CTCTCGACCG CACAGGTGGC GGCGCTCGAG ACCGTCGATC TCGCGGCGCT CTCGACGGCG GCGGCGAATG CCCTGACCTC GGCTCAGGTC GCGAGCCTCA CGACGGCGCA GGTGGCCGCG CTGACGACGG CGCAGGTCGC GGCGCTCTCG ACGGGGGCGG TGAAGGGGCT GAGCTCGACC CAGGCGGGCG CGCTGACCAC GGCACAGGTG GCGGCGCTGA CCACGGGGCA GCTCGGCGCG CTCTCGACCG GGGCGCTGAA GGGCCTGACC ACGGCGCAGG TGGTGGCGCT GACCACCGCG CAGGCGGCGG GGCTCGGCTC GGTGCAGGTG GCGGGGCTCT CGAGCACGCA GATCGCGGCG CTGGAGACGG CGGATCTGGC CGCGCTTTCC ACGACGGGGC TGAAGGGTCT GGGCTCGGCG CAGGCCGCGG GCCTGACCAC GGCGCAGGTG GCGGCGCTCA CCACGGCTCA GGTGGGCCAG CTCTCGAGTG CCGCGCTGAA AGGGCTCGGG ACGGCGCAGA TCGTGGCGCT GACGACGGCG CAGGCGGCGG CGCTGGGGTC GACGCAGGTG GCCGGGCTCT CGACCGCGCA GGTGGCGGCG CTGGAGACGG CCGATCTCGC GATGCTCTCG ACCGCGGGGG TGAAGGCGCT GAGCTCGACG CAGGTGGGCG CGCTGACGAC GGCGCAGGTG GCGGCCCTGA CGACAGCGCA GGCCGCCCAG ATCTCGACGG CGGCGGTGAA GGGCCTGAGT TCGACGCAGG TGGCGGCCCT GACGACGGGG CAGGTGGCGG CCCTGACCAC GGCCCAGCTC GGCGCACTCA CGACGGCGGC GCTGAAGGGC GTGACCACGG CGCAGGTGGT GGCGCTGACC ACGGCGCAGG CGGCGGGGCT CGGCTCGGCG CTGCTGGCGG GCCTGTCGAG CACGCAGATC GCAGCGATCG AGACGGCGGA TCTGGCCGCG CTCTCCACGA CCGGGCTGAA GGGTCTGGGC TCGGCGCAGG CGGCGGGCCT GACCACGGCG CAGGTGGCCG CCTTCACCAC GGCCCAGGTG GGGCAGCTTT CGACGGCGGC GCTGAAGGGG CTCGGCACCG CGCAGATCGT GGCGCTGACC ACGGGCCAGG CGGGGGCGCT CGGCTCGGCG CAGGTGGCGG GTCTCTCGAC CGCGCAGGTG GCGGCGCTCG AGACGGCCGA TGTCGCGGCG CTCTCGACGG CGGGGGTGAA GGGCTTGGGC TCGGCGCAGG CGGCGGCGCT CGGCTCGGCG CAGGTGGCAG CGCTGACGAC GACGCAGGTG GGCCAGCTTT CGACCACGGC CCTGAAGGGC TTCGGCTCGG TGCAGGCTTC GGGTCTCACC ACGGCGCAGG TGGCGGCGCT GACCACGACG CAGCTCTCGC AACTCTCGAC GGCGGCGGTG AAGGGGCTCG GCACCGCGCA GATCGTGGCG CTGACCACGG GCCAGACGGC AGCGCTCGGC TCGGCGCAAC TGGGCGCCCT CTCGACGGCG CAGGTGGCGG CCTTCGAGAC GGCGGATGCC GCGGCGCTGA CCACGACGGC GCTGAAGGGG CTGACCACCG CGCAGGTGGT GGCGCTGACG ACGGGTCAGG CGGCGGCGCT CGGCTCGGCG CAGGTCGCGG GCCTGTCGAG CACGCAGATC GCGGCGCTCG AGACGGCGGA TCTCGCGGCC CTGACCACCA CGGCGGTGAA GGGCCTGGGC TCGACGCAGG TTTCGAGCCT GACGACGGGG CAGGTGGCGG CGCTCACCAC CGCGCAGGTG GCGGCGCTGA GCACGGCGGC CGTGAAGGGC GTGGGCTCGG TGCAGGCCTC GGGGCTGACG ACGGCGCAGG TGGCGGCGCT GACCACGGCC CAGGTGGCCC AGCTCTCGAC GGCGGCGCTG AAGGGGCTCG GCACGGCGCA GATCGTGGCG CTGACCACGG CCCAGGCGGC CAAGCTCGGC TCCGATCAGG TCGCCGCCCT CTCGACGGCG CAGGTGGCGG CGCTGGAGAC GGCGGATCTG GCGACCCTCT CGGCCACGGG CGTGAAGGGC TTCGGATCGG CACAGGCGGC GGCCCTCGGC TCGGCACAGG TGGCGGCGTT CACCACGGCG CAGGTGGCGG CGCTGACCAC GGCGGCGGTG AAGGGCTTCG GCTCGGTGCA GGCCTCGGGC CTCACCACCG CGCAGGTGGC CGCGCTGACC ACGGCGCAGC TCTCGCAGCT CTCGACGGCG GCGGTGAAGG GGCTCGGCAC GGCGCAGATC GTGGCGCTGA CCACGGGCCA GACGGCGGCG CTCGGCTCGG CGCAGCTGGG TGCCCTCTCG ACCGCGCAGG TGGCGGCCTT CGAGACGGCG GATGCCGCGG CGCTGACCAC GACGGCGCTG AAGGGGCTGA CCACCGCGCA GGTGGTGGCG CTGACGACGG GTCAGGCGGC GGCGCTCGGG TCGGTGCAGG TGGCGGGTCT CACGACCGCG CAGATGGCGG CGCTCGAGAC GGTGGATCTC GCGGCCCTCA CCACCACGGC AGTGAAGGGG ATCACCACCG CCCAGATGGG GGCGCTGACG ACGGGGCAGG TGGCCGCCCT CACCACGGCG CAGGTGGCCG CGCTTGCGGG CACGGCGGTG AAGGGACTGT CCTCGACCCA GGCGGGGGCG CTGACAACGG CACAGGTGGC GGCGCTGACC ACGGCGCAGG TGCCCCAGCT CTCGACGGCG GCGCTGAAAG GGCTCGGCAC GGCCCAGATC GTGGCGCTGA CCACGGCCCA GGCGGCCGTC CTCGGCTCGG CGCAGCTGGC GGGCCTCTCG ACGGTGCAGG TGGCGGCGCT CGAGACGGTC GATCTCGCGG CCCTGACCAC CGCGGCCGTG AAGGGCCTCG GCTCGGCCCA GGTCGCGGGC CTGACCACGG GCCAGGTGGC GGCCCTCACG ACGGCGCAGA TGGCCCAGCT CTCGACGGCG GCGATCGCGG GTCTGGGATC GGTGCAGGCC TCTGGCCTGA CCACGGGCCA GGTGGCGGCC CTCACCACCG ATCAGCTCGC CCGGATCACC ACCGCGGCGG TGAAGGGGCT CGGCACGGCG CAGATCGTGG CTCTGACCAC GGCGCAGGCG GCCACGCTCG GCTCGGCGCA ACTGGGCGCG CTCTCGACGG CGCAGGTGGC GGCCTTCGAG ACGGCGGATG TGGCGGCGCT GACCACCGCA GCGGTGAAGG GCTTCGGGAC AGCGCAGGTG GCGGCGCTGA CCACCGGGCA GGCGGCCGCC CTTGGCTCGC GTCAGGTGGG CGCGCTTTCC ACGGCGCAGG TGGCGGCGCT CGAGACGGCG GATCTCGCAG CCCTCACCAC CGCGGCCGTG AAGGGTCTGG GATCGGCGCA GGCGAAAGTC CTGACGGCGG CGCAGATGGC CGCGCTCACC TCGGCTCAGG TGGCGGCCCT CACCACGACC GCCGTGGCAG GCTTCGGCTC GGTGCAGGCG GCGGCGCTCA CCACGGCGCA GATGACGGCG CTCACCACCG CGCAGATCCC CACCCTGACC ACGGCCGCCA TCAAGGGTCT CGAAACCGCC GATATCGCGG CGCTCACCAC GACGCAGGCG TCGGCCTTCA CGGCCACGCA ACTGGGGGCC ATGTCGAGCG CCCAGATCGC GGCTCTCTTC CTCTGA
|
Protein sequence | MTDFSSKTTA EIAALTSAEV ASMSSQDLAA LSTAQIAALT AQQIGWVKAA SLKGLGDAQV VALTTAQAAA LGSAQLAALT TAQVAAMETA DLAALSATGV AGLTSAQLGG LSTGQVAALT TAQVAALSSV AVKGLGSVQA SGLTTAQVAA LSTAQLKAFS TAGMTGLGTA QIVALSSAQA AVLGSAQVAA LTTAQAAAME TADLAALTSV AVKGLSSTQV GALTTAQVAA LTTGQLGALS TGALKGLTTA QVVALTTAQA AGLGSAQVAG LSSTQIAALE TADLAALSTA GLKGLGSAQA GGLTTAQVAA LTTAQVGQLS SAALKGLGTA QVVALTTAQA AALGTAQVGA LSTAQVAALE TVDLAALSTA AANALTSAQV ASLTTAQVAA LTTAQVAALS TGAVKGLSST QAGALTTAQV AALTTGQLGA LSTGALKGLT TAQVVALTTA QAAGLGSVQV AGLSSTQIAA LETADLAALS TTGLKGLGSA QAAGLTTAQV AALTTAQVGQ LSSAALKGLG TAQIVALTTA QAAALGSTQV AGLSTAQVAA LETADLAMLS TAGVKALSST QVGALTTAQV AALTTAQAAQ ISTAAVKGLS STQVAALTTG QVAALTTAQL GALTTAALKG VTTAQVVALT TAQAAGLGSA LLAGLSSTQI AAIETADLAA LSTTGLKGLG SAQAAGLTTA QVAAFTTAQV GQLSTAALKG LGTAQIVALT TGQAGALGSA QVAGLSTAQV AALETADVAA LSTAGVKGLG SAQAAALGSA QVAALTTTQV GQLSTTALKG FGSVQASGLT TAQVAALTTT QLSQLSTAAV KGLGTAQIVA LTTGQTAALG SAQLGALSTA QVAAFETADA AALTTTALKG LTTAQVVALT TGQAAALGSA QVAGLSSTQI AALETADLAA LTTTAVKGLG STQVSSLTTG QVAALTTAQV AALSTAAVKG VGSVQASGLT TAQVAALTTA QVAQLSTAAL KGLGTAQIVA LTTAQAAKLG SDQVAALSTA QVAALETADL ATLSATGVKG FGSAQAAALG SAQVAAFTTA QVAALTTAAV KGFGSVQASG LTTAQVAALT TAQLSQLSTA AVKGLGTAQI VALTTGQTAA LGSAQLGALS TAQVAAFETA DAAALTTTAL KGLTTAQVVA LTTGQAAALG SVQVAGLTTA QMAALETVDL AALTTTAVKG ITTAQMGALT TGQVAALTTA QVAALAGTAV KGLSSTQAGA LTTAQVAALT TAQVPQLSTA ALKGLGTAQI VALTTAQAAV LGSAQLAGLS TVQVAALETV DLAALTTAAV KGLGSAQVAG LTTGQVAALT TAQMAQLSTA AIAGLGSVQA SGLTTGQVAA LTTDQLARIT TAAVKGLGTA QIVALTTAQA ATLGSAQLGA LSTAQVAAFE TADVAALTTA AVKGFGTAQV AALTTGQAAA LGSRQVGALS TAQVAALETA DLAALTTAAV KGLGSAQAKV LTAAQMAALT SAQVAALTTT AVAGFGSVQA AALTTAQMTA LTTAQIPTLT TAAIKGLETA DIAALTTTQA SAFTATQLGA MSSAQIAALF L
|
| |