Gene Rsph17029_4068 details

Gene Information

Locus tag: Rsph17029_4068
Symbol:
ID: 4894980
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodobacter sphaeroides ATCC 17029
Kingdom: Bacteria
Replicon accession: NC_009040
Strand:
Start bp: 70
End bp: 4755
Gene Length: 4686 bp
Protein Length: 1561 aa
Translation table: 11
GC content: 75%
IMG OID: 640110470
Product: large exoprotein involved in heme utilization or adhesion
Protein accession: YP_001041782
Protein GI: 126464806
COG category:
COG ID:
TIGRFAM ID:
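
The listed lengths follow from the coordinates. A minimal sketch of that arithmetic, assuming the start and end positions are 1-based and inclusive and that the final codon is a stop codon (the variable names are illustrative, not part of the record):

```python
# Coordinates as listed in the record (assumed 1-based, inclusive).
start_bp = 70
end_bp = 4755

# An inclusive span covers end - start + 1 bases.
gene_length = end_bp - start_bp + 1

# A complete CDS is a whole number of codons; the last one is the stop codon,
# which does not contribute an amino acid.
assert gene_length % 3 == 0
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)  # 4686 1561
```

Both values agree with the Gene Length and Protein Length fields above.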


Plasmid Coverage information

Num covering plasmid clones: 23
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 82
Fosmid unclonability p-value: 0.516852
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACCGACT TCAGCAGCAA GACCACAGCA GAGATCGCGG CCCTGACGAG CGCCGAGGTC 
GCCAGCATGT CGAGCCAGGA TCTGGCCGCC CTCTCCACCG CCCAGATCGC GGCGCTCACC
GCCCAGCAGA TCGGCTGGGT CAAGGCGGCG TCGCTGAAGG GGCTGGGGGA TGCGCAGGTG
GTGGCGCTGA CGACGGCGCA GGCGGCGGCG CTCGGCTCGG CGCAGCTGGC CGCGCTGACG
ACGGCGCAGG TGGCGGCGAT GGAGACGGCC GATCTCGCGG CGCTCTCGGC CACGGGGGTG
GCGGGGCTGA CTTCGGCGCA GCTCGGGGGG CTCTCGACCG GGCAGGTGGC GGCCCTCACC
ACGGCGCAGG TCGCTGCCCT GTCCAGCGTG GCGGTCAAGG GTCTGGGCTC GGTCCAGGCC
TCGGGTCTCA CGACGGCCCA GGTGGCCGCC CTGTCGACCG CCCAGCTCAA GGCCTTCTCG
ACCGCGGGCA TGACGGGGCT CGGCACGGCG CAGATCGTGG CGCTCTCGAG CGCGCAGGCG
GCGGTGCTCG GCTCGGCACA GGTCGCGGCA CTCACGACGG CGCAGGCGGC GGCGATGGAG
ACGGCCGATC TCGCGGCCCT CACCAGCGTG GCCGTGAAGG GGCTGAGCTC GACCCAGGTG
GGCGCGCTGA CCACGGCGCA GGTGGCGGCG CTGACCACGG GACAGCTCGG CGCGCTCTCG
ACCGGGGCGC TGAAGGGCCT GACCACGGCG CAGGTGGTTG CCCTGACCAC GGCGCAGGCG
GCCGGGCTCG GCTCGGCGCA GGTGGCGGGC CTGTCGAGCA CGCAGATCGC GGCGCTGGAG
ACGGCGGATC TGGCCGCCCT CTCCACGGCG GGGCTGAAGG GTCTGGGCTC GGCGCAGGCG
GGGGGCCTGA CCACGGCGCA GGTGGCGGCG CTCACGACGG CTCAGGTGGG CCAGCTCTCG
AGTGCCGCGC TGAAAGGGCT CGGGACGGCG CAGGTGGTGG CACTGACGAC CGCGCAGGCG
GCGGCGCTCG GCACGGCGCA GGTGGGCGCG CTCTCGACCG CACAGGTGGC GGCGCTCGAG
ACCGTCGATC TCGCGGCGCT CTCGACGGCG GCGGCGAATG CCCTGACCTC GGCTCAGGTC
GCGAGCCTCA CGACGGCGCA GGTGGCCGCG CTGACGACGG CGCAGGTCGC GGCGCTCTCG
ACGGGGGCGG TGAAGGGGCT GAGCTCGACC CAGGCGGGCG CGCTGACCAC GGCACAGGTG
GCGGCGCTGA CCACGGGGCA GCTCGGCGCG CTCTCGACCG GGGCGCTGAA GGGCCTGACC
ACGGCGCAGG TGGTGGCGCT GACCACCGCG CAGGCGGCGG GGCTCGGCTC GGTGCAGGTG
GCGGGGCTCT CGAGCACGCA GATCGCGGCG CTGGAGACGG CGGATCTGGC CGCGCTTTCC
ACGACGGGGC TGAAGGGTCT GGGCTCGGCG CAGGCCGCGG GCCTGACCAC GGCGCAGGTG
GCGGCGCTCA CCACGGCTCA GGTGGGCCAG CTCTCGAGTG CCGCGCTGAA AGGGCTCGGG
ACGGCGCAGA TCGTGGCGCT GACGACGGCG CAGGCGGCGG CGCTGGGGTC GACGCAGGTG
GCCGGGCTCT CGACCGCGCA GGTGGCGGCG CTGGAGACGG CCGATCTCGC GATGCTCTCG
ACCGCGGGGG TGAAGGCGCT GAGCTCGACG CAGGTGGGCG CGCTGACGAC GGCGCAGGTG
GCGGCCCTGA CGACAGCGCA GGCCGCCCAG ATCTCGACGG CGGCGGTGAA GGGCCTGAGT
TCGACGCAGG TGGCGGCCCT GACGACGGGG CAGGTGGCGG CCCTGACCAC GGCCCAGCTC
GGCGCACTCA CGACGGCGGC GCTGAAGGGC GTGACCACGG CGCAGGTGGT GGCGCTGACC
ACGGCGCAGG CGGCGGGGCT CGGCTCGGCG CTGCTGGCGG GCCTGTCGAG CACGCAGATC
GCAGCGATCG AGACGGCGGA TCTGGCCGCG CTCTCCACGA CCGGGCTGAA GGGTCTGGGC
TCGGCGCAGG CGGCGGGCCT GACCACGGCG CAGGTGGCCG CCTTCACCAC GGCCCAGGTG
GGGCAGCTTT CGACGGCGGC GCTGAAGGGG CTCGGCACCG CGCAGATCGT GGCGCTGACC
ACGGGCCAGG CGGGGGCGCT CGGCTCGGCG CAGGTGGCGG GTCTCTCGAC CGCGCAGGTG
GCGGCGCTCG AGACGGCCGA TGTCGCGGCG CTCTCGACGG CGGGGGTGAA GGGCTTGGGC
TCGGCGCAGG CGGCGGCGCT CGGCTCGGCG CAGGTGGCAG CGCTGACGAC GACGCAGGTG
GGCCAGCTTT CGACCACGGC CCTGAAGGGC TTCGGCTCGG TGCAGGCTTC GGGTCTCACC
ACGGCGCAGG TGGCGGCGCT GACCACGACG CAGCTCTCGC AACTCTCGAC GGCGGCGGTG
AAGGGGCTCG GCACCGCGCA GATCGTGGCG CTGACCACGG GCCAGACGGC AGCGCTCGGC
TCGGCGCAAC TGGGCGCCCT CTCGACGGCG CAGGTGGCGG CCTTCGAGAC GGCGGATGCC
GCGGCGCTGA CCACGACGGC GCTGAAGGGG CTGACCACCG CGCAGGTGGT GGCGCTGACG
ACGGGTCAGG CGGCGGCGCT CGGCTCGGCG CAGGTCGCGG GCCTGTCGAG CACGCAGATC
GCGGCGCTCG AGACGGCGGA TCTCGCGGCC CTGACCACCA CGGCGGTGAA GGGCCTGGGC
TCGACGCAGG TTTCGAGCCT GACGACGGGG CAGGTGGCGG CGCTCACCAC CGCGCAGGTG
GCGGCGCTGA GCACGGCGGC CGTGAAGGGC GTGGGCTCGG TGCAGGCCTC GGGGCTGACG
ACGGCGCAGG TGGCGGCGCT GACCACGGCC CAGGTGGCCC AGCTCTCGAC GGCGGCGCTG
AAGGGGCTCG GCACGGCGCA GATCGTGGCG CTGACCACGG CCCAGGCGGC CAAGCTCGGC
TCCGATCAGG TCGCCGCCCT CTCGACGGCG CAGGTGGCGG CGCTGGAGAC GGCGGATCTG
GCGACCCTCT CGGCCACGGG CGTGAAGGGC TTCGGATCGG CACAGGCGGC GGCCCTCGGC
TCGGCACAGG TGGCGGCGTT CACCACGGCG CAGGTGGCGG CGCTGACCAC GGCGGCGGTG
AAGGGCTTCG GCTCGGTGCA GGCCTCGGGC CTCACCACCG CGCAGGTGGC CGCGCTGACC
ACGGCGCAGC TCTCGCAGCT CTCGACGGCG GCGGTGAAGG GGCTCGGCAC GGCGCAGATC
GTGGCGCTGA CCACGGGCCA GACGGCGGCG CTCGGCTCGG CGCAGCTGGG TGCCCTCTCG
ACCGCGCAGG TGGCGGCCTT CGAGACGGCG GATGCCGCGG CGCTGACCAC GACGGCGCTG
AAGGGGCTGA CCACCGCGCA GGTGGTGGCG CTGACGACGG GTCAGGCGGC GGCGCTCGGG
TCGGTGCAGG TGGCGGGTCT CACGACCGCG CAGATGGCGG CGCTCGAGAC GGTGGATCTC
GCGGCCCTCA CCACCACGGC AGTGAAGGGG ATCACCACCG CCCAGATGGG GGCGCTGACG
ACGGGGCAGG TGGCCGCCCT CACCACGGCG CAGGTGGCCG CGCTTGCGGG CACGGCGGTG
AAGGGACTGT CCTCGACCCA GGCGGGGGCG CTGACAACGG CACAGGTGGC GGCGCTGACC
ACGGCGCAGG TGCCCCAGCT CTCGACGGCG GCGCTGAAAG GGCTCGGCAC GGCCCAGATC
GTGGCGCTGA CCACGGCCCA GGCGGCCGTC CTCGGCTCGG CGCAGCTGGC GGGCCTCTCG
ACGGTGCAGG TGGCGGCGCT CGAGACGGTC GATCTCGCGG CCCTGACCAC CGCGGCCGTG
AAGGGCCTCG GCTCGGCCCA GGTCGCGGGC CTGACCACGG GCCAGGTGGC GGCCCTCACG
ACGGCGCAGA TGGCCCAGCT CTCGACGGCG GCGATCGCGG GTCTGGGATC GGTGCAGGCC
TCTGGCCTGA CCACGGGCCA GGTGGCGGCC CTCACCACCG ATCAGCTCGC CCGGATCACC
ACCGCGGCGG TGAAGGGGCT CGGCACGGCG CAGATCGTGG CTCTGACCAC GGCGCAGGCG
GCCACGCTCG GCTCGGCGCA ACTGGGCGCG CTCTCGACGG CGCAGGTGGC GGCCTTCGAG
ACGGCGGATG TGGCGGCGCT GACCACCGCA GCGGTGAAGG GCTTCGGGAC AGCGCAGGTG
GCGGCGCTGA CCACCGGGCA GGCGGCCGCC CTTGGCTCGC GTCAGGTGGG CGCGCTTTCC
ACGGCGCAGG TGGCGGCGCT CGAGACGGCG GATCTCGCAG CCCTCACCAC CGCGGCCGTG
AAGGGTCTGG GATCGGCGCA GGCGAAAGTC CTGACGGCGG CGCAGATGGC CGCGCTCACC
TCGGCTCAGG TGGCGGCCCT CACCACGACC GCCGTGGCAG GCTTCGGCTC GGTGCAGGCG
GCGGCGCTCA CCACGGCGCA GATGACGGCG CTCACCACCG CGCAGATCCC CACCCTGACC
ACGGCCGCCA TCAAGGGTCT CGAAACCGCC GATATCGCGG CGCTCACCAC GACGCAGGCG
TCGGCCTTCA CGGCCACGCA ACTGGGGGCC ATGTCGAGCG CCCAGATCGC GGCTCTCTTC
CTCTGA
 
Protein sequence
MTDFSSKTTA EIAALTSAEV ASMSSQDLAA LSTAQIAALT AQQIGWVKAA SLKGLGDAQV 
VALTTAQAAA LGSAQLAALT TAQVAAMETA DLAALSATGV AGLTSAQLGG LSTGQVAALT
TAQVAALSSV AVKGLGSVQA SGLTTAQVAA LSTAQLKAFS TAGMTGLGTA QIVALSSAQA
AVLGSAQVAA LTTAQAAAME TADLAALTSV AVKGLSSTQV GALTTAQVAA LTTGQLGALS
TGALKGLTTA QVVALTTAQA AGLGSAQVAG LSSTQIAALE TADLAALSTA GLKGLGSAQA
GGLTTAQVAA LTTAQVGQLS SAALKGLGTA QVVALTTAQA AALGTAQVGA LSTAQVAALE
TVDLAALSTA AANALTSAQV ASLTTAQVAA LTTAQVAALS TGAVKGLSST QAGALTTAQV
AALTTGQLGA LSTGALKGLT TAQVVALTTA QAAGLGSVQV AGLSSTQIAA LETADLAALS
TTGLKGLGSA QAAGLTTAQV AALTTAQVGQ LSSAALKGLG TAQIVALTTA QAAALGSTQV
AGLSTAQVAA LETADLAMLS TAGVKALSST QVGALTTAQV AALTTAQAAQ ISTAAVKGLS
STQVAALTTG QVAALTTAQL GALTTAALKG VTTAQVVALT TAQAAGLGSA LLAGLSSTQI
AAIETADLAA LSTTGLKGLG SAQAAGLTTA QVAAFTTAQV GQLSTAALKG LGTAQIVALT
TGQAGALGSA QVAGLSTAQV AALETADVAA LSTAGVKGLG SAQAAALGSA QVAALTTTQV
GQLSTTALKG FGSVQASGLT TAQVAALTTT QLSQLSTAAV KGLGTAQIVA LTTGQTAALG
SAQLGALSTA QVAAFETADA AALTTTALKG LTTAQVVALT TGQAAALGSA QVAGLSSTQI
AALETADLAA LTTTAVKGLG STQVSSLTTG QVAALTTAQV AALSTAAVKG VGSVQASGLT
TAQVAALTTA QVAQLSTAAL KGLGTAQIVA LTTAQAAKLG SDQVAALSTA QVAALETADL
ATLSATGVKG FGSAQAAALG SAQVAAFTTA QVAALTTAAV KGFGSVQASG LTTAQVAALT
TAQLSQLSTA AVKGLGTAQI VALTTGQTAA LGSAQLGALS TAQVAAFETA DAAALTTTAL
KGLTTAQVVA LTTGQAAALG SVQVAGLTTA QMAALETVDL AALTTTAVKG ITTAQMGALT
TGQVAALTTA QVAALAGTAV KGLSSTQAGA LTTAQVAALT TAQVPQLSTA ALKGLGTAQI
VALTTAQAAV LGSAQLAGLS TVQVAALETV DLAALTTAAV KGLGSAQVAG LTTGQVAALT
TAQMAQLSTA AIAGLGSVQA SGLTTGQVAA LTTDQLARIT TAAVKGLGTA QIVALTTAQA
ATLGSAQLGA LSTAQVAAFE TADVAALTTA AVKGFGTAQV AALTTGQAAA LGSRQVGALS
TAQVAALETA DLAALTTAAV KGLGSAQAKV LTAAQMAALT SAQVAALTTT AVAGFGSVQA
AALTTAQMTA LTTAQIPTLT TAAIKGLETA DIAALTTTQA SAFTATQLGA MSSAQIAALF
L
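
The protein sequence corresponds to a codon-by-codon translation of the gene sequence. The sketch below illustrates this on the first and last lines of the gene sequence only. It assumes the NCBI codon assignments for translation table 11 (which, for sense and stop codons, match the standard table); the helper name `translate` is hypothetical and not part of any tool referenced by this page.

```python
from itertools import product

# NCBI-style codon table layout: bases ordered T, C, A, G, third base varying fastest.
# Assumption: table 11 uses the same codon-to-amino-acid assignments as the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string (whitespace ignored) until the first stop codon."""
    dna = "".join(dna.split()).upper()
    protein = []
    # Walk over complete codons only; any trailing partial codon is ignored.
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


# First and last lines of the gene sequence, copied from the record.
first_line = "ATGACCGACT TCAGCAGCAA GACCACAGCA GAGATCGCGG CCCTGACGAG CGCCGAGGTC"
last_line = "CTCTGA"

print(translate(first_line))  # MTDFSSKTTAEIAALTSAEV
print(translate(last_line))   # L
```

The first result matches the opening 20 residues of the protein sequence, and the second matches its final residue, with TGA acting as the stop codon.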