Gene Rsph17025_3330 details


Gene Information

Locus tag: Rsph17025_3330
Symbol:
ID: 5085821
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodobacter sphaeroides ATCC 17025
Kingdom: Bacteria
Replicon accession: NC_009429
Strand:
Start bp: 205465
End bp: 207609
Gene Length: 2145 bp
Protein Length: 714 aa
Translation table: 11
GC content: 66%
IMG OID: 640484899
Product: hypothetical protein
Protein accession: YP_001169516
Protein GI: 146279358
COG category: [P] Inorganic ion transport and metabolism
COG ID: [COG4774] Outer membrane receptor for monomeric catechols
TIGRFAM ID: [TIGR01586] cysteine protease domain, YopT-type
            [TIGR01783] TonB-dependent siderophore receptor
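
The coordinate and length fields above are related by simple arithmetic. The sketch below shows one way to cross-check them, assuming the start and end positions are 1-based and inclusive and that the coding sequence ends in a stop codon that is not counted in the protein length. The function names are illustrative and do not come from the source database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose final codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


length = gene_length(205465, 207609)
print(length)                  # 2145
print(protein_length(length))  # 714
```

Both values agree with the Gene Length and Protein Length fields reported for this record.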


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.641382
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 19
Fosmid unclonability p-value: 0.572781
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACCCCTC GACCCGAGGC CACAGCGCGC CCCCGTTCCC TCCCTCGTTG CCTGAAGGCC 
CTTCTCCTGG GATGCTCGGC TCTGGTGACA CTGCCGGGCG CAGCACTCGC GCAGGACGCC
TCACCGCAGG ACGCCGACAC CTACCGCCTC TCGCCCATCA TCGTCGATGC CGGCGCCGCG
GCCGATGATG ACGCCAATTC CATCGTCGCG CAGGAGCTGT GGGTGGGCGG CAAGGTCGCC
ACAAGCCTCC TCGACACGCC CGCCTCGGTC TCGGTCGTCA CGCAGAAGGA GATCCGGCAG
CGCAACGCGA CGACGACCGA GGAAGTGCTG CAATATTCCG CGGGCATCGT CTCGGACTAT
TACGGCAGCG ACGACCGCAA CGACTACTTC CTCGTCCGCG GCTTTCAGGC CTCGACCTAC
CGCGACGGCC TGACCCTCGG CTCGATGCGC GGCGTGCGGG AAGAGCCCTA TGCCTACGAG
CGGGTGGAGG TGCTGAAAGG TGCCAACTCC ACCCTCTTCG GCGTGTCCGA CCCCGGAGGA
TCGGTGAACT TCGTGACCAA GGCGCCGCAG TTCGCCCGCT TCGGCGAGAT CTACGGCCAG
ATCGGCACCC AGGACCACAA GGAGATCGGC GTCGATTTCG GTGACGTGCT GAACCCCGAT
GCCACGCTCG CCTACCGCTT CACCGCCAAG GTCAGGGACA GCGCGCTCGA TTACGACACG
TCGCGCGACG ATGAAACCTT CCTGATGGGC GGGCTGAGCT GGGCGCCGAG CGACGCCACC
ACCCTCTCGC TCATCTTCGA CCATCTGGAC CGCGATGGCA CGCCCAACAG CGGCGGCTAC
CCGCTGGACC GCGAGTACGA CCGGAGCAAC TTCTTCGGCG AAGCCGACTT CAACGACCAT
GATGTCGAAC GCGACACGCT GACCGCGATG CTCCGGCACG AGTTCGGCGG CGGGCTGAGC
CTGAGCGCGA ACCTGCGCTA CAGCGACCTC ACCGACGACT TCGGCTACAT CTATCTGTCG
GATTTCGCGG ACCGAACCGG CACGGAACTG TCGCGCTTCT ACTTCGGAAC CGACAGCACC
TCCGAAGAGC TGATCGGCAA CGCCATCCTG CAATATGACA CGCGCTTCGG CTCGACCGAC
AGCAGCACGC TGCTCGGGAT CGAATATCGC GACGCCTCGA CCAGCCAGAG CTCCTACTAC
GGGGCGGCGG GCACCATCGA CCTTGCCACC GGCATCGCGA CCGGCGTGCC GTCGGCCCTT
GCGCCCTACG AGGAGCGCGA GAGCGACTAC AAGACGAAGT CGGTGTTCCT GCAGCAGAAC
CTGTCCTTCG CGGACCGGAT CGTCGCGACG GTGGGTCTGC GCCACGACTG GCTGGATCTG
GCCACCCGCG GCCAGAGCTT CGGCGTGGCC TTCGACGATG CCGACGACTA CTCCGAAACC
TCGATCCGCG GCGCGCTGAC CTACAAGATC ACCGACGAGA TCTCGACCTA TGTCAGCTAT
GTCGAGTCGG TGGCCCCGCC CGCGATCGGG GTCGAGCCCG AGCGCGGCGA ACAGTATGAG
CTGGGCGTAA AATACCAGCC CACGGGCATG AACGCCCTTC TGTCGGCGGC CATCTTCGAT
CTGACCAAGA ACGACATCAC CATCCCCGTG GTTCTGGACG ACGGCACGAT CGAGCGGCAA
CTGATCGGCG AGACCCGCGT GCGCGGGGTC GAGATCGAGG GCAAGGCCGA GATCGCCGAG
AACTGGGACG TGATCGCCGC CTATTCCTAC CTCGACGCCG AGGCCACGCG GGCCGTCGTG
CGCGGCACCG ACGTTTCGGG CAACCGATTT GCCAACGTGC CCGAGCACAT GGCCTCGCTC
TGGGTGAACC GCACGCTGCC GGCGACGGAC GCGCGCGGCG AGATGACCTT CGGGGTCGGA
GCGCGTTATG TCGGATCGTA TCACTACGCG CTGCAGAACA ATACGGGGAA GAGCGAGTCG
ACAACCCTCT TCGACGCGGC CTTCAGCTAC GAGATCGCCG AGAACACCGG CATGGTCATC
AATGTGAGCA ACCTGTTCGA CAATCAACAT GTCGTGGGGC GCGGAACGGC CGACTACTAC
AACCCCGGCC GGTCGATCAC CGCGACGCTC CGCCGGACCT GGTGA
 
Protein sequence
MTPRPEATAR PRSLPRCLKA LLLGCSALVT LPGAALAQDA SPQDADTYRL SPIIVDAGAA 
ADDDANSIVA QELWVGGKVA TSLLDTPASV SVVTQKEIRQ RNATTTEEVL QYSAGIVSDY
YGSDDRNDYF LVRGFQASTY RDGLTLGSMR GVREEPYAYE RVEVLKGANS TLFGVSDPGG
SVNFVTKAPQ FARFGEIYGQ IGTQDHKEIG VDFGDVLNPD ATLAYRFTAK VRDSALDYDT
SRDDETFLMG GLSWAPSDAT TLSLIFDHLD RDGTPNSGGY PLDREYDRSN FFGEADFNDH
DVERDTLTAM LRHEFGGGLS LSANLRYSDL TDDFGYIYLS DFADRTGTEL SRFYFGTDST
SEELIGNAIL QYDTRFGSTD SSTLLGIEYR DASTSQSSYY GAAGTIDLAT GIATGVPSAL
APYEERESDY KTKSVFLQQN LSFADRIVAT VGLRHDWLDL ATRGQSFGVA FDDADDYSET
SIRGALTYKI TDEISTYVSY VESVAPPAIG VEPERGEQYE LGVKYQPTGM NALLSAAIFD
LTKNDITIPV VLDDGTIERQ LIGETRVRGV EIEGKAEIAE NWDVIAAYSY LDAEATRAVV
RGTDVSGNRF ANVPEHMASL WVNRTLPATD ARGEMTFGVG ARYVGSYHYA LQNNTGKSES
TTLFDAAFSY EIAENTGMVI NVSNLFDNQH VVGRGTADYY NPGRSITATL RRTW
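
The sequences above are laid out in space-separated blocks of ten characters, so they usually need to be joined before any analysis. The following is a minimal sketch, using only the Python standard library, of how the gene sequence could be cleaned and how a GC fraction comparable to the GC content field might be computed. The helper names are hypothetical, and the completeness check assumes an ATG start codon and the three stop codons of translation table 11 (TAA, TAG, TGA); table 11 also permits other start codons, which this check ignores.

```python
def clean_sequence(text: str) -> str:
    """Join a block-formatted sequence into one uppercase string."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


def looks_like_complete_cds(seq: str, stops=("TAA", "TAG", "TGA")) -> bool:
    """Rough check: length divisible by 3, ATG start (assumed), known stop codon."""
    return len(seq) % 3 == 0 and seq.startswith("ATG") and seq[-3:] in stops


# Demonstration on the first two blocks of the gene sequence only.
demo = clean_sequence("ATGACCCCTC GACCCGAGGC")
print(len(demo), round(gc_fraction(demo), 2))
```

Applying `clean_sequence` to the full gene sequence and passing the result to `gc_fraction` and `looks_like_complete_cds` would give a quick consistency check against the Gene Length and GC content fields.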