Gene Rsph17025_4246 details


Gene Information

Locus tag: Rsph17025_4246
Symbol:
ID: 5086427
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodobacter sphaeroides ATCC 17025
Kingdom: Bacteria
Replicon accession: NC_009431
Strand:
Start bp: 5264
End bp: 6952
Gene Length: 1689 bp
Protein Length: 562 aa
Translation table: 11
GC content: 67%
IMG OID: 640485807
Product: hypothetical protein
Protein accession: YP_001170401
Protein GI: 146280245
COG category: [N] Cell motility; [T] Signal transduction mechanisms
COG ID: [COG0840] Methyl-accepting chemotaxis protein
TIGRFAM ID:
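
The coordinates and lengths listed above can be cross-checked with simple arithmetic. The following is a minimal sketch in Python, assuming that the start and end positions are 1-based and inclusive and that the final codon of the CDS is a stop codon that is not counted in the protein length. Both assumptions are consistent with the values in this record, but the record does not state them. The function names are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS whose last codon is a stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(5264, 6952)
print(length_bp)                  # 1689, matching "Gene Length"
print(protein_length(length_bp))  # 562, matching "Protein Length"
```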


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value: 0.2464
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 16
Fosmid unclonability p-value: 0.120296
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTCCATCA AGATCAAGCT CGCCCTCGCC TTTCTGTTCG TCTTCACGCT CTTCCTGATC 
GCAACCGGCC TGTCGCTCCA GCGCCTGCAG GCGATGAATG CCCGCATCGA GACTCTGGCG
CAGGTCGAGT TCCCCGCTGC GGTTCTTGTC GAGCGGATGG TGGCCGAGCA GCAGCGGGAA
TCCGCAGCAC TCCGGGATCA CATGGTGGCG ACCGAGGATG CAGAGCTCGA GAAGTTCGAA
AAGGCGGTCT TTGCCGCCCG AAATGAGCGG GACTCCCTCC TCGCGGAGCT TCAGGCGCTG
GTATCCAGTG AGGCCGAGCG GGCGAAGCTT GTCCGGATCG GCGAACTGAA CATCGAGGGG
AACAGGCGCA ACGACGCCGC GCTGGAGCGG TCGAAGCTGT TCGACCTCGG CGGGGCCGCA
CATGTCCTGC ACGACCCGTC CGCAAGCGGC AGCCGTGCCG AAAGGTCGCG CCTCCTGAGC
GAGTTGCGAG AGCATCAGGC GGCCGACGTT GCCGCGGCTG TCAACGCGTC TCGACAGACC
TACAGCACGG CGCTGCGCGA CCTGCTGATC GCGGTGGGGA CGGCCATCGT GACCGGGGCG
ATCGCGGCCT TCCTGATCAC CCGCTCGATC GTGCGTGGCC TCCGCGAGGC GCTGGCTCTG
AGCGAACGGG TAGCCAACGG CGACCTGATC GCCACTTCGA ACCTACGCGG CCGCGACGAG
ATCGGCCGGC TGCTGACCGC GAACAACCGC ATGATCCTGA AACTGCGCGA GGTGGTCGGA
ACCGTCGGCG CATCGGTGCA GCAGGTGACC GCCAACAGTT CTGCGATGGC AGCGACGTCG
GAAGAACTGT CGCAGGGCGC GCAGGAACAG GCCTCGGCCA CCGAGCAGGC CTCGGCCTCT
GTCGAGCAGA TGGCGGCCAA CATCAAGCAG ACAGCGGACA ATGCCGGTGC GACCGAGATC
ATGGCCCGGA AATCGGCTGA ACGGGCGCGC GCCTCGGGGA CGGCGGTGGC CGAAGCGGTC
TCCGCCATGC AGGCGATCGC AGAGCGGATC AACGTCGTCC AGGACATCGC ACGTCAGACC
GACCTTCTGG CGCTGAACGC CGCGGTCGAG GCCGCACGCG CCGGCGAGCA TGGGCGCGGC
TTCGCGGTCG TGGCCTCCGA GGTCCGAAAG CTGGCCGAGC GCAGTCAGAC CGCCGCGAGT
GAAATCTCGG CTCTCTCGAC GCGCACGGTC CAGACCGCGA CCGCTGCCGG CCAGATGCTC
GGCGAACTCG TGCCCGACAT CGAGAGCACC TCCAACCTCG TGAGCGGCAT CTCGGTGGCC
TCGCGCGAAT TGGCGCTGGG AGCGCAGCAG GTGGCCACGG CCATCCAGCA GCTTGATACG
GTCACGCAGC AGACGACATC GGCATCAATT GAACTGGCGT CCGGGGCGGA GGTCCTCTCC
GGACAGTCCG AAGAACTGCA GCGAACGATG ACCCATTTCC GTCTGGAGGC GACAACCGCC
ACCTCCAAGG CACCGGCAGC AGGGTCGGCC GTGCATGTCG AGGGTCCGGC TCGATCCCCT
CGCGGGGCCA CCCCGAACCT CAAGGCAGCA CCCAAAGCGA AAGCACCCGG AGGCTTCGCC
TTGGACATGG AAAAAGGTGC GTCCGACGAC ATGCTCGACA AGGATTTCCG CCGCCATGAC
GCCGCATGA
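
The gene sequence is printed in blocks of 10 bases, so the spaces and line breaks have to be removed before any computation. The sketch below shows one way to do that in Python and to compute a GC percentage of the kind reported in the Gene Information section. The helper names are hypothetical, and only the first line of the listing is used as input here, so the printed value describes those 60 bases and is not expected to equal the figure reported for the whole gene.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a blocked sequence listing."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First line of the gene sequence, copied from the listing above.
first_line = "ATGTCCATCA AGATCAAGCT CGCCCTCGCC TTTCTGTTCG TCTTCACGCT CTTCCTGATC"
seq = clean_sequence(first_line)
print(len(seq))                   # 60
print(round(gc_percent(seq), 1))  # 51.7 (31 of 60 bases are G or C)
```

Pasting the full listing into `clean_sequence` gives the complete gene, whose length should equal the gene length reported in the Gene Information section.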
 
Protein sequence
MSIKIKLALA FLFVFTLFLI ATGLSLQRLQ AMNARIETLA QVEFPAAVLV ERMVAEQQRE 
SAALRDHMVA TEDAELEKFE KAVFAARNER DSLLAELQAL VSSEAERAKL VRIGELNIEG
NRRNDAALER SKLFDLGGAA HVLHDPSASG SRAERSRLLS ELREHQAADV AAAVNASRQT
YSTALRDLLI AVGTAIVTGA IAAFLITRSI VRGLREALAL SERVANGDLI ATSNLRGRDE
IGRLLTANNR MILKLREVVG TVGASVQQVT ANSSAMAATS EELSQGAQEQ ASATEQASAS
VEQMAANIKQ TADNAGATEI MARKSAERAR ASGTAVAEAV SAMQAIAERI NVVQDIARQT
DLLALNAAVE AARAGEHGRG FAVVASEVRK LAERSQTAAS EISALSTRTV QTATAAGQML
GELVPDIEST SNLVSGISVA SRELALGAQQ VATAIQQLDT VTQQTTSASI ELASGAEVLS
GQSEELQRTM THFRLEATTA TSKAPAAGSA VHVEGPARSP RGATPNLKAA PKAKAPGGFA
LDMEKGASDD MLDKDFRRHD AA
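
The protein sequence corresponds to the gene sequence translated with translation table 11. A minimal Python sketch of that translation follows. It relies on the fact that table 11 assigns the same amino acids to codons as the standard genetic code and differs only in which codons may act as start codons. Since this gene begins with ATG, no special handling of the start codon is included, which is a simplification. The names `CODON_TABLE` and `translate` are illustrative only.

```python
from itertools import product

# Codons in the conventional TCAG order, paired with their amino acids.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(dna: str) -> str:
    """Translate a DNA sequence codon by codon, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First line of the gene sequence and the final nine bases from the listing.
print(translate("ATGTCCATCA AGATCAAGCT CGCCCTCGCC TTTCTGTTCG TCTTCACGCT CTTCCTGATC"))
# MSIKIKLALAFLFVFTLFLI
print(translate("GCCGCATGA"))  # AA
```

The two outputs match the first 20 residues and the last 2 residues of the protein sequence above.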