Gene Rsph17029_2963 details

Gene Information

Locus tag: Rsph17029_2963
Symbol:
ID: 4897972
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodobacter sphaeroides ATCC 17029
Kingdom: Bacteria
Replicon accession: NC_009049
Strand:
Start bp: 3121416
End bp: 3122813
Gene Length: 1398 bp
Protein Length: 465 aa
Translation table: 11
GC content: 65%
IMG OID: 640113566
Product: hypothetical protein
Protein accession: YP_001044837
Protein GI: 126463723
COG category: [N] Cell motility
COG ID: [COG1749] Flagellar hook protein FlgE
TIGRFAM ID: [TIGR03506] flagellar hook-basal body proteins
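
The coordinate and length fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming that Start bp and End bp are 1-based inclusive positions and that the CDS ends in a single stop codon; the function names are hypothetical and not part of any database API.

```python
# Values copied from the Gene Information fields above.
START_BP = 3121416
END_BP = 3122813
GENE_LENGTH_BP = 1398
PROTEIN_LENGTH_AA = 465


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Residue count for a CDS, assuming its final codon is a stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


if __name__ == "__main__":
    # Both comparisons hold for the values listed in this record.
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under those assumptions, 3122813 - 3121416 + 1 gives 1398 bp, and 1398 / 3 - 1 gives 465 aa, which agrees with the listed Gene Length and Protein Length.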


Plasmid Coverage information

Num covering plasmid clones: 20
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 18
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACGATCT CTTCTTCGCT CAATGCGGGG GTGGCGGGAC TCAATGCCAA CGCCACCAAG 
CTCGCGACCA TTGCCGACAA TATCGCCAAT TCCGGCACCT ATGGGTACAA GCGCGCCGAT
GCCGATTTCG AGAGCATGGT CATCACGCAG TCGGCCAATG GCGGCTCCTA CGCCGCGGGC
GGGGTGCGGG CCAACACCAT ACGGCGGGTG GAGGAATGGG GCGCCCCCGT CTCGACCGCG
AGCGCCCTGG ATATCGCCAT CTCCGGCCGC GGGATGCTGC CCGTGATCCG CGAAACCTCC
CTCGGCGCCA GTTCGGGCAA TGAGGCGCTG CTGATGACGC GGACCGGCTC CTTCGCCACC
GATCCGAGCG GTGTGCTGAA GACGGACACC GGCCTCGTGC TCCTGGGCTG GCCTGCGACG
GCCGACGGCA CGATCCCCAC CATGTCGCGC GATTCCATTG CCCAGCTCGA GCCGGTCGTG
ATCGCCGCGA ACCAGACCGC GGCGGATCCC ACCACGCGGA TCAACCTCGG CGTCAACCTT
CCGGCCGAAG AGACCAAGTA CGGCGCCTCC GGTCGCGTTC TGACGGTCTC AGTGGAGTAT
TTCGGCAACC TCGGCACGTC GGAAACCCTG TCGGTCAGCT ACCATCCGAC GTTGCCGGCG
GCCGCCGGAG ACCCGCCGAC GAACAACTGG ACGATGGTCA TCCGGGATTC CGCCAATGCC
GACAAGGTCA TCGGCATCTA TACAATGCAA TTCAGCGATG CCCGCAGCGG CGGCGGCACG
ATCCTTGACG ATACCGCCAT CACCCCGCTC AACACGGTGA CGCTCGACGG CACCACCTAC
ACGGTCGAGA ATGCGGGCTC GACGCCCCCG GACAGCAGCC CGGCCTACGA TCCGCTGACG
GGCGGTATCG TTCTGGAAGT GGCGGGGAAT GCGGACAATA AGATCGCGAT GACCTTGGGC
AAGCCCGGCA GCGCGAACGC CATGACGCAG GTTTCAGGCG ATTTCGCCCA GACCGGGATC
ACCAAGGACG GCTCGCCGGT GGGCAACCTC ACGGGGGTCG AGATCGACGA CGGCGGCTTC
ATGCGCGCCA CCTACGACAC GGGGTTCACC CGCGTGATCT ACCAGGTGCC GCTGGTCGAC
GTGCCGAACT ACAACGGGCT CACCGCCCTG AACAACCAGA CCTTCGCGAT CTCGCCGGAT
TCAGGCAATT TCTACCTCTG GGACGCGGGC GATGGGCCCA CGGGCGACGT TCTGGGCTAT
GCGCGCGAGG GATCCACCAC TGATGTCGCG ACCGAGCTCA CCAACCTCAT CCAGACCCAG
CGCGCCTATT CCTCGAATGC CAAGATCATC CAGACGGTGG ATGAGATGCT GCAGGAAACC
ACCAATCTCA AGCGCTGA
 
Protein sequence
MTISSSLNAG VAGLNANATK LATIADNIAN SGTYGYKRAD ADFESMVITQ SANGGSYAAG 
GVRANTIRRV EEWGAPVSTA SALDIAISGR GMLPVIRETS LGASSGNEAL LMTRTGSFAT
DPSGVLKTDT GLVLLGWPAT ADGTIPTMSR DSIAQLEPVV IAANQTAADP TTRINLGVNL
PAEETKYGAS GRVLTVSVEY FGNLGTSETL SVSYHPTLPA AAGDPPTNNW TMVIRDSANA
DKVIGIYTMQ FSDARSGGGT ILDDTAITPL NTVTLDGTTY TVENAGSTPP DSSPAYDPLT
GGIVLEVAGN ADNKIAMTLG KPGSANAMTQ VSGDFAQTGI TKDGSPVGNL TGVEIDDGGF
MRATYDTGFT RVIYQVPLVD VPNYNGLTAL NNQTFAISPD SGNFYLWDAG DGPTGDVLGY
AREGSTTDVA TELTNLIQTQ RAYSSNAKII QTVDEMLQET TNLKR
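
The sequences above are printed in space-separated blocks of 10, so they need to be joined before any analysis. The following is a minimal sketch of joining the blocks, computing GC content, and translating the gene sequence. The helper names are hypothetical. The amino-acid assignments of translation table 11 are the same as those of the standard code, so the sketch uses that assignment string; it does not handle alternative start codons, which does not matter here because the gene begins with ATG.

```python
# Codon order follows the usual T, C, A, G layout of the genetic-code tables.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
)

# Map every codon to its amino acid ("*" marks a stop codon).
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def join_blocks(text: str) -> str:
    """Remove spaces and line breaks from a blocked sequence listing."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    protein = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        amino_acid = CODON_TABLE.get(seq[pos:pos + 3], "X")  # X = unknown codon
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


if __name__ == "__main__":
    # First three blocks of the gene sequence listed above.
    first_blocks = "ATGACGATCT CTTCTTCGCT CAATGCGGGG"
    print(translate(join_blocks(first_blocks)))  # MTISSSLNAG
```

The printed residues match the first block of the protein sequence. Applying `gc_percent` to the full joined gene sequence should give a value comparable to the GC content field, although that has not been computed here.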