Gene Rsph17029_3867 details


Gene Information

Locus tag: Rsph17029_3867
Symbol:
ID: 4898998
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodobacter sphaeroides ATCC 17029
Kingdom: Bacteria
Replicon accession: NC_009050
Strand:
Start bp: 994429
End bp: 995643
Gene Length: 1215 bp
Protein Length: 404 aa
Translation table: 11
GC content: 72%
IMG OID: 640114471
Product: forkhead-associated protein
Protein accession: YP_001045718
Protein GI: 126464605
COG category: [O] Posttranslational modification, protein turnover, chaperones
COG ID: [COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain
TIGRFAM ID:
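
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the last codon of the CDS is a stop codon that is not counted in the protein length. The function names are hypothetical and are not part of the source record.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length(gene_len: int) -> int:
    """Amino acid count for a CDS whose final codon is a stop codon (assumption)."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1


# Values taken from the record for Rsph17029_3867
length = gene_length(994429, 995643)
print(length)                  # 1215
print(protein_length(length))  # 404
```

Both results agree with the reported gene length (1215 bp) and protein length (404 aa).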


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value: 0.500687
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 18
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGATGAGAC TTCTCTGCGC GCTTGTCGGA TTGCTCTGGG CCGCGGCCGC CGCGGCACAG 
GATTTCAGCG CCGTCGCCCG GCTGACCGAC GCCTCGGTGG GCCGGATCTT CGTGGACCTT
CCCCGCGGAA TGAAGACCGG CGCCGGCATG GTCTTCGGCC AGGGCGCTGG GGGCAGGCTG
CTCTACCTGA CCAATTTCCA TGTTGTTCAG AACGGCGGCG ACATTGCCGT CTTCTTCAAG
AACGGCGATG CGATCTCGAT CTACAGCGGC ACGGTCCTTG CCGTCTCGCA GGAGAAGGAC
CTGGCGGCGC TGAGCCTCGA CCCCGAGGAG ATCGGCACGC CCTCGCCCCC GCCTCTCGCC
ATCGACACGC GACGGCTCGC GAAAGGCGAG GCCGTGGTGG CCATCGGCTA TCCCGCCGCG
GCCGACGCCA CCATGGGGCG CGACATGAAC ATCGCCGCCT TCGAGACCAC GCTGACCGGC
GGAAGTGTCA GCCGCGTGCT GAACGGCTCC TGGCGCGGGG CCTCGGCCGA GCTTGAGATC
GTCCAGCACA CCGCGCCGCT CAGCCCCGGC AATTCCGGCG GACCGCTGCT CGACCGGTGC
GGGCGGGTGA TCGGAGTGAA CACGAAGGGT GCGACGGACG CGGCCGGGAT CTACCTCGCC
TCCTCGGCCG GCACCATCGC GGACTTTCTG GCCGAGGCCG GGCTTCCGAT GCAGACGGCC
GGCGGCAGTT GCGGCGGAGA GCCGGCCTCT CCGCCTGCGG CGCCACCGGC TGCCACGGCG
ACGGACGGCC GGGTGCCGAT CTGGATGATC CTCGGCGGAA CCGGCGCCGC GCTGGCTTTG
GTCATGGCCG CGATGGCGGT TGCGGCCGGC GGCAGGGGCG GCGAGCCCGC GGCGCCCCGG
GTCGCGCTGA CGCTGACCAT CGCCACGGGC GGATCGCGCC AGCGGCGCGG CATCGGACGG
GCGAGCCTGC AACGCGGCGT TATCCTCGGA CGCGGCGGCG CGGCCGAGGT GGCGATCGAC
AGCCCGCGCG TCAGCCGCGA GCATCTGCGG CTGACGCTCG AGGGGCGGCG GCTGATGGCG
ACGGACCTCG GTTCGACCAA CGGGACGATG GTCGACGGCA GGCCGCTGCC ACCGAACCAG
CCGCGGCAGG TGGGCGAGGC CACGGTTCTG GTGCTGGGCG GCGAGGTCGA GATCCGGCTG
AGGGCGGGGT CATGA
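
The GC content reported in the record can be recomputed from the gene sequence. The following is a minimal sketch, assuming the sequence is pasted as plain text with the same space-separated blocks of ten bases used above. The function name `gc_content` is hypothetical.

```python
def gc_content(seq: str) -> float:
    """Return the percentage of G and C bases, ignoring spaces and line breaks."""
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for base in bases if base in "GC")
    return 100.0 * gc / len(bases)


# First ten bases of the gene sequence: 3 G + 1 C out of 10
print(gc_content("ATGATGAGAC"))  # 40.0
```

Passing the full gene sequence to the same function gives a value that can be compared with the GC content field of the record.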
 
Protein sequence
MMRLLCALVG LLWAAAAAAQ DFSAVARLTD ASVGRIFVDL PRGMKTGAGM VFGQGAGGRL 
LYLTNFHVVQ NGGDIAVFFK NGDAISIYSG TVLAVSQEKD LAALSLDPEE IGTPSPPPLA
IDTRRLAKGE AVVAIGYPAA ADATMGRDMN IAAFETTLTG GSVSRVLNGS WRGASAELEI
VQHTAPLSPG NSGGPLLDRC GRVIGVNTKG ATDAAGIYLA SSAGTIADFL AEAGLPMQTA
GGSCGGEPAS PPAAPPAATA TDGRVPIWMI LGGTGAALAL VMAAMAVAAG GRGGEPAAPR
VALTLTIATG GSRQRRGIGR ASLQRGVILG RGGAAEVAID SPRVSREHLR LTLEGRRLMA
TDLGSTNGTM VDGRPLPPNQ PRQVGEATVL VLGGEVEIRL RAGS
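
The protein sequence can be reproduced from the gene sequence by translating it codon by codon. The sketch below is a plain-Python illustration, not the method used by the database. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code (the two differ in which codons may act as start codons). The names `CODON_TABLE` and `translate` are hypothetical.

```python
BASES = "TCAG"
# Amino acids for all 64 codons, in TCAG order for the first, second and third base.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(seq: str) -> str:
    """Translate a CDS, ignoring whitespace and stopping at the first stop codon.

    Alternative start codons (e.g. GTG, TTG) are not converted to M here;
    this gene starts with ATG, so that special case does not arise.
    """
    dna = "".join(seq.split()).upper()
    if len(dna) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = []
    for pos in range(0, len(dna), 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 60 bases and last 15 bases of the gene sequence
print(translate("ATGATGAGAC TTCTCTGCGC GCTTGTCGGA TTGCTCTGGG CCGCGGCCGC CGCGGCACAG"))
# MMRLLCALVGLLWAAAAAAQ
print(translate("AGGGCGGGGT CATGA"))  # RAGS
```

The two outputs match the first 20 and last 4 residues of the protein sequence listed above.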