Gene RPC_3679 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPC_3679 
Symbol 
ID3969616 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris BisB18 
KingdomBacteria 
Replicon accessionNC_007925 
Strand
Start bp4096451 
End bp4099750 
Gene Length3300 bp 
Protein Length1099 aa 
Translation table11 
GC content65% 
IMG OID637926789 
Producttrehalose synthase-like 
Protein accessionYP_533533 
Protein GI90425163 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG0366] Glycosidases
[COG3281] Uncharacterized protein, probably involved in trehalose biosynthesis 
TIGRFAM ID[TIGR02456] trehalose synthase
[TIGR02457] trehalose synthase-fused probable maltokinase 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones19 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAATATCA TGCCCTCGAT CGAGCCCAAC GGCTCGCAGT CCAGCGTCGA GACCGACGAA 
TTGTGGTACA AAGACGCCAT CATCTATCAG CTGCACGTCA AGGCGTTTGC CGACAGCAAC
AATGACGGCA TCGGCGATTT CGCCGGCCTG ACCGAAAAGC TGGACTATCT GCAGGAGCTC
GGCGTCACCG CGCTGTGGCT GATGCCGTTC TATCCGTCGC CCGGCCGCGA CGACGGCTAC
GACATCGCCG ATTACGGCGC GGTCAGCCCC GACTACGGCA CCATGAAGGA TTTCCGCCGC
TTCATCGTCG AGGCGAAGAA GCGCGGGCTG CGCGTCATCA CCGAATTGGT CATCAACCAC
ACCTCGGATC AGCACGACTG GTTCAAACGC GCCCGCCGCA GCCCGAAGGA CTCGAGCGCG
CGGGACTGGT ACGTCTGGAG CGACACCGAC CAGAAATATG CCGGCACCCG CATCATCTTC
ACCGACACCG AGAAGTCGAA CTGGTCGTGG GACCCCGAGG CCAAGGCGTA TTATTGGCAC
CGCTTCTTCT CGCACCAGCC GGATCTGAAC TTCGACAACC CGCGGGTGGT GTCCGCGGTG
ATCCAGGTGA TGAAGCGCTG GCTCGACACC GGGGTCGACG GCTTCCGGCT CGACGCCATC
CCCTATCTGT GCGAGCGCGA CGGCACCAAC AACGAGAACC TCCCGGAGAC CCACGCGGTC
ATCAAGACGT TGCGCGCCGA ACTCGACGCC TATGCCAAGG GCAAGGTGCT GCTCGCCGAG
GCCAACCAGT GGCCGGAGGA CGTCCAGCAA TATTTCGGCC AGGGCGACGA ATGCCACATG
GCCTATCATT TCCCGCTGAT GCCCCGAATC TACATGGCGC TGGCGCAGGA AGACCGCTTT
CCGATCACCG ACATCCTGCG GCAGACCCCG GACATCCCGG CGAATTGCCA ATGGGCGATG
TTCCTGCGCA ACCATGACGA ACTGACGCTG GAAATGGTCA CCGATGTCGA GCGCGATTAT
CTGTGGTCGA CCTATGCCAA CGACCCGCGG GCGCGGATCA ATGTCGGCAT CCGCAGGCGC
CTTGCGCCAT TGATGGACAA CGACCGCCGC AAGATCGAAC TGATGAATTC GCTGCTGCTG
TCGTTTCCCG GCACGCCGAT CATCTATTAC GGCGACGAAA TCGGGATGGG CGACAACATC
TATCTCGGCG ACCGCAACGG CGTGCGCACC CCGATGCAGT GGACGCCCGA CCGCAACGGC
GGCTTCTCCC GCGCCGATCC CGCACGGCTC TATGCGCCGA CCATCATGGA CCCGGTGTAC
GGCTACGAGG CGGTCAACGT CGAAGCGCAG TCGCGCAGCC TGTCGTCGCT GCTCAGCGCC
ACCAAGCGGC TGATCTCGGT GCGCAAGTCG ACGCCGGCGT TCGGCCGCGG CACCATGAGT
TTCATCCGGC CGTCGAACCG CGCGGTGTTG GCCTATGTCC GGCAATACGA GGACGAAGTG
ATTCTCTGCG TCGCCAATCT GTCACGCTCC GCGCAGGCCA CCGAACTCGA TCTGTCGGCC
TGGAAAGACC GGATCCCGCA GGAAATGCTG GGACGAACCC GGTTTCCGGC GATCGGCGAA
CTGCCCTACA TGATCACGCT GGCGCCCTAC GGGTTCTATT GGTTCCGGCT GTGCGAGCGC
GACGCCTCGC AGCATCCGGC GCCCTCCGTG GTGCCGGAAT TCGAGACCCT GGTGGTGCCG
CTCGGCGCCA CCTGGATGAC GCTCGGCCGC ACCCGCGGCG TGTTCGAGCG CGACGTGCTG
CCGGCGCATC TGGCGCGCAC CCGGTGGTAT CCCGAACGCT CGGCGCGGGC GATCCATCCG
CGGCTGACCT CGGCGATCCC GTTCTGCATC GAGGGCGACA ACCGGCCCTG GCTGGCGTTC
TTCGAAGCCA CCCAGCGCGG GGTGACCTCG CGTTACGTGC TGCCGATGCA GATCGCCTGG
GTGCGGTTCG ATCGCGAACG CTTCAATCCG CGCGCCTTCG CCGCGGTGCG CCAGGGCGCC
CGCGAAGGCA CCCTGCTCGA CGTCGCCTCC GATCCCGACT TCGTGACGCT GCTGTTGCAG
AATCTGCGGG CGTCGCTGAC CGTCGAAGAG CAAGGCGTGC GGCTGGTGTT CCGGCCGACC
GCGCGGTTCG CCGAGAAGCC GGACAAGCCG TTGCAGAACA TTCGCGCCGT CGAAACCGAG
CAGTCCAACA GCACCACGCT GGTCGATGGC GATTATGTGG TGAAACTCTA TCGCCGGCTG
GAGACCGGGA TCAATCCCGA GATCGAGATG GGCCGCTTCC TGACCGACGT CGCCGGCTTC
GCCAACACCC CGGCGCTGAT GGGCAGCGTC GAACTGATCG AGGGCGACGC CACCAGCGCG
GTGGCGGTGG TGCACGCCTT CGTCGGCAAC CAGGGCGACG GCTGGACCGT GACCCAGGCC
TATCTCGACC GCTTCGTCGA CGAGCAGCGG CTGCTGACGA CCGCCGAGCA GCCCGGCGAG
AGCGACGAAC AGGTGCCATA TCTGCGCTAC ATGTCGCAAA CCGGTAAACG CGTCGCAGAG
ATGCATGTCG CGCTGGCCGC GCATCCGGAA GAGCCGGAGT TCGCTCCCGA ACCGATCACC
GCCGAGCAAG TGCAGGGCTG GGTCGACAGC GTCACCGGCT ACGCCGAACG GGTGATCGAC
GACTTGAAGC GGCGGCGCGA TCAGCTCAAG GAGGGCGATC GCCGGCTGGT CGACCAATTG
GCGGCGCTGC GCGAATCGAT CCGCGAACGG CTGCGCAGCC TGCTGCACGA AGACGGCGGC
GGCCTCAACA TCCGCCACCA CGGCGATTTT CATCTCGGCC AGATGCTGAT CGTCAAGGAC
GACATCTCGA TCATCGACTT CGAAGGCGAG CCGCGGCGCA GCCAGGCCGA ACGACGCCGC
AAGGCGCCGG CGGCGCGCGA CGTCGCCGGA TTGATCCGTT CGATTGACTA TTCGGCCACC
GCGGCGTTGG ATCGCGCGCT GAAAGCCGCC CCGGACGAGC AGGGCAAGCT TGCCTCCGCG
CTCGAAGCGT GGCGTAACCG GTCGACCGCC GCGTTTCTCA CGGCCTACCG CGAGTCCATG
ACCGATACCC GGTTATGGCC TGGTCATCCT GATGCAGCCG GTGGGATTCT CGATTTCTTC
CTGCTAGAAA AAGCCTTCTA CGAAATCGAA TACGAACTCG CCCACCGCCC CGATTGGCTC
CGCGTCCCGC TTGCCGGCGT CATTCGAATT TTGTCCCAGC GTTCCCAGGA GGTCACATGA
 
Protein sequence
MNIMPSIEPN GSQSSVETDE LWYKDAIIYQ LHVKAFADSN NDGIGDFAGL TEKLDYLQEL 
GVTALWLMPF YPSPGRDDGY DIADYGAVSP DYGTMKDFRR FIVEAKKRGL RVITELVINH
TSDQHDWFKR ARRSPKDSSA RDWYVWSDTD QKYAGTRIIF TDTEKSNWSW DPEAKAYYWH
RFFSHQPDLN FDNPRVVSAV IQVMKRWLDT GVDGFRLDAI PYLCERDGTN NENLPETHAV
IKTLRAELDA YAKGKVLLAE ANQWPEDVQQ YFGQGDECHM AYHFPLMPRI YMALAQEDRF
PITDILRQTP DIPANCQWAM FLRNHDELTL EMVTDVERDY LWSTYANDPR ARINVGIRRR
LAPLMDNDRR KIELMNSLLL SFPGTPIIYY GDEIGMGDNI YLGDRNGVRT PMQWTPDRNG
GFSRADPARL YAPTIMDPVY GYEAVNVEAQ SRSLSSLLSA TKRLISVRKS TPAFGRGTMS
FIRPSNRAVL AYVRQYEDEV ILCVANLSRS AQATELDLSA WKDRIPQEML GRTRFPAIGE
LPYMITLAPY GFYWFRLCER DASQHPAPSV VPEFETLVVP LGATWMTLGR TRGVFERDVL
PAHLARTRWY PERSARAIHP RLTSAIPFCI EGDNRPWLAF FEATQRGVTS RYVLPMQIAW
VRFDRERFNP RAFAAVRQGA REGTLLDVAS DPDFVTLLLQ NLRASLTVEE QGVRLVFRPT
ARFAEKPDKP LQNIRAVETE QSNSTTLVDG DYVVKLYRRL ETGINPEIEM GRFLTDVAGF
ANTPALMGSV ELIEGDATSA VAVVHAFVGN QGDGWTVTQA YLDRFVDEQR LLTTAEQPGE
SDEQVPYLRY MSQTGKRVAE MHVALAAHPE EPEFAPEPIT AEQVQGWVDS VTGYAERVID
DLKRRRDQLK EGDRRLVDQL AALRESIRER LRSLLHEDGG GLNIRHHGDF HLGQMLIVKD
DISIIDFEGE PRRSQAERRR KAPAARDVAG LIRSIDYSAT AALDRALKAA PDEQGKLASA
LEAWRNRSTA AFLTAYRESM TDTRLWPGHP DAAGGILDFF LLEKAFYEIE YELAHRPDWL
RVPLAGVIRI LSQRSQEVT