Gene RSP_2446 details


Gene Information

Locus tag: RSP_2446
Symbol:
ID: 3720043
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodobacter sphaeroides 2.4.1
Kingdom: Bacteria
Replicon accession: NC_007493
Strand:
Start bp: 1080739
End bp: 1084047
Gene Length: 3309 bp
Protein Length: 1102 aa
Translation table: 11
GC content: 68%
IMG OID: 640070627
Product: putative trehalose synthase
Protein accession: YP_352508
Protein GI: 77463004
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG0366] Glycosidases
        [COG3281] Uncharacterized protein, probably involved in trehalose biosynthesis
TIGRFAM ID: [TIGR02456] trehalose synthase
            [TIGR02457] trehalose synthase-fused probable maltokinase
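
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, not part of the original record. It assumes the start and end positions are 1-based and inclusive, and that the CDS length includes one terminal stop codon (the gene sequence listed in this record ends in TGA). The function names are hypothetical and chosen only for illustration.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 1080739
end_bp = 1084047
gene_length_bp = 3309
protein_length_aa = 1102


def span_length(start, end):
    """Length of a 1-based, inclusive coordinate range (assumed convention)."""
    return end - start + 1


def protein_length_from_cds(cds_length_bp):
    """Number of codons minus one, assuming a single terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Both checks pass for the values listed in this record.
assert span_length(start_bp, end_bp) == gene_length_bp
assert protein_length_from_cds(gene_length_bp) == protein_length_aa
```

Under these assumptions, End bp - Start bp + 1 gives the listed gene length of 3309 bp, and 3309 / 3 - 1 gives the listed protein length of 1102 aa.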


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.500894
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAATCAGA ATGTCTCTCC CTCCACCGTT GCCGTCCGCC GCACGGACGG CATCGATCGC 
AGCCAAACCG ACTGGTACAA GGACGCGATC ATCTATCAGC TGCACATCAA GGCGTTTCAG
GACGCCAATG GCGACGGGAT CGGTGACTTC GCGGGCCTGA TGCAGCGGCT GGATTATGTG
CAGGCGCTGG GCGTGACCGC GATCTGGCTG CTGCCTTTCT ATCCCTCGCC GCTCCGCGAC
GACGGCTACG ACATCTCGGA CTATCGCTCG ATCAACCCGT CCTACGGCGC GATGCGCGAC
TTCAAGCTGT TCGTGCAGGA AGCGCACAAG CGCGGGCTGC GCGTCATCAC CGAGCTGGTC
ATCAACCACA CCTCCGACCA GCATCCCTGG TTCCAGCGGG CCCGGCGGGC GAAGAAGGGA
TCGGCCGCGC GCGACTGGTA CGTCTGGAGC GATACCGACC AGAAATTCCC CGAAACGCGG
ATCATCTTCC TCGATACGGA AAAGTCGAAC TGGACCTGGG ATCCGGTGGC AGGCGCCTAT
TACTGGCACC GCTTCTATTC GCACCAGCCC GACCTGAACT TCGACAATCC CCGTGTGCTC
GAGGAAGTGC TCAAGATCAT GCGCATGTGG CTCGAGATGG GCGTCGACGG GCTCCGGCTC
GACGCGATCC CCTATCTGGT CGAGCGCGAG GGCACCAACA ACGAGAACCT CCCCGAGACG
CACGATGTTC TCAAGAAGAT CCGGGCCAAC CTCGACCAGC ATTTCCCCGA CAGGATGCTG
CTCGCCGAGG CGAACCAGTG GCCGGAGGAC ACGCGCCCCT ATTTCGGCGA GGGCGACGAA
TGCCACATGG GCTTCCATTT CCCGCTGATG CCGCGGATGT ACATGGCGCT GGCGCAGGCC
GACCGGCATC CGATCACCGA CATCATCCGC CAGACGCCCG AGATCCCGGA AGGCTGCCAA
TGGGGCATCT TCCTCCGGAA CCACGACGAG CTGACGCTCG AGATGGTGAC GGCCGAAGAG
CGCGACTACA TGTGGCGCTT CTACGCCGAA GATTCGCGGG CGCGGATCAA TCTCGGCATC
CGGCGCAGGC TGGCGCCGCT GATGAAGAAC GACCGGCGCA AGATCGAGCT TCTGAACCAG
ATGCTGATGT CGATGCCCGG CACGCCCATC GTTTATTACG GCGACGAGAT CGGCATGGGC
GACAACTACT ATCTCGGCGA CCGGGATGGC GTGCGGACGC CCATGCAATG GTCGGCCGAC
CGCAACGGCG GCTTCTCGCG CTGCAATCCG CAGCAGCTCT ACCTGCCGAC GATCCTCGAT
CCGGTCTACG GCCATCAGGC GATCAACGTC GAGGCGCAGG CGGCGGACCC GTCGTCGCTC
CTGAACTGGA CGCGCCGCCT GATCGCGGTG CGCAAGCAGC ATCCGGCCTT CGGGCGCGGC
TCGATGAGCC TGCTCTATCC GCGCAACCGC AAGGTGCTGG CCTATGTCCG CAGCCACGAG
GGCACGGACA TCCTCTGCGT GGCGAACCTC TCGGACACGG CGCAGGCGGT CGAGCTGGAT
CTCGGGCGGT TCCGCGGCGC GGTGCCGGTC GAGCTCACGG GCCGGTCGGA GTTTCCGCCG
GTGGGCGATC TGCCCTACAT GCTGACGCTG CCGCCCTACG GCTTCTACTG GTTCATCCTC
TCCGACCAGC AGGCGCTGCC GAGCTGGCAC CAGCCGATGC CCGAGACGCT GCCGGATTTC
ATCACGCTCA CGACGCGCGA CGGGCGGGCC GAGACGGCGC TCACCGGGCG CGAGACGCGC
CAGCTCGAGG CCGACGTGCT GCCGAACTGG CTGCCGCTGC AACGCTGGTT CGGGGCCAAG
GAGGAGAAGA TCAGCGCGGT GAAACTGGCC GTTCTGGGCT CGCTGTCGGC CGATCATGCC
CTCGTCCGGC TCGAGGCCGA TGTGGGCGGC GAGGTGCAGC AGTATTTCCT GCCCGCCTCC
GCACTCTGGG GCGAGGAGCA GCTCCGGGCC GGGGCGCCCA AGCTGAGCTT CACCCTGGCC
AAGGTCCGGC GCGGGCCGCA GGTGGGCGCG CTCATCGACG GGGCCTATGA CGAGCAGATG
GCGCAGGCGA TGCTCGAGGC GCTGCGCGAC GGTCGCAGAC TGTCTGGTGC CGGGGGCGAG
GTGGTGTTCG AGCCGGGCTC GGGACTGGCC GAGATCGCCG ACCCGGGCGA GCCGCGCTAT
CTCGGGGCCG AGCAGTCGAA CATTTCCATC GCCTTCGGCG ACCGCCTGAT CCTGAAGCTC
TACCGACGCC TGCGGGCCGG CGAGCAGCCG GACGTCGAGG TGGCGCGCTT CCTGACGGAA
GTGGCGGGCT ACACCCATAC GCCCCGCTAT CTCGGCGTCG TGAGCCTGCG CCCGGCGGAG
GGCGAGGCGA CCGTCCTCGC CGCGGCCTTC GCCTTCGTGG CCAACATGGG CGACGCCTGG
CGCGGCCTCT TCGACGCGCT GGTGCGGGAT CTGGGCGAGT TCGGCGGCTG GAGCACGGCC
GAGGCGCCGG AAGTGGAGAA GGACGGCTTC TCCTTCCCGC TCACCATTCC GGGCCTTCTC
GGCCAGCGCA CGGCCGAGCT GCATCAGGCT TTCGCCACCG AGACGGACGA GGCGGCCTTC
CGGATGGAGC CGCTCACGGC CCAGCATCTG ACCGGCTGGG CCGAGGGGGC GCGGCGCGAG
GCCGAGGCCA TGCTGTCGGT GCTCGAACGG CAGCGGACGC ACCTGCCCGA CGAGGTGCTG
CCCCTGGCCG AGGCGCTGCT CGCGCGGCGC GAAGACCTGC TGGCGCGGCT CGACCGCGTT
ACCGGCTTCG AACCCTCGGG CGCGCTGACC CGCATCCATG GCGACTACCA TCTGGGGCAG
GTGCTGCTTG CGCAGGACGA TGTCGCCATC ATCGACTTCG AGGGCGAACC GCGCCGCACG
CTTGCAGAGC GTCGCGAGAA GTCCTCGCCG CTCCGCGACG TGGCGGGGAT GCTGCGTTCG
TTCGACTATG CCGCCGCCGC CGCCCTCGCA CGCCACGAGG AAAGCTTCGG CCCGGCGAGC
GAGCGGGCGG TGGAGCGGGC CGAGGCCTGG CGGCAACAGG CGGTGGCCGA TTTCCTTGCC
GCCTACGAGG GCGCGTCCGC GGGCACCGCG AGCCTGCCCT CCGACCCGGC GCTGAAGGAG
GCGCTGCTCG ATCTCTTCCT CGTGCAGAAG GCGGTCTACG AGACCTCCTA CGAACTGGGC
AGCCGTCCCG CCTGGGTCGG CATCCCGCTG CGCGGGCTTT TGGAACTGCT GGACAGGAAA
CCTGCATGA
 
Protein sequence
MNQNVSPSTV AVRRTDGIDR SQTDWYKDAI IYQLHIKAFQ DANGDGIGDF AGLMQRLDYV 
QALGVTAIWL LPFYPSPLRD DGYDISDYRS INPSYGAMRD FKLFVQEAHK RGLRVITELV
INHTSDQHPW FQRARRAKKG SAARDWYVWS DTDQKFPETR IIFLDTEKSN WTWDPVAGAY
YWHRFYSHQP DLNFDNPRVL EEVLKIMRMW LEMGVDGLRL DAIPYLVERE GTNNENLPET
HDVLKKIRAN LDQHFPDRML LAEANQWPED TRPYFGEGDE CHMGFHFPLM PRMYMALAQA
DRHPITDIIR QTPEIPEGCQ WGIFLRNHDE LTLEMVTAEE RDYMWRFYAE DSRARINLGI
RRRLAPLMKN DRRKIELLNQ MLMSMPGTPI VYYGDEIGMG DNYYLGDRDG VRTPMQWSAD
RNGGFSRCNP QQLYLPTILD PVYGHQAINV EAQAADPSSL LNWTRRLIAV RKQHPAFGRG
SMSLLYPRNR KVLAYVRSHE GTDILCVANL SDTAQAVELD LGRFRGAVPV ELTGRSEFPP
VGDLPYMLTL PPYGFYWFIL SDQQALPSWH QPMPETLPDF ITLTTRDGRA ETALTGRETR
QLEADVLPNW LPLQRWFGAK EEKISAVKLA VLGSLSADHA LVRLEADVGG EVQQYFLPAS
ALWGEEQLRA GAPKLSFTLA KVRRGPQVGA LIDGAYDEQM AQAMLEALRD GRRLSGAGGE
VVFEPGSGLA EIADPGEPRY LGAEQSNISI AFGDRLILKL YRRLRAGEQP DVEVARFLTE
VAGYTHTPRY LGVVSLRPAE GEATVLAAAF AFVANMGDAW RGLFDALVRD LGEFGGWSTA
EAPEVEKDGF SFPLTIPGLL GQRTAELHQA FATETDEAAF RMEPLTAQHL TGWAEGARRE
AEAMLSVLER QRTHLPDEVL PLAEALLARR EDLLARLDRV TGFEPSGALT RIHGDYHLGQ
VLLAQDDVAI IDFEGEPRRT LAERREKSSP LRDVAGMLRS FDYAAAAALA RHEESFGPAS
ERAVERAEAW RQQAVADFLA AYEGASAGTA SLPSDPALKE ALLDLFLVQK AVYETSYELG
SRPAWVGIPL RGLLELLDRK PA
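
Both sequences above are printed in space-separated blocks of 10 residues, 60 per line. As a minimal sketch, assuming the text has been pasted into a Python string, the blocks can be joined and the length and GC fraction computed as follows. The helper names are hypothetical, and only the first line of the gene sequence is used as sample input, so the printed GC fraction describes that line and not the whole gene.

```python
def join_blocks(blocked):
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(blocked.split()).upper()


def gc_fraction(dna):
    """Fraction of bases that are G or C."""
    if not dna:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in dna) / len(dna)


# First line of the gene sequence from this record, used as sample input.
first_line = "ATGAATCAGA ATGTCTCTCC CTCCACCGTT GCCGTCCGCC GCACGGACGG CATCGATCGC"
dna = join_blocks(first_line)
print(len(dna), round(gc_fraction(dna), 3))
```

Applying the same helpers to the complete gene sequence should give a length equal to the Gene Length field. The GC fraction can then be compared against the listed GC content, keeping in mind that the page reports a rounded percentage.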