Gene Rsph17025_3017 details


Gene Information

Locus tag: Rsph17025_3017
Symbol:
ID: 5084616
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodobacter sphaeroides ATCC 17025
Kingdom: Bacteria
Replicon accession: NC_009428
Strand:
Start bp: 3083246
End bp: 3086629
Gene Length: 3384 bp
Protein Length: 1127 aa
Translation table: 11
GC content: 71%
IMG OID: 640484588
Product: transglutaminase domain-containing protein
Protein accession: YP_001169206
Protein GI: 146279047
COG category: [E] Amino acid transport and metabolism; [S] Function unknown
COG ID: [COG1305] Transglutaminase-like enzymes, putative cysteine proteases; [COG4196] Uncharacterized protein conserved in bacteria
TIGRFAM ID:
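
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based, inclusive coordinates and that the final codon of the CDS is a stop codon; both assumptions are consistent with the reported values but are not stated by the record itself. The function names are illustrative only.

```python
def gene_length_bp(start: int, end: int) -> int:
    """Length of a feature given 1-based, inclusive start and end coordinates."""
    return end - start + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS whose last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


length = gene_length_bp(3083246, 3086629)
print(length)                     # 3384
print(protein_length_aa(length))  # 1127
```

Both results agree with the Gene Length and Protein Length fields of the record.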


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0756013
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 17
Fosmid unclonability p-value: 0.525776
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCCCATT GGCAGCCCAC CGCGATCCGT GGAACCGTCA GGCCGACCGG GCAACGGACG 
AGCGAGATGT CGATCAAGGC CAGCATCCAC CACCTGACCC ATTACCGATA CGACCGGCCG
GTGACGCTCG GGCCGCAGGT GATCCGCCTG CGCCCCGCGC CCCATTCCCG CACGCGCGTC
ATCTCGCATT CGCTGAAGGT CTCGCCCGGC GGCCATTTCG AGAGCCACCA GCAGGACCCT
TACGGCAACT GGCTGACGCG GATCGTCTTT CCCGAGCCGG TGACCGAGTT CCGCATCGAG
GTCGATCTGG TCGCCGATAT GGCGGTCTAC AACCCGTTCG ACTTCTTCGT CGAAGAGTCG
GCCGAGCACT GGCCCTTCGA TTATCCCGAC GACATCCTCG AGGATCTCTC GATCTACCGC
ACGCCCGAAC CGGCGGGGCC GCACCTGCGC GCCCTTCTCG ACACGATCCC GCGCGAGCGG
CGGCGGACGG TGGACATGGT CGTCGATCTG AACGCCCGCC TCGCGCGCGA GATCGCCTAT
GTGATCCGCA TGGAGCCCGG CGTCTTCAGC CCGGAGGAGA CGCTGGCCCA GGGCCGCGGC
TCGTGCCGTG ACAGCACCTG GCTGCAGGTG CAGATCCTGC GCCATCTGGG GTTCGCCGCC
CGCTTCGTCT CGGGCTACCT GATCCAGCTG AAGCCCGACC TCGAGGCGCT GGATGGTCCT
TCGGGCACCG ATCACGATTT CACCGACCTG CACGCCTGGG CCGAGGTCTA TCTGCCCGGC
GCGGGATGGA TCGGGCTCGA CGCGACCTCG GGGCTCCTGA CGGGCGAGAG CCACATCCCG
CTGGCCGCGA CCCCGCACTA CCGCAACGCA GCCCCCATTG CCGGCATGGC GAGCTACGCC
GAGGTGGATT TCGCCTTCGA CATGACGGTG GCCCGCGTCG CCGAGCATCC GCGGATCACC
AAGCCCTTCT CCGAGGACAG CTGGGAGCGT CTGAACGCGC TGGGGCACAG GGTCGATGCG
GCGCTGACGG CGGGCGACGT CCGGCTGACG ATGGGGGGCG AGCCCACCTT CGTCTCGATC
GACGATTTCG AGGCCGGCGA GTGGAACACC GAGGCCGTGG GTCCCACCAA GCGCGCCTTT
GCCGATCGGC TGATCCGGCG GCTGCGGGAC CGGTTTGCGC CGGGTGGCTT CCTGCATTAC
GGGCAGGGCA AATGGTATCC GGGCGAGACC CTGCCGCGCT GGACCTTCTC GCTTTACTGG
CGCGAGGATG GCCAGCCGAT CTGGCACGAT CCCGCGCTCG TGGCGGCCGA GGCGGCCCCG
GCCGACCTTG GCGCGGCCGA GGCCGAGCGT CTCATGCAGG GGATCGCCGC GGAACTGGGG
CTCGAGCCTG ATCTCGTGGT GCCCGCTTAC GAGGATCCGG GCGAGTGGCT GCTGAAGGAG
GGCAACCTCC CCGAGAACGT GACGCCCGAA AACTCGGAAC TGAAGGACCC CGAAGAGCGG
CTGCGCATGG TCCGCGTCTT CGAGCGTGGC CTGACCGAAC CTTCGGGCTT CGTGCTGCCG
GTGCAGCGGT GGCAGGCGCA GGCCGCCGGC CGGCGCTGGC GGTCCGAGCG GTGGAAGCTG
CGGCGCCGGC ATCTGTTCCT CGTGCCCGGC GACAGCCCGG TGGGCTACCG CCTGCCGCTC
GGCTCGCTGC CCCATGTGCC GCCCTCGCGC TATCCCTACA TCAACCCGAC CGATCCCACG
GTCGAGCGCG GGCCGCTGCC GCCCCCGGCG CAGACCGTGC CGCTCCAGAC GCCCGAAGCC
GCAGTGGCCG CCTTCACCGC TGCGACGCCC GGCCAGACCT TGGTCGAGCA GATCCTCGGC
GACGAAGGCG CGGTTCGCAC GGCGCTGGCG GTCGAGGTGC GGGACGGGCG GCTCTGCATC
TTCATGCCGC CCGTCGAGGC GGTCGAGGAT TATCTCGATC TCGTGGCTGC GGCCGAGGCG
GCGGCGGCCC GGCTGAACCT GCCGGTCCAT GTCGAGGGCT ACGCGCCCCC GCACGACCCC
CGGCTGAAGG TGATCCGCGT CGCCCCCGAT CCGGGCGTGA TCGAGGTGAA CATCCATCCG
GCCGCGAGCT GGGAGGAGTG CGTCTCGATC ACCACCGCCG TCTACGAGGA GGCGCGCCAG
TGCCGCCTCG GCGCCGACAA GTTCATGATC GACGGCCGCC ATTGCGGCAC GGGGGGCGGC
AACCATGTGG TGGTGGGGGG GCGGACGCCG ATGGACTCGC CCTTCCTGCG CCGGCCCGAT
CTGCTGCGCA GCCTGATCCT GCACTGGAAC CGGCACCCGT CGCTCTCCTG CCTCTTCTCG
GGCCTCTTCA TCGGCCCGAC GAGTCAGGCG CCCCGGATCG ACGAGGCCCG CCACGACAGC
CTCTACGAGC TTGAGATCGC GCTGGCGCAG ATCCCGGGGC CGCAGGACCC GCGCGCGCCG
CTCTGGCTGC CCGACCGGCT CCTGCGCAAC ATCCTGACCG ACGTCACCGG CAACACCCAC
CGGGCCGAGA TCTGCATCGA CAAGATGTTC TCGCCCGACG GACCGACGGG CCGCCTCGGT
CTGGTGGAGT TCCGCGGCTT CGAGATGCCG CCCGATCCGC GGATGAGCCT CGCCCAGCAG
CTTCTGATCC GCGCCCTCAT CGCGCGGATG TGGCAGAGCC CGGTCACCGG CGCCCTGACC
CGCTGGGGCA CGGCGCTGCA TGACCGCTTC ATGCTCCAGC ACCATGTCTG GGAGGATTTC
CTCGACGTGC TGGCCGACCT TCGGGCCCAC GGCTTCGACC TCGACCCCGA GTGGTTCCGG
GCGCAGGCCG AATTCCGCTT CCCCTTCTGC GGCGAGGTGA CCTGCGAAGG CGCGCATCTC
GAGATCCGTC AGGCGCTCGA ACCCTGGCAT GTGCTGGGCG AGACCGGCGC CATCGGCGGC
ACGGTGCGCT ACACCGACAG CTCGACCGAG CGGTTGCAGG TGACGCTCTC GGGCGCCGAT
CCGGCGCGCT ACCGCGTGGC CTGCAACGGG CGCGAGGTGC CGCTGCAGCC CGTGGCCAAC
GGCCGGGCGG TGGCGGGGGT GCGGTTCAAG GCCTGGCAGC CGGCGGCGGC GCTGCATCCG
AACCTGCCCG TCGATGCGCC GCTGACCTTC GACCTCTACG ACACCTGGTC GGGGCGCGCA
CTGGGCGGCT GCGTCTATCA CGTCGCCCAT CCCGGCGGGC GCAACTACGA GACCTTCCCC
GTGAACGGCA ACGAGGCGGA GGCGCGCCGC CTGGCCCGCT TCACGCCCCA CGGCCACAGC
GCCGGGCACT GGCCCCTTCG GCCCGAGCGT CCGCACCCCG AGTTTCCGAT GACGCTCGAC
CTGCGCCGGC CTGCGGGGCT CTGA
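
The gene sequence is printed in blocks of ten bases, so the spaces and line breaks have to be removed before its length or GC content can be compared with the Gene Length and GC content fields. A minimal sketch, assuming the sequence has been pasted into a Python string (the helper names are hypothetical):

```python
def clean_sequence(text: str) -> str:
    """Remove the spaces and line breaks used to group bases in blocks of ten."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    seq = clean_sequence(seq)
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# First line of the gene sequence above: six blocks of ten bases.
first_line = "ATGGCCCATT GGCAGCCCAC CGCGATCCGT GGAACCGTCA GGCCGACCGG GCAACGGACG"
print(len(clean_sequence(first_line)))  # 60
```

Applied to the full sequence, `len(clean_sequence(...))` should match the reported gene length, and `gc_fraction(...)` should be close to the reported GC content once rounded to a whole percentage.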
 
Protein sequence
MAHWQPTAIR GTVRPTGQRT SEMSIKASIH HLTHYRYDRP VTLGPQVIRL RPAPHSRTRV 
ISHSLKVSPG GHFESHQQDP YGNWLTRIVF PEPVTEFRIE VDLVADMAVY NPFDFFVEES
AEHWPFDYPD DILEDLSIYR TPEPAGPHLR ALLDTIPRER RRTVDMVVDL NARLAREIAY
VIRMEPGVFS PEETLAQGRG SCRDSTWLQV QILRHLGFAA RFVSGYLIQL KPDLEALDGP
SGTDHDFTDL HAWAEVYLPG AGWIGLDATS GLLTGESHIP LAATPHYRNA APIAGMASYA
EVDFAFDMTV ARVAEHPRIT KPFSEDSWER LNALGHRVDA ALTAGDVRLT MGGEPTFVSI
DDFEAGEWNT EAVGPTKRAF ADRLIRRLRD RFAPGGFLHY GQGKWYPGET LPRWTFSLYW
REDGQPIWHD PALVAAEAAP ADLGAAEAER LMQGIAAELG LEPDLVVPAY EDPGEWLLKE
GNLPENVTPE NSELKDPEER LRMVRVFERG LTEPSGFVLP VQRWQAQAAG RRWRSERWKL
RRRHLFLVPG DSPVGYRLPL GSLPHVPPSR YPYINPTDPT VERGPLPPPA QTVPLQTPEA
AVAAFTAATP GQTLVEQILG DEGAVRTALA VEVRDGRLCI FMPPVEAVED YLDLVAAAEA
AAARLNLPVH VEGYAPPHDP RLKVIRVAPD PGVIEVNIHP AASWEECVSI TTAVYEEARQ
CRLGADKFMI DGRHCGTGGG NHVVVGGRTP MDSPFLRRPD LLRSLILHWN RHPSLSCLFS
GLFIGPTSQA PRIDEARHDS LYELEIALAQ IPGPQDPRAP LWLPDRLLRN ILTDVTGNTH
RAEICIDKMF SPDGPTGRLG LVEFRGFEMP PDPRMSLAQQ LLIRALIARM WQSPVTGALT
RWGTALHDRF MLQHHVWEDF LDVLADLRAH GFDLDPEWFR AQAEFRFPFC GEVTCEGAHL
EIRQALEPWH VLGETGAIGG TVRYTDSSTE RLQVTLSGAD PARYRVACNG REVPLQPVAN
GRAVAGVRFK AWQPAAALHP NLPVDAPLTF DLYDTWSGRA LGGCVYHVAH PGGRNYETFP
VNGNEAEARR LARFTPHGHS AGHWPLRPER PHPEFPMTLD LRRPAGL
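
The protein sequence can be reproduced from the gene sequence by codon-wise translation. The sketch below uses only the standard library and a codon table written in the NCBI layout (bases ordered T, C, A, G); translation table 11 assigns the same amino acids and stop codons as the standard code and differs only in the start codons it permits, which does not matter here because the gene begins with ATG. The `translate` helper is illustrative and is not part of any database tool.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for codons TTT, TTC, TTA, TTG, TCT, ... GGG (NCBI ordering).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str, to_stop: bool = True) -> str:
    """Translate a DNA string codon by codon, ignoring spaces and line breaks."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE.get(seq[i:i + 3], "X")  # X marks an unrecognised codon
        if aa == "*" and to_stop:
            break
        protein.append(aa)
    return "".join(protein)


# First 21 bases and last 24 bases of the gene sequence above.
print(translate("ATGGCCCATT GGCAGCCCAC C"))     # MAHWQPT
print(translate("CTGCGCCGGC CTGCGGGGCT CTGA"))  # LRRPAGL
```

The two fragments reproduce the first seven and last seven residues of the protein sequence listed above.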