Gene Rsph17029_2614 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRsph17029_2614 
Symbol 
ID4897377 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodobacter sphaeroides ATCC 17029 
KingdomBacteria 
Replicon accessionNC_009049 
Strand
Start bp2755377 
End bp2758700 
Gene Length3324 bp 
Protein Length1107 aa 
Translation table11 
GC content70% 
IMG OID640113214 
Producttransglutaminase domain-containing protein 
Protein accessionYP_001044488 
Protein GI126463374 
COG category[E] Amino acid transport and metabolism
[S] Function unknown 
COG ID[COG1305] Transglutaminase-like enzymes, putative cysteine proteases
[COG4196] Uncharacterized protein conserved in bacteria 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value0.95635 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCGATCA AGGCGAGCAT TCACCACCTG ACCCACTACC GCTACGACCG TCCGGTGACG 
CTCGGGCCGC AGCTGATCCG CCTGCGCCCC GCGCCCCATT CCCGCACGCG GGTGATCTCG
CATGCGCTGA AGGTCTCGCC GGGGGGGCAT TTCGAGAATC ATCAGCAGGA TCCCTACGGC
AACTGGCTGC TGCGCGTGGT CTTTCCCGAG CCGGTGACCG AGTTCCGCAT CGAGGTCGAT
CTGGTGGCCG ACATGACCGT CTACAATCCG TTCGACTTCT TCGTCGAGGA GACGGCCGAG
CACTGGCCCT TCGACTATCC CGAGGAGATC GTCGAGGATC TCTCGATCTA CCGCAGCCCC
GAACCTGCGG GGCCGCATCT TCAGGCCTTC CTTCAGACGA TCCCGCGCGA CCGGCAGCGG
ACCGTCGACA TGGTGGTCGG GCTGAACGCG CGCCTCGCCC GCGAGATCGC CTATGTGATC
CGAATGGAGC CCGGCGTCTT CAGCCCCGAG GAGACGCTGG CCCAGAGGCG CGGCTCGTGC
CGCGACAGCG CCTGGCTTCA GGTGCAGATC CTGCGCCACC TGGGCCTCGC CGCCCGCTTC
GTCTCGGGCT ACCTGATCCA GCTCAAGCCC GATCTCGAGG CGCTGGACGG GCCGTCGGGG
ACCGATCACG ATTTCACCGA CCTCCATGCC TGGGCCGAGG TCTATCTGCC GGGAGCGGGC
TGGATCGGGC TCGACGCCAC CTCGGGGCTT CTGACGGGCG AGAGCCACAT TCCGCTGGCC
GCGACCCCGC ATTACCGCAA CGCCGCCCCC ATCGCCGGCA TGGCGAGCTA TGCCGAGGTG
GATTTCGCCT TCGACATGAA GGTGACCCGC GTGGCCGAGC ATCCGCGGAT CACGAAACCC
TTTTCCGACG AGAGCTGGCA GCGGCTCGAT GCGCTGGGCC ACCGGGTCGA TGCGGCGCTG
AGGGCGGGCG ACGTGCGGCT CACCATGGGG GGCGAGCCCA CCTTCGTCTC GATCGACGAT
TTCGAATCGG GCGAGTGGAA CACCGATGCC GTCGGCCCCA CCAAGCGCGT CCTCGCCGAC
CGGCTGATCC GGCGGCTCCG CGACCGGTTC GCGCCGGGGG GCTTCCTGCA TTACGGGCAG
GGGAAATGGT ATCCGGGCGA GACCCTGCCG CGCTGGACCT TCTCGCTCTA CTGGCGCGAG
GACGGGGCGC CCATCTGGCG CGACGGCGCG CTGGTGGCGG GCGAGACGGG GCAGGGCGGC
GTGGGGCCGG CCGAGGCGGA GCGGCTGATG CAGGGCATCG CGGCCGAGCT CGGGCTCGAG
CCCGACCTCG TGGTGCCCGC CTACGAGGAT CCGGGCGAGT GGCTTCTGAA GGAGGCGAAC
CTTCCCGAAA ATGTGACGCC CGAGAATTCG GAGCTGAAGG ACCCCGAAGA GCGGCTGCGC
ATGGCCCGCG TCTTCGAGCG CGGTTTGACC GAGCCCTCGG GCTTCGTCCT GCCGGTGCAG
CGCTGGCAGG CGCAGGCGCC GCGCCCGCGC TGGCGGTCCG AACGGTGGCG GCTGCGGCGC
AGGCACCTGT TCCTCGTGCC CGGCGACAGC CCGGTGGGCT ACCGCCTGCC GCTGGGCGCG
CTGCCCCACA TCCCGGCCTC GCGCTACCCC TACATCAACC CGACCGATCC CACGGTCGAG
CGCGGGCCGC TGCCGCCCGC GGGCGAGGCG CAGATCGTGC CGCTCCAGAC CCCCGAGGCC
GCGGTGGCGA GCTTCACCGC CTCGGCGCCG GGCCAGACGA TGGTCGAGCA GATCCTCGGC
GACGAGGGCG CCGTGCGCAC CGCGCTCGCC GTCGAGGTGC GGGACGGGCG GCTCTGCATC
TTCATGCCGC CGGTGGAGGC GGTCGAGGAT TACCTCGACC TCCTGACCGC CGCCGAGGAG
GCCGCGCGCA AGCTGGGCCT GCCGGTCCAT GTCGAAGGCT ATGCGCCGCC GCACGATCCG
CGGCTGAACG TGATCCGCGT GGCGCCCGAT CCGGGCGTGA TCGAGGTGAA CATCCATCCC
GCCACCAGCT GGGAGGAGTG CGTGTCGATC ACGACGGCGG TCTACGAGGA AGCCCGCCAG
TGCCGCCTCG GCGCCGACAA GTTCATGATC GACGGCAAGC ATTGCGGCAC CGGCGGCGGC
AACCATGTCG TCGTGGGCGG GCGCACGCCC ATGGACTCGC CCTTCCTGCG GCGGCCGGAC
CTGCTGCGCA GCCTGATCCT GCACTGGAAC CGGCATCCGT CGCTCTCCTA CCTCTTCTCG
GGCCTCTTCA TCGGCCCGAC CAGTCAGGCG CCGCGCATCG ACGAGGCGCG CCACGACAGC
CTCTTTGAGC TGGAAATCGC GCTGTCGCAG ATCCCGGAGC CCGGCGATCC GCGCGCGGCC
CTCTGGCTGC CCGACCGGCT TCTGCGCAAC ATCCTGACCG ACGTGACCGG CAACACCCAC
CGCGCCGAGA TCTGCATCGA CAAGATGTTC TCGCCCGACG GGCCCACCGG GCGGCTCGGC
CTCGTCGAGT TCCGCGGCTT CGAGATGCCG CCCGACCCGC GCATGAGCCT CGCCCAGCAG
CTTCTGATCC GCGCCCTCAT CGCGCGCATG TGGCAGAACC CGGTGACGGG GCCGCTCACC
CGCTGGGGCA CGGCGCTGCA CGACCGTTTC ATGCTCCAGC ATTACGTCTG GGAGGATTTC
CTGGACGTGC TGGCCGATCT GCGGGCACAC GGGTTCGATC TCGACCCGGA ATGGTTCCGG
GCGCAGGCCG AGTTCCGCTT CCCCTTCTGC GGCGAGGTGA CCTACGAGGG CGCGCATCTC
GAGATCCGGC AGGCGCTCGA GCCATGGCAT GTGCTGGGCG AGACGGGCGC CATCGGGGGG
ACGGTGCGCT ACACCGACAG TTCGACCGAG CGGCTGCAGG TGACGCTCTC GGGCGCCGAT
CCCGCGCGCT ACCGCGTGGC CTGCAACGGG CGCGAGGTGC CGCTCGTGCC GGTGGCCAAT
GGCTGCGCCG TGGCGGGGGT GCGGTTCAAG GCCTGGCAGC CCGCCGCGGC GCTGCATCCG
ACCCTGCCCG TCGATGCGCC GCTCACCTTC GACATCTACG ACACTTGGTC GGGCCGGTCG
CTCGGCGGCT GCGTCTATCA TGTGGCCCAT CCCGGCGGGC GCAACTACGA GACCTTCCCG
GTGAACGGCA ACGAGGCCGA GGCGCGCAGG CTTGCGCGCT TCCAGCCCCA CGGGCACAGT
GCCGGCCTCT GGCCGCTCGC GCCCGAGCGG CCGCACCCGG AGTTTCCGAT GACGCTCGAC
CTGAGACGGC CCGCGGGGCT CTGA
 
Protein sequence
MSIKASIHHL THYRYDRPVT LGPQLIRLRP APHSRTRVIS HALKVSPGGH FENHQQDPYG 
NWLLRVVFPE PVTEFRIEVD LVADMTVYNP FDFFVEETAE HWPFDYPEEI VEDLSIYRSP
EPAGPHLQAF LQTIPRDRQR TVDMVVGLNA RLAREIAYVI RMEPGVFSPE ETLAQRRGSC
RDSAWLQVQI LRHLGLAARF VSGYLIQLKP DLEALDGPSG TDHDFTDLHA WAEVYLPGAG
WIGLDATSGL LTGESHIPLA ATPHYRNAAP IAGMASYAEV DFAFDMKVTR VAEHPRITKP
FSDESWQRLD ALGHRVDAAL RAGDVRLTMG GEPTFVSIDD FESGEWNTDA VGPTKRVLAD
RLIRRLRDRF APGGFLHYGQ GKWYPGETLP RWTFSLYWRE DGAPIWRDGA LVAGETGQGG
VGPAEAERLM QGIAAELGLE PDLVVPAYED PGEWLLKEAN LPENVTPENS ELKDPEERLR
MARVFERGLT EPSGFVLPVQ RWQAQAPRPR WRSERWRLRR RHLFLVPGDS PVGYRLPLGA
LPHIPASRYP YINPTDPTVE RGPLPPAGEA QIVPLQTPEA AVASFTASAP GQTMVEQILG
DEGAVRTALA VEVRDGRLCI FMPPVEAVED YLDLLTAAEE AARKLGLPVH VEGYAPPHDP
RLNVIRVAPD PGVIEVNIHP ATSWEECVSI TTAVYEEARQ CRLGADKFMI DGKHCGTGGG
NHVVVGGRTP MDSPFLRRPD LLRSLILHWN RHPSLSYLFS GLFIGPTSQA PRIDEARHDS
LFELEIALSQ IPEPGDPRAA LWLPDRLLRN ILTDVTGNTH RAEICIDKMF SPDGPTGRLG
LVEFRGFEMP PDPRMSLAQQ LLIRALIARM WQNPVTGPLT RWGTALHDRF MLQHYVWEDF
LDVLADLRAH GFDLDPEWFR AQAEFRFPFC GEVTYEGAHL EIRQALEPWH VLGETGAIGG
TVRYTDSSTE RLQVTLSGAD PARYRVACNG REVPLVPVAN GCAVAGVRFK AWQPAAALHP
TLPVDAPLTF DIYDTWSGRS LGGCVYHVAH PGGRNYETFP VNGNEAEARR LARFQPHGHS
AGLWPLAPER PHPEFPMTLD LRRPAGL