Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rpal_4275 |
Symbol | |
ID | 6411959 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris TIE-1 |
Kingdom | Bacteria |
Replicon accession | NC_011004 |
Strand | - |
Start bp | 4600078 |
End bp | 4603419 |
Gene Length | 3342 bp |
Protein Length | 1113 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 642714157 |
Product | transglutaminase domain protein |
Protein accession | YP_001993246 |
Protein GI | 192292641 |
COG category | [E] Amino acid transport and metabolism [S] Function unknown |
COG ID | [COG1305] Transglutaminase-like enzymes, putative cysteine proteases [COG4196] Uncharacterized protein conserved in bacteria |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.0684346 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGTCGATCT ATGTCGCCCT TCATCACGTC ACGCATTACA AATACGACCG TCTGGTCGAC ATCGGTCCTC AGACCATCCG GCTGCGTCCG GCGCCGCATA CGCGGACGCC GATTCTGTCG TATTCGCTGA AGGTCACGCC GGCGAACCAC TTCATCAATT GGCAGCAGGA CCCGCAGGGC AACTGGCTGG CGCGGTTCGT GTTTCCTGAG AAGGCGGACG AACTCAAGAT CGAGGTCGAT TTCACCGCGG CGATGACGGT GATCAACCCG TTTGACTTCT TCGTCGAGAG CTACGCCGAG AGCTTCCCGT TCTCATATAC CGGCGACCTG CAGCACGAGC TGGCGCCATA TCTGGCGACG ACCGAGCCGG GGCCGCTGTT CAAAGCCTAT CTCGATTCGA TTCCGCGTGA AGCGGAAAGC ACCGTCAACT TCCTGGTCGA CCTCAACGCC AAACTGCGCG AGCGGGTCAA CTACATCATC CGGATGGAAC CAGGGGTGCA GACGCCAGAG GAGACGCTGG CCAAAGGCGC CGGCTCGTGC CGCGACTCGG CGTGGCTGCT GATCCAGACG CTGCGGCATC TCGGTCTCGC GGCGCGGTTC GTGTCCGGTT ACTTGATCCA GCTTCGCCCC GACATCGAGT CGCTCGACGG CCCGAAGGGC GCCACGCACG ACTTCACCGA TCTGCACGCC TGGGCCGAAG TGTACCTGCC CGGCGCCGGC TGGGTCGGCT TCGACGTGAC TTCGGGATTG CTTGCGGGCG AGGGCCACAT CCCGGTCGCC GCCACGCCGC ATTATCGCAC GGCGGCGCCG ATATCCGGCG TGGTCGGCTT CGCCAATGTC GATTTCAAAT TCGACATGCG GGTCGCGCGC ATCCGCGAAG CGCCGCGGAT CACCATGCCG TTCTCCGACG AATCCTGGGC GAGGCTGGAT GCGCTCGGCG AAAAGGTCGA TGCCGATCTG GTCGCGCACG ACGTGCGGCT GACGATGGGC GGCGAGCCGA CCTTCGTATC GATCGACGAT CTGGAATCGC CGGAGTGGAA CGTCGCCGCG GTCGGCGGCG CCAAGCGGAT GCTGGCGGAC GATCTGATCC GGCGCCTGCG CACGCGGTTC GCACCCGGCG GCCTGCTGCA TTTCGGCCAG GGCAAATGGT ATCCGGGCGA AAGCCTGCCG CGCTGGGCGT TCGGTCTGTA TTGGCGCAAG GACGGCGTGC CGATCTGGAA CAACGCCGAA CTGATCGCGC CGGTGGTCGG CCAGCGGCCG GCGAAGGTCG AGGAGGCCGA GCAGTTCGCG ATCGGCACCG CGAAGCGGCT CGGCATCGAC ACCGACTACG TGCTGCCGGC CTATGAGGAT CCGAACCACT GGCTGCAGAA GGAAGCCGCG CTGCCGCCGA ATGTCGATCC GCAGGATAAC AAGCTGTCCG ATCCGGAAGA GCGTGCCCGG ATGGCGCGGG TGTTCGACAC CGGGCTGAAT ACGCCGCGCG GTTTCGTGCT GCCGATCCAG GCGTGGAATG CGGAAGCGAC GCCGGCGCAG AAGAAGCGCT GGCGCAGCGA GCGCTGGAAG CTGCGGCGCG GCAATCTGTT CCTGCTGCCG GGCGACTCGC CGCTCGGTTT CCGGCTGCCG ATTTCGTCGC TGCCGCACAT TCCGGAAGAC GACTATCCGT TCATCGTGCC GCGCGATCCG CTCGAGCCGC GCGGCACGCT GCCGCTGTTC GCGCCGCCGC CGGCGAACGA TGCCGATCCC GAGCGCGAGC AGATGCCGGT GTTCGAACAG TCGGCCGGCG AGGCGACGAC AAGTCAGCCG GTCGAGGAGC AGAAGCTTCG CAAAGGAGGG GTGCGCACCG CGATGTCGAT CGAGCTGCGC GAGGGCGTGC TCTGCGCCTT CATGCCGCCG ACAGAAACCA TCGAGGACTA TCTCGAGCTG ATCGCCGCGG TCGAAGCCAC CGCCGAAGAG ATGCAGATCC GGGTCCACAT CGAAGGCTAT CCGCCGCCTT ACGATCCGCG CATCGACGTC ATCAAGGTGA CGCCCGACCC GGGCGTGATC GAAGTCAACA TTCAGCCGGC CTCGAGCTGG CGCGAGGCAG TGCGGACCAC CTTCGGCCTG TATGAGGACG CCGCGCAGGT GCGGCTCGGC GCCAACCGCT TCCTGATCGA TGGCCGCCAC ACCGGCACCG GCGGCGGCAA CCACGTCGTG GTCGGCGGCG CCAGCCCCGC GGACTCTCCG TTCTTGAGAA GGCCGGATCT ACTCAAGAGT CTGGTGCTGT TCTGGCAGCG GCATCCGTCA CTGTCGTATC TGTTCTCCGG GATGTTCATC GGCCCGACCA GCCAGGCGCC GCGGATCGAC GAGGCGCGGC ACGATTCGCT GTATGAACTC GAAATCGCGC TGACCCAGGT GCCGCCGCCG GGTGTGAAGG GGCCGCTGTG GCTTGCTGAT CGACTATTCC GCAACATCCT GGTCGACATC ACCGGCAACA CCCATCGCGC CGAGATCTGC ATCGACAAGA TGTACTCGCC GGATAGTCCG ACCGGGCGTC TCGGTCTGGT CGAGTTCCGC GCGCTGGAGA TGCCGCCCGA TCCGCGGATG TCGCTAGCGC AGCAGCTTCT GATCCGCGCG CTGATCGCGA TGCTGTGGCG CGAGCCGCTC TCCGGCAAGT TCGTCCGCTG GGGCACGGCG CTGCACGACC GCTTCATGCT GCCGCATTAT TTGTGGGAGG ACTTCCGCGA CGTGCTCGGC GAGCTCGCAC GCGCCGGCTA TGCGTTCGAG TCGGAGTGGT TCACCGCGCA GCTCGAGTTT CGTTTTCCCG TGTTCGGCAG CGTGTATCAC GGCGGTGTCA CATTGGAGGT GCGGCAGGCG TTGGAGCCGT GGCACGTGCT GGGCGAAGAG GGGACCGCGG GCGGCACGGT GCGCTTTGTC GACTCGTCGG TGGAGCGGCT GCAGGTCAAG GCCGAGGGCT TCGTCGAGGG CCGCCACGTC ATCACCTGCA ACGGCCGCCG GCTGCCGATG ACGGCGACCG CGCGCTCCGG CGAAGCGGTG GCGGCGGTGC GGTTCAAGGC GTGGCAGCCG GCCTCCGGGC TGCATCCGAC CATTCCGGTG CACGCGCCGC TGGTGTTCGA CATCGTCGAC ACCTGGAACG GCCGCTCGCT CGGCGGCTGC GTCTATCACG TCGCCCATCC TGGCGGCCGC GCCTACGAGA CCAAGCCGGT GAACTCGTAC GAAGCCGAGG CCCGGCGGCT GGCCCGGTTC CAGGATCACG GCCACACCCC GGGCCGGATC GATTCGCCGC ATGAAGAACG CACACTTGAA TTCCCGCTGA CCCTCGACTT GCGCACGCCA CTGCTGCATT GA
|
Protein sequence | MSIYVALHHV THYKYDRLVD IGPQTIRLRP APHTRTPILS YSLKVTPANH FINWQQDPQG NWLARFVFPE KADELKIEVD FTAAMTVINP FDFFVESYAE SFPFSYTGDL QHELAPYLAT TEPGPLFKAY LDSIPREAES TVNFLVDLNA KLRERVNYII RMEPGVQTPE ETLAKGAGSC RDSAWLLIQT LRHLGLAARF VSGYLIQLRP DIESLDGPKG ATHDFTDLHA WAEVYLPGAG WVGFDVTSGL LAGEGHIPVA ATPHYRTAAP ISGVVGFANV DFKFDMRVAR IREAPRITMP FSDESWARLD ALGEKVDADL VAHDVRLTMG GEPTFVSIDD LESPEWNVAA VGGAKRMLAD DLIRRLRTRF APGGLLHFGQ GKWYPGESLP RWAFGLYWRK DGVPIWNNAE LIAPVVGQRP AKVEEAEQFA IGTAKRLGID TDYVLPAYED PNHWLQKEAA LPPNVDPQDN KLSDPEERAR MARVFDTGLN TPRGFVLPIQ AWNAEATPAQ KKRWRSERWK LRRGNLFLLP GDSPLGFRLP ISSLPHIPED DYPFIVPRDP LEPRGTLPLF APPPANDADP EREQMPVFEQ SAGEATTSQP VEEQKLRKGG VRTAMSIELR EGVLCAFMPP TETIEDYLEL IAAVEATAEE MQIRVHIEGY PPPYDPRIDV IKVTPDPGVI EVNIQPASSW REAVRTTFGL YEDAAQVRLG ANRFLIDGRH TGTGGGNHVV VGGASPADSP FLRRPDLLKS LVLFWQRHPS LSYLFSGMFI GPTSQAPRID EARHDSLYEL EIALTQVPPP GVKGPLWLAD RLFRNILVDI TGNTHRAEIC IDKMYSPDSP TGRLGLVEFR ALEMPPDPRM SLAQQLLIRA LIAMLWREPL SGKFVRWGTA LHDRFMLPHY LWEDFRDVLG ELARAGYAFE SEWFTAQLEF RFPVFGSVYH GGVTLEVRQA LEPWHVLGEE GTAGGTVRFV DSSVERLQVK AEGFVEGRHV ITCNGRRLPM TATARSGEAV AAVRFKAWQP ASGLHPTIPV HAPLVFDIVD TWNGRSLGGC VYHVAHPGGR AYETKPVNSY EAEARRLARF QDHGHTPGRI DSPHEERTLE FPLTLDLRTP LLH
|
| |