Gene Rpal_4275 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRpal_4275 
Symbol 
ID6411959 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris TIE-1 
KingdomBacteria 
Replicon accessionNC_011004 
Strand
Start bp4600078 
End bp4603419 
Gene Length3342 bp 
Protein Length1113 aa 
Translation table11 
GC content66% 
IMG OID642714157 
Producttransglutaminase domain protein 
Protein accessionYP_001993246 
Protein GI192292641 
COG category[E] Amino acid transport and metabolism
[S] Function unknown 
COG ID[COG1305] Transglutaminase-like enzymes, putative cysteine proteases
[COG4196] Uncharacterized protein conserved in bacteria 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0684346 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGTCGATCT ATGTCGCCCT TCATCACGTC ACGCATTACA AATACGACCG TCTGGTCGAC 
ATCGGTCCTC AGACCATCCG GCTGCGTCCG GCGCCGCATA CGCGGACGCC GATTCTGTCG
TATTCGCTGA AGGTCACGCC GGCGAACCAC TTCATCAATT GGCAGCAGGA CCCGCAGGGC
AACTGGCTGG CGCGGTTCGT GTTTCCTGAG AAGGCGGACG AACTCAAGAT CGAGGTCGAT
TTCACCGCGG CGATGACGGT GATCAACCCG TTTGACTTCT TCGTCGAGAG CTACGCCGAG
AGCTTCCCGT TCTCATATAC CGGCGACCTG CAGCACGAGC TGGCGCCATA TCTGGCGACG
ACCGAGCCGG GGCCGCTGTT CAAAGCCTAT CTCGATTCGA TTCCGCGTGA AGCGGAAAGC
ACCGTCAACT TCCTGGTCGA CCTCAACGCC AAACTGCGCG AGCGGGTCAA CTACATCATC
CGGATGGAAC CAGGGGTGCA GACGCCAGAG GAGACGCTGG CCAAAGGCGC CGGCTCGTGC
CGCGACTCGG CGTGGCTGCT GATCCAGACG CTGCGGCATC TCGGTCTCGC GGCGCGGTTC
GTGTCCGGTT ACTTGATCCA GCTTCGCCCC GACATCGAGT CGCTCGACGG CCCGAAGGGC
GCCACGCACG ACTTCACCGA TCTGCACGCC TGGGCCGAAG TGTACCTGCC CGGCGCCGGC
TGGGTCGGCT TCGACGTGAC TTCGGGATTG CTTGCGGGCG AGGGCCACAT CCCGGTCGCC
GCCACGCCGC ATTATCGCAC GGCGGCGCCG ATATCCGGCG TGGTCGGCTT CGCCAATGTC
GATTTCAAAT TCGACATGCG GGTCGCGCGC ATCCGCGAAG CGCCGCGGAT CACCATGCCG
TTCTCCGACG AATCCTGGGC GAGGCTGGAT GCGCTCGGCG AAAAGGTCGA TGCCGATCTG
GTCGCGCACG ACGTGCGGCT GACGATGGGC GGCGAGCCGA CCTTCGTATC GATCGACGAT
CTGGAATCGC CGGAGTGGAA CGTCGCCGCG GTCGGCGGCG CCAAGCGGAT GCTGGCGGAC
GATCTGATCC GGCGCCTGCG CACGCGGTTC GCACCCGGCG GCCTGCTGCA TTTCGGCCAG
GGCAAATGGT ATCCGGGCGA AAGCCTGCCG CGCTGGGCGT TCGGTCTGTA TTGGCGCAAG
GACGGCGTGC CGATCTGGAA CAACGCCGAA CTGATCGCGC CGGTGGTCGG CCAGCGGCCG
GCGAAGGTCG AGGAGGCCGA GCAGTTCGCG ATCGGCACCG CGAAGCGGCT CGGCATCGAC
ACCGACTACG TGCTGCCGGC CTATGAGGAT CCGAACCACT GGCTGCAGAA GGAAGCCGCG
CTGCCGCCGA ATGTCGATCC GCAGGATAAC AAGCTGTCCG ATCCGGAAGA GCGTGCCCGG
ATGGCGCGGG TGTTCGACAC CGGGCTGAAT ACGCCGCGCG GTTTCGTGCT GCCGATCCAG
GCGTGGAATG CGGAAGCGAC GCCGGCGCAG AAGAAGCGCT GGCGCAGCGA GCGCTGGAAG
CTGCGGCGCG GCAATCTGTT CCTGCTGCCG GGCGACTCGC CGCTCGGTTT CCGGCTGCCG
ATTTCGTCGC TGCCGCACAT TCCGGAAGAC GACTATCCGT TCATCGTGCC GCGCGATCCG
CTCGAGCCGC GCGGCACGCT GCCGCTGTTC GCGCCGCCGC CGGCGAACGA TGCCGATCCC
GAGCGCGAGC AGATGCCGGT GTTCGAACAG TCGGCCGGCG AGGCGACGAC AAGTCAGCCG
GTCGAGGAGC AGAAGCTTCG CAAAGGAGGG GTGCGCACCG CGATGTCGAT CGAGCTGCGC
GAGGGCGTGC TCTGCGCCTT CATGCCGCCG ACAGAAACCA TCGAGGACTA TCTCGAGCTG
ATCGCCGCGG TCGAAGCCAC CGCCGAAGAG ATGCAGATCC GGGTCCACAT CGAAGGCTAT
CCGCCGCCTT ACGATCCGCG CATCGACGTC ATCAAGGTGA CGCCCGACCC GGGCGTGATC
GAAGTCAACA TTCAGCCGGC CTCGAGCTGG CGCGAGGCAG TGCGGACCAC CTTCGGCCTG
TATGAGGACG CCGCGCAGGT GCGGCTCGGC GCCAACCGCT TCCTGATCGA TGGCCGCCAC
ACCGGCACCG GCGGCGGCAA CCACGTCGTG GTCGGCGGCG CCAGCCCCGC GGACTCTCCG
TTCTTGAGAA GGCCGGATCT ACTCAAGAGT CTGGTGCTGT TCTGGCAGCG GCATCCGTCA
CTGTCGTATC TGTTCTCCGG GATGTTCATC GGCCCGACCA GCCAGGCGCC GCGGATCGAC
GAGGCGCGGC ACGATTCGCT GTATGAACTC GAAATCGCGC TGACCCAGGT GCCGCCGCCG
GGTGTGAAGG GGCCGCTGTG GCTTGCTGAT CGACTATTCC GCAACATCCT GGTCGACATC
ACCGGCAACA CCCATCGCGC CGAGATCTGC ATCGACAAGA TGTACTCGCC GGATAGTCCG
ACCGGGCGTC TCGGTCTGGT CGAGTTCCGC GCGCTGGAGA TGCCGCCCGA TCCGCGGATG
TCGCTAGCGC AGCAGCTTCT GATCCGCGCG CTGATCGCGA TGCTGTGGCG CGAGCCGCTC
TCCGGCAAGT TCGTCCGCTG GGGCACGGCG CTGCACGACC GCTTCATGCT GCCGCATTAT
TTGTGGGAGG ACTTCCGCGA CGTGCTCGGC GAGCTCGCAC GCGCCGGCTA TGCGTTCGAG
TCGGAGTGGT TCACCGCGCA GCTCGAGTTT CGTTTTCCCG TGTTCGGCAG CGTGTATCAC
GGCGGTGTCA CATTGGAGGT GCGGCAGGCG TTGGAGCCGT GGCACGTGCT GGGCGAAGAG
GGGACCGCGG GCGGCACGGT GCGCTTTGTC GACTCGTCGG TGGAGCGGCT GCAGGTCAAG
GCCGAGGGCT TCGTCGAGGG CCGCCACGTC ATCACCTGCA ACGGCCGCCG GCTGCCGATG
ACGGCGACCG CGCGCTCCGG CGAAGCGGTG GCGGCGGTGC GGTTCAAGGC GTGGCAGCCG
GCCTCCGGGC TGCATCCGAC CATTCCGGTG CACGCGCCGC TGGTGTTCGA CATCGTCGAC
ACCTGGAACG GCCGCTCGCT CGGCGGCTGC GTCTATCACG TCGCCCATCC TGGCGGCCGC
GCCTACGAGA CCAAGCCGGT GAACTCGTAC GAAGCCGAGG CCCGGCGGCT GGCCCGGTTC
CAGGATCACG GCCACACCCC GGGCCGGATC GATTCGCCGC ATGAAGAACG CACACTTGAA
TTCCCGCTGA CCCTCGACTT GCGCACGCCA CTGCTGCATT GA
 
Protein sequence
MSIYVALHHV THYKYDRLVD IGPQTIRLRP APHTRTPILS YSLKVTPANH FINWQQDPQG 
NWLARFVFPE KADELKIEVD FTAAMTVINP FDFFVESYAE SFPFSYTGDL QHELAPYLAT
TEPGPLFKAY LDSIPREAES TVNFLVDLNA KLRERVNYII RMEPGVQTPE ETLAKGAGSC
RDSAWLLIQT LRHLGLAARF VSGYLIQLRP DIESLDGPKG ATHDFTDLHA WAEVYLPGAG
WVGFDVTSGL LAGEGHIPVA ATPHYRTAAP ISGVVGFANV DFKFDMRVAR IREAPRITMP
FSDESWARLD ALGEKVDADL VAHDVRLTMG GEPTFVSIDD LESPEWNVAA VGGAKRMLAD
DLIRRLRTRF APGGLLHFGQ GKWYPGESLP RWAFGLYWRK DGVPIWNNAE LIAPVVGQRP
AKVEEAEQFA IGTAKRLGID TDYVLPAYED PNHWLQKEAA LPPNVDPQDN KLSDPEERAR
MARVFDTGLN TPRGFVLPIQ AWNAEATPAQ KKRWRSERWK LRRGNLFLLP GDSPLGFRLP
ISSLPHIPED DYPFIVPRDP LEPRGTLPLF APPPANDADP EREQMPVFEQ SAGEATTSQP
VEEQKLRKGG VRTAMSIELR EGVLCAFMPP TETIEDYLEL IAAVEATAEE MQIRVHIEGY
PPPYDPRIDV IKVTPDPGVI EVNIQPASSW REAVRTTFGL YEDAAQVRLG ANRFLIDGRH
TGTGGGNHVV VGGASPADSP FLRRPDLLKS LVLFWQRHPS LSYLFSGMFI GPTSQAPRID
EARHDSLYEL EIALTQVPPP GVKGPLWLAD RLFRNILVDI TGNTHRAEIC IDKMYSPDSP
TGRLGLVEFR ALEMPPDPRM SLAQQLLIRA LIAMLWREPL SGKFVRWGTA LHDRFMLPHY
LWEDFRDVLG ELARAGYAFE SEWFTAQLEF RFPVFGSVYH GGVTLEVRQA LEPWHVLGEE
GTAGGTVRFV DSSVERLQVK AEGFVEGRHV ITCNGRRLPM TATARSGEAV AAVRFKAWQP
ASGLHPTIPV HAPLVFDIVD TWNGRSLGGC VYHVAHPGGR AYETKPVNSY EAEARRLARF
QDHGHTPGRI DSPHEERTLE FPLTLDLRTP LLH