Gene Rpal_4274 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRpal_4274 
Symbol 
ID6411958 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris TIE-1 
KingdomBacteria 
Replicon accessionNC_011004 
Strand
Start bp4597551 
End bp4600052 
Gene Length2502 bp 
Protein Length833 aa 
Translation table11 
GC content70% 
IMG OID642714156 
Productprotein of unknown function DUF404 
Protein accessionYP_001993245 
Protein GI192292640 
COG category[S] Function unknown 
COG ID[COG2307] Uncharacterized protein conserved in bacteria
[COG2308] Uncharacterized conserved protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.306877 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCGCAGC AAGGCGGCCC GGCCCGGAAG GCTCGACCGC GTGACCGCAA GGTTGCGCAA 
TGGAGCCGGG ATTATGTGCG CCTGCCCGGC ATCCCCGATG AATTCATCGG CGCCGACGGC
CAGCCGCGCG CGGTGTGGAG TCGTTATTTC GACGCGTTCG CAGCTTTGTC GACCGACGAG
ATCGAGCGCC GCTTCGCCGC CGCCGACCGG CATCTGCGCG ATGCCGGCGT CACCTATCGG
GCACCGGGCG ACGCCGTGGA TCGCGCCTGG CCGCTCAGCC ACGTTCCGCT GCTGATCGGC
GAGAGCGAGT GGCAACAGAT CGCCGCCGGC ATCGCCCAGC GCGCGACGCT GCTGGAAGGC
GTGCTGCAGG ACATCTACGG CGACGGCCGA TTGATCGCTG ACGGCGCGCT GCCGGCCGCC
GCGATCGCCG GCAGCGCCGA TTACTTGCGG CCGATGGTCG GGGTGAAGCC GCCCGGTGGC
CGCTATCTGC ATTTTTACGC CGCCGACATC GGCCGTGGCC CGGACGGGCA GTGGTGGGTG
CTCGGCGATC GCACCCAGGC GCCGTCCGGC GCCGGCTACG CGCTGGAAAA CCGCCTGGTG
CTGTCGCGCG CCTTCAACGA TCTCTACAAA TCGATGAATG TCGAGCGCGT CGCCTCGTTC
TTCGAAGCGT TCCGTGACAG CCTGCGCGCC ACTGCCGATC GCGACGAGCC GCGCATCGGG
CTGCTGACGC CGGGCCGGTT CAGCGAGACG TATTTCGAGC ACGCCACATT GGCGCGCTAT
CTCGGCTTCC TGCTGGTGGA AGGCGACGAC CTCGCGGTCG CCGATGATCG TCTGCATATC
CGCACCGTCG CCGGTCTGAA GCGGATCGAC GTGCTGCTGC GCCGGGTCGA TGCCAACTCG
CTCGATCCGC TGGAGCTCGA CGCTTCGTCG CAGCTCGGCG TCGCCGGCCT GATCGACGTG
ATCCGCAAGA GCGGCGTCGT GGTTGCCAAC ATGCCGGGCT CCGGCGTGCT CGAAGCGCGC
TCGCTGCTCG GCTTCCTGCC GCCGCTGGCG CGCCGGCTGC TCGGCGAAGA CCTCAAGATG
CCGCACATCG CCACCTGGTG GTGCGGCCAA ACGGCGGCGC GCGAGGAAGT GCTCGACCGG
CTCGATGAGT TCGTGATCGA AGGCGCTTAC GGCCAGAACG TGCCGGGCTT CTCGCATCAC
GGCCCGGTGC TGCCCGGCGA ATTGCCGCCG GTGGTGCGCG ATCATTTGCG CGACGCGATC
GCGGCGCGCG GGCTCGACTA CGTCGCGCAG GAGCAGGTCC GGCTGTCGAC CACGCCGGTG
TGGGACGACG GCAAGCTGGC GCCGCGGCCG TTCGTGCTGC GCGTCTTCGC CGCCGCGACC
GAACACGGCT GGGCGGTGAT GCCCGGCGGG TTCTGCCGGA TCGCCGACCG GCTCGACTCG
CGCGCGGTGT CGATGGGCGA GGGCGCCCGT GCCGCCGATG TCTGGGTGGT GGCCGATCAT
GCGGTGGCGC CGTCGTCGCT GCTACCCGCG GTCGACAGCG TCCGGATTCG GCGCATCACC
GGCGTGCTGC CGAGCCGCGC CGCCGATAAT CTGTTCTGGC TCGGCCGCTA TCTCGAACGC
GCCGAGGCGA CGCTGCGGCT GCTGCGCGCG CTCAGCGCGC CGCAGCGCGA TCCCGGCAAG
GGGCCGCTGC ACCACGCCAT CGAGAAGATC CAGCGGCTGC TGATCGCCTG GGGCGCGACC
TCGCTCGGCC CGCGCGCGCA GACCGCCAAG ATCGCCGCCG ATGCGCTGCA GAGCCCGGAT
GAATTCGGCT CGGCGCTGTC GCTGATCCGC GCCGCCCAGC GCAACGCTTC GTCGCTGCGC
GAGCGGCTGT CGCCGGATGC CTGGCACGTC ATCACCCAGA TGGAGGCGCG CCTCGCCGTC
CCGGTCGAGG GCGAAGAAGC GGTAGTTCAG GCTGCGGAAG TCGCGCTTCA GGAGTTGGCC
AGCTTCTCCG GCCTGTCGAA CGAAAACATG AACCGCGCCG CCGGCTGGCG CTTCCTGCGG
ATCGGCCGCC GTGTCGAGCG CGCGGTCAAC ACCGCGCGGT TCGCCCGGCA GTTCTCCTGC
GACGGCGCCA CCGCCGAAGA CCTCGACGTG CTGCTGACGC TGGTGGATTC GCAGATCACC
TATCGGTCGC GCTATCTGTT GGCACCGCTG CTGGCGCCGG TGCGCGATCT CGCTGTGCTC
GATCCCTACA ATCCGCGCTC GGTGGCGTTC CAGGTCGAAG AGCTCAACGA TCACGTCGCC
AGTCTGCCGG CGCTGCATGA GGGCGGCCTG ATCGAGCGGC CGCAGCGGCT GGCGGTCTCC
ACTTTGGCCA AACTGACAGC GGCCGAAGTC GAGACGATCG ATGCGTCGTG GTTGTTCGCG
CTGGAACAGG ATCTGCTCAG CCTCGCCGAG GCCGTCGGTT CGCACTACTT CCCGCACGGC
GCCAGCGCGA TGCGACCGGA AAAGCTGATG GGGCTGGCGT GA
 
Protein sequence
MAQQGGPARK ARPRDRKVAQ WSRDYVRLPG IPDEFIGADG QPRAVWSRYF DAFAALSTDE 
IERRFAAADR HLRDAGVTYR APGDAVDRAW PLSHVPLLIG ESEWQQIAAG IAQRATLLEG
VLQDIYGDGR LIADGALPAA AIAGSADYLR PMVGVKPPGG RYLHFYAADI GRGPDGQWWV
LGDRTQAPSG AGYALENRLV LSRAFNDLYK SMNVERVASF FEAFRDSLRA TADRDEPRIG
LLTPGRFSET YFEHATLARY LGFLLVEGDD LAVADDRLHI RTVAGLKRID VLLRRVDANS
LDPLELDASS QLGVAGLIDV IRKSGVVVAN MPGSGVLEAR SLLGFLPPLA RRLLGEDLKM
PHIATWWCGQ TAAREEVLDR LDEFVIEGAY GQNVPGFSHH GPVLPGELPP VVRDHLRDAI
AARGLDYVAQ EQVRLSTTPV WDDGKLAPRP FVLRVFAAAT EHGWAVMPGG FCRIADRLDS
RAVSMGEGAR AADVWVVADH AVAPSSLLPA VDSVRIRRIT GVLPSRAADN LFWLGRYLER
AEATLRLLRA LSAPQRDPGK GPLHHAIEKI QRLLIAWGAT SLGPRAQTAK IAADALQSPD
EFGSALSLIR AAQRNASSLR ERLSPDAWHV ITQMEARLAV PVEGEEAVVQ AAEVALQELA
SFSGLSNENM NRAAGWRFLR IGRRVERAVN TARFARQFSC DGATAEDLDV LLTLVDSQIT
YRSRYLLAPL LAPVRDLAVL DPYNPRSVAF QVEELNDHVA SLPALHEGGL IERPQRLAVS
TLAKLTAAEV ETIDASWLFA LEQDLLSLAE AVGSHYFPHG ASAMRPEKLM GLA