Gene Rpal_4920 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRpal_4920 
Symbol 
ID6412611 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris TIE-1 
KingdomBacteria 
Replicon accessionNC_011004 
Strand
Start bp5298520 
End bp5300148 
Gene Length1629 bp 
Protein Length542 aa 
Translation table11 
GC content66% 
IMG OID642714802 
ProductNHL repeat containing protein 
Protein accessionYP_001993884 
Protein GI192293279 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value0.610425 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAGGGCA AACTTGCGCT GGGCGCGAGT GCGTTGACGA TCGGCTTGCT GACGCTGTCA 
GTCGCACACG CCGACGGCTA TCAGGTCACC AAGCTGGTCC CGGGCTCGGC GTTTCACGGC
GTGCACGGCC TTGCGGTCGA CAAGGCCGGC AAGCTGTTCG CCGGCAGCGT CGCCGGCGCA
GCGCTGTATG AAGTCGATCG CGCCGCAGGT ACCGCCAAGA TCGCCGTGCC GACGCCGGAA
GGCATGGCCG ACGACATCGC GATCGCGCCG GACGGCACGA TGGCGTGGAC CGCCTTCCTC
ACCGGCGATC TCTATGCGCG CAAGGGCGAC GGCCCGATCA AGAAGCTGGC GTCCGGCCTG
CCGGGCATCA ACTCGCTCGC CTTCCGCAAG GATGGACGGC TGTACGCCAC CCAGGTGTTT
CTCGGCGATG CGCTGTACGA GATCGACGTC GAGGGCGCCA AGCCGCCACG CAAGATCATG
GAGAAGATGG GTGGGCTGAA CGGCTTCGAA TTCGGCCCCG ACGACAAGCT CTACGGCCCG
CTGTGGTTCA AGGGCCAGAT CGTCAAGGTC GATGTCGACA AGGGCGAACT CAGCATCGTC
GCCGACGGCT TCAAGGTGCC GGCAGCGGCG AATTTCGACT CCAAGGGCAA TCTGTGGGCG
CTCGATACGG CGCTCGGTCA GCTGGTCAAG ATCGATCCGA AGACCGGCGC CAAGCAGGTC
GCGGCGCAGC TCAAGCCGGC GCTCGACAAT CTCGCGATCG ATGCGAGCGA CCGCATCTTC
GTCTCCAACA TGGCCGACAA CGGCATCCAG GAAGTCGATC CGGCGACCGG CGCGGCCAAG
CAGGTGATCA TCGGCAAGCT GGCGTTTCCC GGCGGCATCG GCGTCGTTTC CGACGGCGGC
AAGGACACCA TCTACATCGC CGACGTCTTC GCCTATCGCA CCGTCGATGG CGCCAGCGGC
GAGGTGCGCG AAGTGGCGCG GATGCACGCC GACGGCACCA CGCTCGAATA TCCGATGAGC
GCCACCGCCA AGGGCGACGA GGTAATCCTG TCGAGCTGGT TCACCGGCAC GGTGCAGACG
ATCGACCGCA AGACCGGCCA GAGCCGCGAC ATGCTGCACG GCTTCAAGGC GCCTTACGAC
GCGATCCGGC TCGGCGGCGG CAAGCTGCTG GTCGCCGAAC TCGGCACCAA GTCGCTGGTC
GAAGTCTCGG GCGAGCACGG CAAGGACCGC AAGGCGATCG CCACCGATCT TGCAGGTCCG
GTCGGACTGG TCCTCGGCAA AGACAGCGCG GTGTATGTCA GCGAAGCGTT CGCCGGCCAG
ATCAGCAAGA TCGATCCGGT GACCGGCGCC AAGACGGTCG TCGCCAAAGA CCTGAAGATG
CCCGAGGGCA TCGCGCTCGC GCCGTCCGGC AAACTGATCG TCGCCGAAGT CGGTGCCAAA
CGCGTGGTCG AGGTCGATCC GGCCAGCGGC AGCGTGACGG AAATCGCCGG CAATCTGCCG
ATCGGCCTGG TCGGCGCCCC CGGCCTGCCG CCGACCAACA TGCCGACCGG CGTCGGTGTC
GGCGCTGGCG GCACGATCTA CGTGTCGTCC GATATCGAGA ATGCGATCTA CAAGATCGAG
AAGAAGTAG
 
Protein sequence
MKGKLALGAS ALTIGLLTLS VAHADGYQVT KLVPGSAFHG VHGLAVDKAG KLFAGSVAGA 
ALYEVDRAAG TAKIAVPTPE GMADDIAIAP DGTMAWTAFL TGDLYARKGD GPIKKLASGL
PGINSLAFRK DGRLYATQVF LGDALYEIDV EGAKPPRKIM EKMGGLNGFE FGPDDKLYGP
LWFKGQIVKV DVDKGELSIV ADGFKVPAAA NFDSKGNLWA LDTALGQLVK IDPKTGAKQV
AAQLKPALDN LAIDASDRIF VSNMADNGIQ EVDPATGAAK QVIIGKLAFP GGIGVVSDGG
KDTIYIADVF AYRTVDGASG EVREVARMHA DGTTLEYPMS ATAKGDEVIL SSWFTGTVQT
IDRKTGQSRD MLHGFKAPYD AIRLGGGKLL VAELGTKSLV EVSGEHGKDR KAIATDLAGP
VGLVLGKDSA VYVSEAFAGQ ISKIDPVTGA KTVVAKDLKM PEGIALAPSG KLIVAEVGAK
RVVEVDPASG SVTEIAGNLP IGLVGAPGLP PTNMPTGVGV GAGGTIYVSS DIENAIYKIE
KK