Gene Rpal_4063 details


Gene Information

Locus tag: Rpal_4063
Symbol:
ID: 6411747
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodopseudomonas palustris TIE-1
Kingdom: Bacteria
Replicon accession: NC_011004
Strand:
Start bp: 4361662
End bp: 4362903
Gene Length: 1242 bp
Protein Length: 413 aa
Translation table: 11
GC content: 70%
IMG OID: 642713945
Product: Extensin family protein
Protein accession: YP_001993034
Protein GI: 192292429
COG category: [S] Function unknown
COG ID: [COG3921] Uncharacterized protein conserved in bacteria
TIGRFAM ID:
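
The coordinates and lengths in this record can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the CDS ends with a single untranslated stop codon; the function names are illustrative and not part of the database.

```python
def cds_span(start_bp: int, end_bp: int) -> int:
    """Length of a feature given inclusive start and end coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_length_bp: int) -> int:
    """Amino-acid count for a CDS whose last codon is a stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


# Values copied from the record above.
span = cds_span(4361662, 4362903)
print(span)                  # 1242, matching "Gene Length"
print(protein_length(span))  # 413, matching "Protein Length"
```

If either check failed, the likely causes would be a different coordinate convention or a partial CDS rather than an error in the sequence itself.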


Plasmid Coverage information

Num covering plasmid clones: 27
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGACGCGCG GAGTTCGTTT GTATCTCGTC GGCTCCTTCG TCCTCGTCTC TCTCGCGGGT 
TGCGGTCGCG GTCTGTTTCA GACCGCCGAG CGCGAACCGT GGCGAGCCGA GGCCGAGATC
GCGTGCTTGA AATCCGGCGT GGTCCGCGAA GGACCGGATC TGGTCCGGAT CGATCCGATC
TCAGGCCCTG GTGTGTGTGG TGCCGAGTTT CCGCTGAAGG TGGCGGCGCT CGGCGAAACC
GGCGCGATCG GTTTCGCCGA CGATCTGCGG CCGCCGGGTG CGATCGGCGG TGCCGGCCAA
AGCCAGCCGC GCTGGCCGGG CGGCCAGCCG CAACCGAACT ACGCGACACC TCAACGTGGC
TATGCCGAAC CGCCGGCGCG CGCGCCGAAC TACGGCGCAC AGCCGCAGGC CGGCTACGGC
GCGCCGCAGG GCGGCTACGG CAAAGCGCCG GTGTCGCTGA ACGCGCCGGG CGTGGGGCCG
GCTCAGGACG ATATCGAACT GCCGCCGGAA GGCGAGCCGT CCGCCGAGCG TCCGCCGGCC
GAGAACGTCA CCGGCTATCC GCGCGGTGCT GCGCCGCAGG GCGGCTATCC CGGCGAAGCG
GAGCGGCCGC TGCCGCGGCT CGGCCCGGGC CAGCAGGGCG GCATCACCGG CTCGGTGGGG
CCGGTTGCGA TCAAGCCGAC CGCGACGCTG GCGTGTCCGA TCGTGTCGGC GCTCGATCGC
TGGCTGGCGG AATCCGTGCA GCCTTCGGCG ATGCGCTGGT TCGGCGTCCG CGTCGTCGAG
ATCAAGCAGA TCTCGGCGTA TTCGTGCCGC GGCATGAACG GCAATCCGAA CGCCCACATC
TCCGAACACG CATTCGGCAA CGCGCTCGAT ATCGCCGCCT TCGTGCTGGC CGATGGCCGC
CGCATCACCG TCAAGGGCGG CTGGCGTGGA TTGCCGGAGG AGCAGGCGTT CCTGCACGAC
GTGCAGAACT CGGCGTGCCA GATGTTCACC ACGGTGCTGG CGCCGGGCTC GAACGTCTAT
CACTACGATC ACATCCACGT CGATCTGATG CGGCGGCGCA GCCAGCGCAC GATCTGCAAG
CCGGCCGCGG TGTCCGGCGA AGTGATCGCG CAGCGGCTGC AGCAGCGCAA TCCTTACGCG
GGCAGTGCGT CGCCGGGGCC GGGCTGGAAC GGCGTCACCG GCTCGATCGG CCGCAACGCG
TCGCGCCACA AGGTCGATCG CGACGAGGCC GAGGACGATT AG
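
The gene sequence is printed in space-separated blocks of 10 bases, so any calculation on it needs the whitespace removed first. As a minimal sketch, the helper below (a hypothetical name, not a database tool) computes GC content as a percentage. It is shown on the first line of the sequence only, so its result differs from the 70% reported for the whole gene.

```python
def gc_percent(dna: str) -> float:
    """Percentage of G and C bases, ignoring spaces and line breaks."""
    seq = "".join(dna.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First line of the gene sequence above (60 bases).
first_line = "ATGACGCGCG GAGTTCGTTT GTATCTCGTC GGCTCCTTCG TCCTCGTCTC TCTCGCGGGT"
print(gc_percent(first_line))  # 60.0
```

Passing the full sequence, with its line breaks, to the same function gives the whole-gene value to compare against the record.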
 
Protein sequence
MTRGVRLYLV GSFVLVSLAG CGRGLFQTAE REPWRAEAEI ACLKSGVVRE GPDLVRIDPI 
SGPGVCGAEF PLKVAALGET GAIGFADDLR PPGAIGGAGQ SQPRWPGGQP QPNYATPQRG
YAEPPARAPN YGAQPQAGYG APQGGYGKAP VSLNAPGVGP AQDDIELPPE GEPSAERPPA
ENVTGYPRGA APQGGYPGEA ERPLPRLGPG QQGGITGSVG PVAIKPTATL ACPIVSALDR
WLAESVQPSA MRWFGVRVVE IKQISAYSCR GMNGNPNAHI SEHAFGNALD IAAFVLADGR
RITVKGGWRG LPEEQAFLHD VQNSACQMFT TVLAPGSNVY HYDHIHVDLM RRRSQRTICK
PAAVSGEVIA QRLQQRNPYA GSASPGPGWN GVTGSIGRNA SRHKVDRDEA EDD
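
The protein sequence can be related back to the gene sequence by translating codon by codon. The sketch below is a standard-library illustration, not the method used by the database. It relies on translation table 11 assigning the same amino acids to codons as the standard code; table 11 differs in which start codons it allows, which this sketch does not handle (this gene starts with ATG, so that does not matter here). The name `translate` is illustrative.

```python
BASES = "TCAG"
# Amino acids for the 64 codons in TTT, TTC, TTA, TTG, TCT, ... order.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate DNA in frame 1, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the block spacing
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases of the gene sequence above.
print(translate("ATGACGCGCG GAGTTCGTTT GTATCTCGTC"))  # MTRGVRLYLV
```

The output matches the first ten residues of the protein sequence. Translating the full gene sequence the same way yields a string to compare against the protein sequence listed above.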