Gene Rpal_3766 details


Gene Information

Locus tag: Rpal_3766
Symbol:
ID: 6411444
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodopseudomonas palustris TIE-1
Kingdom: Bacteria
Replicon accession: NC_011004
Strand:
Start bp: 4043534
End bp: 4045282
Gene Length: 1749 bp
Protein Length: 582 aa
Translation table: 11
GC content: 61%
IMG OID: 642713647
Product: GMC oxidoreductase
Protein accession: YP_001992740
Protein GI: 192292135
COG category: [E] Amino acid transport and metabolism
COG ID: [COG2303] Choline dehydrogenase and related flavoproteins
TIGRFAM ID:
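
The length fields above follow from the coordinates by simple arithmetic. The sketch below is only an illustrative consistency check, assuming the start and end positions are 1-based and inclusive and that the final codon is a stop codon that is not counted in the protein length; the variable names are made up for this example.

```python
# Values copied from the Gene Information table above.
start_bp = 4043534
end_bp = 4045282

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS is a whole number of codons, the last one being a stop codon.
codon_count, leftover = divmod(gene_length, 3)
protein_length = codon_count - 1  # the stop codon does not encode a residue

print(gene_length)     # 1749
print(leftover)        # 0
print(protein_length)  # 582
```

Under these assumptions the results agree with the listed Gene Length (1749 bp) and Protein Length (582 aa).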


Plasmid Coverage information

Num covering plasmid clones: 24
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCTTGTCG AGAACCTTGC TCATTTCAGC GCCGATACTC GTTTTGAAGC CGACCTCGTC 
ATTATCGGAG GCGGCCCAGC AGGTCTCACA GTAGCGCGCG AGCTGCGAGG TGCTCCGCTA
CGCGTCCTGA TCGTCGAGAG CGGGCTGCTT GAGGAGGCTC CTCAAGCAAA CGAACTCTCC
ACGCTGGAGA GCATCGGCGA GCCGAACGCC CCGCACCAGA TTCAGAAGAG GATTGCTTTT
CACGGCGCCA ATTGCCGTTC CTGGACTCAG GACCAGCAAC CTTATGGTGT CCGTTGTCGA
GCGCTCGGAG GCTCGACCCA TGCTTGGGCG GGAAAGTCAG CCGCCTTCGA CGACATCGAT
TTTCAGGAGC GGGCATGGGT ACCGCATTCC GGCTGGCCGA TGGACCGATC CAGCCTCGAT
CCGTTTCTCG ATCGAGCAGC CGATATTCTG AACCTCGGAC CGAACCGTTA CGACGACACC
CTTTGGACTC TGCTCGGCAT TTCGCCTCCC GAACCTCAAC TCGGCACGCA GGGGCTTCGC
TCGTTCTTCT GGCAGTTTTC GCGATCCCGG GTCGATCCTC TCGACATTAT GCGGTTCGGT
GCCGACTTTC TCAGGTTGGA AGCCGACAAC ATTCGCGTTC TGCTCAACGC GACCGCGACG
CGCCTCGATC TGACCCAGGA GGGAGACGGG TTTCGCGGAC TGGAGATCGC GACGAACGAC
GGGCAACGCC ACACCGTTCA CGCAAAGGCG GCCGTGATCG CCGGCGGCGG TATCGAAAAT
GCGCGCCTTC TGCTCGCGTC AACCACGATC CACGCCAATG GAGTCGGCAA TCAGTACGAT
CTGGTCGGGC GCTTCCTGAT GGATCACGCC GGCGCGCGGA TCGCCAGGTT CGGCGGATCG
AACGCCAACC AGATCATGCG GCGTTTTGGA TTCTACGGCC TCCGCAATAG CGGCCGCACC
TCCATGTACA TGCACGGCCT CGCCATCACG CCTGAGTTGC AGGAGCGAGA ACATCTTTTG
AATTCGGCGA TCTATTTCAT GCCGGAGCAT GCGGCGGACG ATCCATGGGA CGCCGTCAAG
CGGCTTCTTA CCGGCAAGAG CAAGCGACAG GTCAAGGATG CCATGCGGGC CATCGCCGGC
GCCGGCCTCC TTGGAAAGGG CGCCGCCATG AAGCTGCTGT CCAGCGACCG GCTCCCAGTC
GCTGTCAAGG ATTTCATCGT CCAAACTGCC ATTCGATACA TGCCTAATGT GGCCGCGGAC
GACTTCCTCA GTCGCGGCGT TCCACACAAG GTGACGGACG TGTGGATCGA CGCCATATCC
GAGCAGCAGC CCGATCCCAA CAGTCGCATC ACGTTGTCAG AGAAGACAGA CCGGTTGGGC
CTGCCGCTTG CTCGGGTCAA TTGGAAGATC AACGCTGACG AGCGCCGAAC CATCATGCGG
CTCGGCCACA TCGTGGAGTC GGCTTTTGCG CAAGCGCGGC TACCGAAGCC GATCCTGGAA
CCATGGGTCG CGGAAGGCGC GTTCAACGAC GCTGTGATCA TCGACATGGC GCACAGCATG
GGAACGACGC GGATGTCGGC GTCGGCGCGA TCAGGCGTGG TCGACGAACA GTGCCAAGTT
CACGGCGTTC GCGGCCTCTA CATCGCCGGC AGTTCGATCT TCCCGACCAG TGGCCACGCC
AATCCAACTC TTATGATCGT GGCCTTTGCG GTCCGTCTTG CCGACCGTAT CCGACAGACC
CTCCTATAG
 
Protein sequence
MLVENLAHFS ADTRFEADLV IIGGGPAGLT VARELRGAPL RVLIVESGLL EEAPQANELS 
TLESIGEPNA PHQIQKRIAF HGANCRSWTQ DQQPYGVRCR ALGGSTHAWA GKSAAFDDID
FQERAWVPHS GWPMDRSSLD PFLDRAADIL NLGPNRYDDT LWTLLGISPP EPQLGTQGLR
SFFWQFSRSR VDPLDIMRFG ADFLRLEADN IRVLLNATAT RLDLTQEGDG FRGLEIATND
GQRHTVHAKA AVIAGGGIEN ARLLLASTTI HANGVGNQYD LVGRFLMDHA GARIARFGGS
NANQIMRRFG FYGLRNSGRT SMYMHGLAIT PELQEREHLL NSAIYFMPEH AADDPWDAVK
RLLTGKSKRQ VKDAMRAIAG AGLLGKGAAM KLLSSDRLPV AVKDFIVQTA IRYMPNVAAD
DFLSRGVPHK VTDVWIDAIS EQQPDPNSRI TLSEKTDRLG LPLARVNWKI NADERRTIMR
LGHIVESAFA QARLPKPILE PWVAEGAFND AVIIDMAHSM GTTRMSASAR SGVVDEQCQV
HGVRGLYIAG SSIFPTSGHA NPTLMIVAFA VRLADRIRQT LL
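
The protein sequence corresponds to the gene sequence read codon by codon with translation table 11. The sketch below is a minimal, hedged illustration of that relationship, not the database's own pipeline. It builds the codon table by hand (table 11 assigns the same amino acids to codons as the standard code and differs only in the start codons it permits, which does not matter here because this gene begins with ATG), and the helper names `clean` and `translate` are hypothetical. Only the first 30 bases and the last 9 bases of the gene sequence are used, to keep the example short.

```python
from itertools import product

# Codon table in the conventional TCAG ordering (first base varies slowest).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case the bases."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate codon by codon from the first base, stopping at the first stop codon."""
    dna = clean(dna)
    residues = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":  # stop codon: end of the protein
            break
        residues.append(aa)
    return "".join(residues)


# First 30 bases and final 9 bases, copied from the gene sequence above.
first_block = "ATGCTTGTCG AGAACCTTGC TCATTTCAGC"
last_block = "CTCCTATAG"

print(translate(first_block))  # MLVENLAHFS
print(translate(last_block))   # LL
```

The two outputs match the first ten residues (MLVENLAHFS) and the last two residues (LL) of the protein sequence; the final TAG codon is the stop. Passing the complete gene sequence to `translate` in the same way would be the natural full check.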