Gene Rpal_3804 details

Gene Information

Locus tag: Rpal_3804
Symbol:
ID: 6411482
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodopseudomonas palustris TIE-1
Kingdom: Bacteria
Replicon accession: NC_011004
Strand:
Start bp: 4083777
End bp: 4086662
Gene Length: 2886 bp
Protein Length: 961 aa
Translation table: 11
GC content: 56%
IMG OID: 642713685
Product: SNF2-related protein
Protein accession: YP_001992778
Protein GI: 192292173
COG category: [K] Transcription; [L] Replication, recombination and repair
COG ID: [COG0553] Superfamily II DNA/RNA helicases, SNF2 family
TIGRFAM ID:
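
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below is only an illustration: it assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. Neither assumption is stated in the record itself, and the function name `check_cds_lengths` is hypothetical.

```python
def check_cds_lengths(start_bp, end_bp, gene_length_bp, protein_length_aa):
    """Return True if coordinates, gene length and protein length agree.

    Assumes 1-based inclusive coordinates and one terminal stop codon
    (both are assumptions, not facts taken from the record).
    """
    span = end_bp - start_bp + 1                  # bases covered by the gene
    codons, remainder = divmod(gene_length_bp, 3)  # whole codons in the CDS
    return (
        span == gene_length_bp
        and remainder == 0
        and codons - 1 == protein_length_aa        # drop the stop codon
    )


# Values copied from the Gene Information fields of this record.
print(check_cds_lengths(4083777, 4086662, 2886, 961))
```

Under these assumptions the fields agree: 4086662 - 4083777 + 1 = 2886 bp, and 2886 / 3 = 962 codons, which is 961 residues plus a stop codon.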


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value: 0.502587
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGTCGGAAT ACACGCACTA TCACAGCAAA TATTTCGCCC ATCGCATCAT GTTGCAGGGG 
CGAAACGACG AAGCCTTCGC CAAGTCCCTA TCGACCGCTC GGGTCGATAT GAAGCCTCAC
CAGGTCGAGG CTGCTCGATT TGCATTGCAC TCGCCATTGT CCAAAGGCGT CATCCTTGCG
GACGAAGTCG GTTTGGGAAA AACCATCGAA GCGTGCTTGG TCATTGCCCA GAAATGGGCC
GAACGCCGTC GCCACATCCT CCTGATCGTT CCGGCATCGC TACGAACCCA ATGGCAGCAG
GAGCTTCAGC AGAAGTTCTC ATTGCCAAGT GTGGTTCTCG ACTCGAAGAC TCACCGGGAT
GCCCAGAAGG GCGGCCAGCC TCATCCCTTC GACAATCCTT CTGCGGTTGT CATCACCTCC
TATCAGTATG CCGCTCGAAA GCACGATGAA CTGCACCGCA TATCCTGGGA TCTGGTAGTC
ATCGACGAGG CGCACCGGCT ACGGAACGTC TACAAGAAGA GACAGTCCGC TCAGGCAAAG
AAGCTCCGCG ACGTACTTGC GTCTCGGTTC AAGGTGCTGC TGACCGCCAC CCCGTTGCAG
AACTCGCTGA TGGAGCTTTA TGGGCTCGTT TCTGTCATTG ATGACAGCTT CTTCGGTGAT
GAAGAATCTT TCCGCGGTAT GTATGGCCGG ACGACGGACA AGATCGCATT GCAAAGCCTT
CGGCGACGGC TGTCTCCGAT CTACAAGCGA CACCTCCGAC GCGATGTACA GGAGGCGGGT
CATGTCAGCT ATACGAAGCG GATTGCGGTG ACGTTTGACT TCGAGCCGCA CGACCGAGAA
GCACAGCTTT ACGAAGGGGT TTCGGAGTAT CTACAACGCA AAGACTCTAT AGCCTTCGGA
GCGAAGCCCA ATCAGCTCGT CCTGATCGGG GCACGCAAAA CCCTTGGCTC GTCCATCGCA
GCGATAACTC TATTTCTAGA GAATGTCATC GCACGGCTGC GCAAAAGCGA GGTCGCCGAC
GTCAGTGTCA TTGATGACAT CGATGACACG GCCGAGGTCA AGGAGGAAAT CGCGGATGAC
GTCATGCTCT CCGCGGATGC GAATGGAAAC GACGAGGATG AGAGCGATGA CCCCGCCGGC
GGACTCGATC CGAAGGCGCT CGCGGCCGAG ATTGCCGAAC TGGAAGGATA TCTCAGTCTT
GCCCGCTCTA TCGGCTCGAA TGCAAAGGGC GAGAAGTTAC TCGAACAGCT TCCCCATGTT
CTCGACGCGG TTCGAGCGCT CGGTGGTAAG CGGAAAGCGG TGATATTCAC CGAGTCTGTT
CGCACCCAGC GCTACCTCGC AGAGATTCTT TCTCAGAATG GGTACGCCGG ACAGATCGTG
CTGATGAACG GATCGAACAA CGATCCCGAG AGCCAGAAGA TCTACAAAGA ATGGAAGAAG
TCGAACGAGG GCACCGACGC GGTTTCCGGC TCAAAATCGG CGGACATGAA GACCGCGATC
GTCAACGCTT TCAAATCCGA TCAAAAGACG ATTCTGATCG CTACCGAGTC GGGGGCAGAA
GGCATCAATC TCCAGTTTTG CTCGCTCTTG ATCAACTTCG ATCTGCCCTG GAATCCGCAG
CGGGTGGAGC AGCGCATCGG CCGCTGCCAC CGTTACGGTC AAAAGATCGA TGTCACTGTG
GTGAACATGC TGAACCGCAA AAATCAGGCG GAGAAGCGAA TCTACGAACT GCTGAAGAAT
AAGTTCAACC TGTTTGAAGG CCTGTTTGGC GCCAGCGATC AGGTTCTCGG CACCATTGAA
TCGGGTATCG ATTTCGAGAA GAAGGTTTTG GCCGTTGTGC AGAGCTGCCG CAGTGAGGCG
GCAGTGCAAG AGGCTTTCGA GAAACTCGAA GAGGAGCTCG AGGAGAAGAT CAAGGCGGAT
ATGGCCGAGA CCCGCCAGCA GCTTTTCGAC ATCTTCGATG CATCCATCGT TGACGTGCTG
CGGCAGCGCG CGACGGACAT CGAGCGTACG ATGTCGGATT TCGAGCGACG TCTTTGGTTG
ATCGCCCGAG CCGAGCTTCC CAAGGCGAAG TTCCATAGCG ACGAAATCCC TCGTTTTGAA
CATGACGGAA AAATATGGTC GACCGTCTGG CCGGAGGCCG ATGAGCACGG ATGGCAGTTT
TTCCGGATGG GCGATGGCAC CCTTGCCGAC CGGTTGATCG AGGACGCCTG CGGCCGAGCC
CTTCCGGAAG CAAAGCTCGT ATTCGACTAC AAGGCCTATC GAAGTGAAGG GCTACCTCGC
CTGTCCGACA TCGAACAGGT TTCAGGAAAG GCGGGATGGC TCAAGGTGTC CGTCCTACGG
GCAGAGACGG CCCTTGGATC GAGGGACAGC ATCGTGCTCG CGGCGACGAC GGACGATGGG
CATGTTCTCG CCAAAGATAC GGCGGAGCGG CTTTTCCAAG TGCCGGCCAT CACCGATGCG
ATCGACGCGG CCTATCCAGG GAAGGCGATG GCGGCGATCG ACGGTGAGGC CATGAAGGCG
GCTCAGCAGG AAGCCGAGAA CCAAAGCCGG AATTGGCTCG ACGAGGAAAC CGCAAAATTG
GAAGCCTACG CCGAGGATCT GGAGAGTGCC AACAAGCGCC GGGAGAGGGA TCTCAAAGCG
GAGGCTGACG CAGCCAAACG TGCCCTTCGA GGCAATCAGT CAATGCCGCT CGCCGAGAAG
ATCGAGGAAG AGCGCCGGAT CAAGAAACTG GAGCAGGAAC GCGACGACTT GGTGTTCGAT
AGTTTCAAGA AGTTGCGGGA AATACGAAGA GAAATCGACG ACAAACTCAA TGATGTTTCT
GCCAAATTGG CGATCACACC GAAGATAACG CCTCTTATGA CGATCCGTTG GGAACTGACG
GCATGA
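
The gene sequence is printed in blocks of ten bases, so the whitespace has to be removed before any computation. As a minimal sketch, the GC percentage of such a block-formatted sequence could be computed as follows. The function name `gc_content` is hypothetical, and the example input is only the first line of the sequence, so its value need not match the whole-gene figure reported in the record.

```python
def gc_content(seq_text):
    """Return the percentage of G and C bases in a block-formatted sequence."""
    seq = "".join(seq_text.split()).upper()  # drop spaces and line breaks
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First line of the gene sequence above (60 bases), used as sample input.
first_line = "ATGTCGGAAT ACACGCACTA TCACAGCAAA TATTTCGCCC ATCGCATCAT GTTGCAGGGG"
print(round(gc_content(first_line), 1))
```

Passing the complete gene sequence as one multi-line string gives the whole-gene value for comparison with the GC content field.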
 
Protein sequence
MSEYTHYHSK YFAHRIMLQG RNDEAFAKSL STARVDMKPH QVEAARFALH SPLSKGVILA 
DEVGLGKTIE ACLVIAQKWA ERRRHILLIV PASLRTQWQQ ELQQKFSLPS VVLDSKTHRD
AQKGGQPHPF DNPSAVVITS YQYAARKHDE LHRISWDLVV IDEAHRLRNV YKKRQSAQAK
KLRDVLASRF KVLLTATPLQ NSLMELYGLV SVIDDSFFGD EESFRGMYGR TTDKIALQSL
RRRLSPIYKR HLRRDVQEAG HVSYTKRIAV TFDFEPHDRE AQLYEGVSEY LQRKDSIAFG
AKPNQLVLIG ARKTLGSSIA AITLFLENVI ARLRKSEVAD VSVIDDIDDT AEVKEEIADD
VMLSADANGN DEDESDDPAG GLDPKALAAE IAELEGYLSL ARSIGSNAKG EKLLEQLPHV
LDAVRALGGK RKAVIFTESV RTQRYLAEIL SQNGYAGQIV LMNGSNNDPE SQKIYKEWKK
SNEGTDAVSG SKSADMKTAI VNAFKSDQKT ILIATESGAE GINLQFCSLL INFDLPWNPQ
RVEQRIGRCH RYGQKIDVTV VNMLNRKNQA EKRIYELLKN KFNLFEGLFG ASDQVLGTIE
SGIDFEKKVL AVVQSCRSEA AVQEAFEKLE EELEEKIKAD MAETRQQLFD IFDASIVDVL
RQRATDIERT MSDFERRLWL IARAELPKAK FHSDEIPRFE HDGKIWSTVW PEADEHGWQF
FRMGDGTLAD RLIEDACGRA LPEAKLVFDY KAYRSEGLPR LSDIEQVSGK AGWLKVSVLR
AETALGSRDS IVLAATTDDG HVLAKDTAER LFQVPAITDA IDAAYPGKAM AAIDGEAMKA
AQQEAENQSR NWLDEETAKL EAYAEDLESA NKRRERDLKA EADAAKRALR GNQSMPLAEK
IEEERRIKKL EQERDDLVFD SFKKLREIRR EIDDKLNDVS AKLAITPKIT PLMTIRWELT
A
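
The protein sequence corresponds to the gene sequence read codon by codon. The sketch below illustrates that relationship for a forward-strand reading frame starting at the first base. It uses the standard codon-to-amino-acid assignments, which translation table 11 shares for internal codons. Table 11 differs mainly in which codons may act as start codons, which does not matter here because this gene begins with ATG. The names `CODON_TABLE` and `translate` are hypothetical, and ambiguous bases such as N are not handled.

```python
from itertools import product

# Codons in TCAG order paired with their amino acids (standard assignments,
# shared by translation table 11 for internal codons). "*" marks a stop.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(seq_text):
    """Translate a block-formatted DNA sequence up to the first stop codon."""
    seq = "".join(seq_text.split()).upper()  # drop spaces and line breaks
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]  # raises KeyError on non-ACGT bases
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First line of the gene sequence from this record (60 bases, 20 codons).
first_line = "ATGTCGGAAT ACACGCACTA TCACAGCAAA TATTTCGCCC ATCGCATCAT GTTGCAGGGG"
print(translate(first_line))
```

The first 20 codons translate to MSEYTHYHSKYFAHRIMLQG, matching the start of the protein sequence above. The final six bases, GCATGA, give alanine followed by a stop codon, matching the terminal A.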