Gene Rpal_3997 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRpal_3997 
Symbol 
ID6411679 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris TIE-1 
KingdomBacteria 
Replicon accessionNC_011004 
Strand
Start bp4283460 
End bp4285205 
Gene Length1746 bp 
Protein Length581 aa 
Translation table11 
GC content60% 
IMG OID642713879 
Productintegrase family protein 
Protein accessionYP_001992968 
Protein GI192292363 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones26 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCAATTG CAACCAATAT TAGCCGCCGG CCCGGTAGCC GGAATTATTA TGTTCGGATG 
GCCGTGCCGC GCGATCTCCA GGTGCGCATG GGCACGCCCG GAAAGCCCCG TAGAGAGCTT
CGCAAGTCGC TGAATACGCC GGACGCGCGG GAGGCCAAAC GCCTCTCACG GCCAATTCTG
GACGAATGGG AGCGCACATT TGCCGAGCTG CGGCGCCCCA AGCAGTTGAC GGAAGCCGAG
CTGCAGAACG CGATCTGGCG CCGATACCTT GAGCTGATCA ACGCCGACGA GAGGTTCCGG
CAAGAGCTGC CGACCGGCGA CGAACTGAAT GCGATCTGGG AATATCTGGA AGCCGAGTTC
GGCGAGCTGA ACATCACGGC CTACAGGATC TTCGAAGAGC TGCGGGACCG GTTCGAAAGC
AACCAGCGGG AGCGAGTCGA GCGGCTGGCG CAGATGAAGG TAGAAGCCGC CCGCGGCGAA
ACGAAGCTGA TCGCGGACGT GGTCGAGCAA GTCATCGAAG CCCGGCGGCT TGGGGTCGAT
CCGGGAACGC CCGAATACCG CAAGCTGGCC CAAGGGCTCC AACGTGCCGA GCTGGAAGGG
CTTAAGCGGA CGGTTGAGCG GGACGCTGGC GACTTCTCCG GCGAGTCCAA AGACAAGCTG
GTGCAGCAGC CGACCGTATT CGATCCGCCG AAGGGCGAGG GCATCCTAGA GCTTTACGAT
CGCTATGCGC GGGAGAAGTC GGGCAGGGTG TCGGCCGACA CTTGGGCGCA GAACCGGAAG
GTGGTGGCGC TCTTTGACAA CTTCGTTGGA GGCAACGCGC ACATTTCAGC GCTGACTCGG
AAGAACGTCC GGGAGTGGAA AGAGAAGTTG TTCGAATGGC CGGTGAAGGC GATCGAAGCA
AGCGAGTTCC GCGGGCTGTC GTTCCTCGAC ACGATCGAAC GCAACAAGGT CGTCGGCAAG
CCGGTGATCC AGCACAAGAC GATCAACCGA TATCTGGCTG CATTGGGCGG TTTCAGCGAC
TGGTTGCTGG CGAACGACTT TATCGGCGAG CAGATCATGC AGGGCATGTA TCTGGAAGTC
GATCGCCGGA AAAAGACGGT GCTGCCCTAC AGCGCCGATC AGATGCGCCG CATCTTCGAA
TCGCCTCTCT TCCACCGCTG CGGTGGTGAT AAGCTGGAGC ACCAGAAGGG CAACGTTGAA
GTCCGGGATT GGCGCTACTG GATACCTCTG ATCGCCGTCC ACTCCGGTGC CCGGCTCGGC
GAGATTTGCC AGTTGATGAC GGCCGACGTT CGGCAGCTTC ACGACGTCTG GATTTTCCAC
ATTACCGAGG AAGGCGGGGC GGGCACGAAG TCGACCAAGA CCGAAGGTTC GATGCGGGTG
GTGCCGATGC ATTCGAAGCT GATCGAACTT GGTTTCTTGA AATACCATGC CCGCATGTCG
GCGATGGGCG ACCGGTTGTT CCCTGAGATC AAGGCGGATG CGCGCGGCTA CATAAGCGGC
AAGGCGTCAA CGTTCTTCAA TGATTACTTC CGTGCGATCG GCGTGAAGTC TGACCGCTCT
TTGAATTTCC ACAGCTTCCG GCATGGCTTT GCAGACGCGC TGCGGCGGGC TGGCTACTAT
GACGAACAGT TTGGGCCGCT ACTTGGGCAT ACGAAATCGA CCACGACGGG GCGCTATGGC
ATCGAATCTG AAATGGTGAT CGCAGATCGT GTCAAGATGG TCGAAGCGGT CATCCAAGCA
AAGTGA
 
Protein sequence
MAIATNISRR PGSRNYYVRM AVPRDLQVRM GTPGKPRREL RKSLNTPDAR EAKRLSRPIL 
DEWERTFAEL RRPKQLTEAE LQNAIWRRYL ELINADERFR QELPTGDELN AIWEYLEAEF
GELNITAYRI FEELRDRFES NQRERVERLA QMKVEAARGE TKLIADVVEQ VIEARRLGVD
PGTPEYRKLA QGLQRAELEG LKRTVERDAG DFSGESKDKL VQQPTVFDPP KGEGILELYD
RYAREKSGRV SADTWAQNRK VVALFDNFVG GNAHISALTR KNVREWKEKL FEWPVKAIEA
SEFRGLSFLD TIERNKVVGK PVIQHKTINR YLAALGGFSD WLLANDFIGE QIMQGMYLEV
DRRKKTVLPY SADQMRRIFE SPLFHRCGGD KLEHQKGNVE VRDWRYWIPL IAVHSGARLG
EICQLMTADV RQLHDVWIFH ITEEGGAGTK STKTEGSMRV VPMHSKLIEL GFLKYHARMS
AMGDRLFPEI KADARGYISG KASTFFNDYF RAIGVKSDRS LNFHSFRHGF ADALRRAGYY
DEQFGPLLGH TKSTTTGRYG IESEMVIADR VKMVEAVIQA K