Gene Rpal_4037 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRpal_4037 
Symbol 
ID6411720 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris TIE-1 
KingdomBacteria 
Replicon accessionNC_011004 
Strand
Start bp4329098 
End bp4330786 
Gene Length1689 bp 
Protein Length562 aa 
Translation table11 
GC content71% 
IMG OID642713919 
ProductDNA repair protein RecN 
Protein accessionYP_001993008 
Protein GI192292403 
COG category[L] Replication, recombination and repair 
COG ID[COG0497] ATPase involved in DNA repair 
TIGRFAM ID[TIGR00634] DNA repair protein RecN 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value0.531713 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCTTGCCC GCCTGTCGAT CCGCGATATC GTGTTGATCG AGCGGCTCGA CATCGAGTTT 
TCCCGTGGCC TTGCGGTGCT GACCGGCGAG ACCGGCGCCG GCAAATCGAT TCTGCTCGAT
GCCTTCGCGC TGGCGCTCGG CGGCCGCGGC GATGCCGCGC TGGTCCGCCA CGGCGCCGCC
GAGCACGGCC AGGTCACCGC CAGCTTTGAT GTCGCTAAGA CCCATCCAGC GTTCGCGATT
CTCAAGGCCA ATGGTCTCGA CGACCGTGAG GTCGACGAAT CCGGCGAATT GATCCTGCGC
CGCGTCCAGC TCGCCGACGG CCGCACCCGC GCCTTCATCA ACGACCAGTC GGTCAGCGTG
CAGACCCTCA AGGCGGTCGG CGCGACGCTG GTCGAGATCC ACGGCCAGCA CGACGAGCGC
GCGCTGGTCG ACGCCGCCAC CCATCGCCGG CTGCTCGACG CCTTCGCAGG CCTTGAGAAG
GACGTCGTTT CTCTTGAGGC GCTGTGGGAG GGCCGCCGCA CCGCGCGGGC CGCACTCGAC
GCCCATCGCG CCGGCATGGA GCGCGCGGCG CGCGAGGCCG ACTACCTGCG CCATGCCGCC
GACGAACTGA AGCAGCTCGC GCCGCAGGAC GGCGAGGAGA CCTCGCTGGC CGAGCGTCGC
ACCACCATGA TGCAGGGCGA GAAGATCGCC GCCGACCTGC GCGAGGCGCA GGAGGTTGTC
GGCGGGCATC ATTCGCCGGT CGCCGCGCTG GCCTCCGCGG TGCGCCGGCT GGAGCGCCGC
GCCGGCACCG CGCCGCAGCT GATCGAGCCC GCCGTGCGCG CGATCGACGC CGCCATCAAC
GCGTTGGAAG AAGCCGACCA GCATCTCAAC GCCGCGCTCG CCGCAGCCGA TTTCGACCCG
TTGGAACTGG AGCGGATCGA GGAGCGGTTG TTCGCGCTGC GCGCCGCCGC CCGCAAGTAT
TCGACCCCGG TGGATTCGCT CGCCGCGCTC GCCGCGCAAT ACGTCGCCGA TGTCGCGCTG
ATCGATGCCG GCGCCGACCG GCTGGTGGCG CTGGAGAAGG CCGCGGCCGA AGCCGACGCC
CGCTACGGCG CCGCCGCGGC GAAGCTGTCG GCCGCGCGCG CCAAGGCCGC CGACAAGCTC
AACAAGGCGG TCGGCGCAGA GCTGGCGCCG CTCAAGCTCG AACGCGCCAA GTTCATGACC
CAGGTCGAGG CCGACGAGGC CGCGCCGGGC CCGCAGGGCA TCGACCGCGT CGAATTCTGG
GTGCAGACCA ATCCCGGCAC GCGCCCCGGC CCGTTGATGA AGGTGGCGTC GGGCGGCGAG
CTGTCGCGCT TCCTGCTGGC GCTGAAAGTG GTGCTGTCCG ACAAGGGCTC GGCGCCGACT
TTGGTGTTCG ACGAGATCGA CACCGGCGTC GGCGGCGCGG TCGCGGACGC GATCGGCGCC
CGGCTGGCGC GGCTGGCCTC GAAGGTCCAG GTGATGGCCG TGACCCACGC TCCCCAGGTC
GCGGCGCGTG CCGATCAGCA TCTGCTGATC TCCAAGGCCG CCCTCGACAA GGGCAAACGC
GTCGCCACCC GCGTCGCCGC CCTGGAACAG GACCACCGCC GCGAAGAAAT CGCCCGCATG
CTGGCTGGTG CCGAGATCAC CGCCGAGGCG AGGGCTGCGG CGGACCGGCT GATCAAGGCG
GCGGGGTAG
 
Protein sequence
MLARLSIRDI VLIERLDIEF SRGLAVLTGE TGAGKSILLD AFALALGGRG DAALVRHGAA 
EHGQVTASFD VAKTHPAFAI LKANGLDDRE VDESGELILR RVQLADGRTR AFINDQSVSV
QTLKAVGATL VEIHGQHDER ALVDAATHRR LLDAFAGLEK DVVSLEALWE GRRTARAALD
AHRAGMERAA READYLRHAA DELKQLAPQD GEETSLAERR TTMMQGEKIA ADLREAQEVV
GGHHSPVAAL ASAVRRLERR AGTAPQLIEP AVRAIDAAIN ALEEADQHLN AALAAADFDP
LELERIEERL FALRAAARKY STPVDSLAAL AAQYVADVAL IDAGADRLVA LEKAAAEADA
RYGAAAAKLS AARAKAADKL NKAVGAELAP LKLERAKFMT QVEADEAAPG PQGIDRVEFW
VQTNPGTRPG PLMKVASGGE LSRFLLALKV VLSDKGSAPT LVFDEIDTGV GGAVADAIGA
RLARLASKVQ VMAVTHAPQV AARADQHLLI SKAALDKGKR VATRVAALEQ DHRREEIARM
LAGAEITAEA RAAADRLIKA AG