Gene RPD_2074 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPD_2074 
Symbol 
ID4022556 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris BisB5 
KingdomBacteria 
Replicon accessionNC_007958 
Strand
Start bp2323426 
End bp2325381 
Gene Length1956 bp 
Protein Length651 aa 
Translation table11 
GC content63% 
IMG OID637962267 
ProductABC transporter related 
Protein accessionYP_569210 
Protein GI91976551 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG5265] ABC-type transport system involved in Fe-S cluster assembly, permease and ATPase components 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCTCACC CACATTCGCA CTCGCAGAGC GGCCCCGCGG CCGCGATTCC TGAGGACGCT 
GTCGCGCAGA AGGCGACCTT GGGCGGCACG CTGGTGCACC TGTGGCCGTA TATCTGGCCG
GGCGACCGGG TCGACCTGAA GATGCGGGTG TTGTGGTCGG TGGTGCTGCT GCTCGTCGCC
AAGGCGGCGA CGCTGATTGT TCCGTTCACG TTCAAATGGG CGATCGACGC GCTCACCGGC
GCCGACACCG CGCCGATCGA GCCTTCGAAC TGGATGCTGT GGCTGTTCGC TTCGCCGCTT
CTCCTGACGT TGAGCTACGG TCTGGTGCGG GTGCTGATGG CGTTGCTGAC GCAATGGCGC
GACGGCCTGT TCGCCCAGGT CGCGATGCAT GCGGTACGCA AGCTCGCTTA TCGCACTTTC
GTGCACATGC ATGAATTGTC GCTGCGGTTT CACCTCGAGC GCAAGACCGG CGGCCTGACG
CGGGTGCTGG AGCGCGGCCG TCTCGGCATC GAAGTGATCG TGCGCATGGT GATCCTGCAA
CTGGTGCCGA CGATCGTCGA GCTGGCGCTG GTGATGGGCG TGCTGCTGTG GCAGTTCGAC
TGGCGTTACG TCGCGGTGAT CATGGTCACC GTCGTGGTCT ACATGTTCTA TACCTACAAG
GCGACCGAGT GGCGGATCGC GATCCGGCGA CGGATGAACG ATTCCGACAG CGACGCCAAC
CAGAAAGCGA TCGACTCGCT GTTGAACTAC GAGACCGTGA AGTATTTCGG CGCCGAGGAG
CGCGAGGCGC GGCGTTACGA CAAGTCGATG GAACGCTACG AGGACGCCAG CGTCAGCACC
TATACGTCGC TCGCGGTGCT CAATGCGGGG CAGGCGGTGA TCTTCACCTG CGGTCTGACG
GCGACGATGC TGATGTGCGC CGTCGGCATC CGCAACGGCA CCAACACCGT CGGCGATTTC
GTGATGATCA ACGCGATGAT GATCCAGTTC TATCAGCCGT TGAACTTCAT GGGCATGGTG
TATCGCGAGA TCAAGCAGGC GATCATCGAC ATCGAGAAGA TGTTCGCCGT GCTGTCGCGC
AATCCCGAAG TCCAGGACAA GGCCGACGCA AAGCCGCTGG TGGTCACCGA CGGCGTGGTG
AAGTTCGAGG ATGTACGCTT CGCCTATGAC CCGTCCCGCC CGATCCTCAA GGGCCTCAGC
TTCGAGGTTC CTGCCGGCAA GACGGTTGCG ATCGTCGGGC CTTCCGGCGC GGGCAAGTCG
ACGATTTCGC GGCTGCTGTT CCGCCTGTAC GACGTCTCCG GCGGCCACAT CCGGATCGAC
GGTCAGGATA TTCGCGACGT CACTCAGACA TCGCTGCGGG CGGCGATCGG CATGGTGCCT
CAGGACACCG TGCTGTTCAA CGACACCATC CGCTACAACA TCCGCTACGG CCGCTGGAAC
GCCTCCGACG CTGAAGTCGA AGAGGCGGCG CAGACCGCGC AGATCGACGC CTTCATCAAG
GCGTCGCCGA AGGGGTACGA AACCGAAGTC GGAGAGCGCG GCCTGAAGCT GTCGGGTGGC
GAGAAGCAGC GCGTCGCGAT CGCGCGAACC GTTCTCAAGT CGCCGCCGAT CCTGGTTTTG
GACGAAGCCA CCTCGGCGCT CGACAGTCAT ACCGAGCACG AGATCCAGGG CGCGCTGGAG
CGTGTGTCAC AGAACCGCAC CTCGCTGGTG ATCGCGCACC GGCTTTCGAC AATCGTCGGC
GCCGACGAGA TCATCGTGCT CGATCAGGGC CGGATCTCGG AGCGCGGCAC GCACGCCCAA
CTGCTTGAAC ATGGCGGCCT TTACGCGAGC ATGTGGAATC GGCAGCGCGA GGCCGAAGAG
GCCCGCGAGC GTCTGGCGAT GATTGGTGAC CAGGATTCAC CGGTTCGTTC CGCCATCATC
GACGATGATC TGGCAACTTC CGCGGCGGCA GAGTAA
 
Protein sequence
MAHPHSHSQS GPAAAIPEDA VAQKATLGGT LVHLWPYIWP GDRVDLKMRV LWSVVLLLVA 
KAATLIVPFT FKWAIDALTG ADTAPIEPSN WMLWLFASPL LLTLSYGLVR VLMALLTQWR
DGLFAQVAMH AVRKLAYRTF VHMHELSLRF HLERKTGGLT RVLERGRLGI EVIVRMVILQ
LVPTIVELAL VMGVLLWQFD WRYVAVIMVT VVVYMFYTYK ATEWRIAIRR RMNDSDSDAN
QKAIDSLLNY ETVKYFGAEE REARRYDKSM ERYEDASVST YTSLAVLNAG QAVIFTCGLT
ATMLMCAVGI RNGTNTVGDF VMINAMMIQF YQPLNFMGMV YREIKQAIID IEKMFAVLSR
NPEVQDKADA KPLVVTDGVV KFEDVRFAYD PSRPILKGLS FEVPAGKTVA IVGPSGAGKS
TISRLLFRLY DVSGGHIRID GQDIRDVTQT SLRAAIGMVP QDTVLFNDTI RYNIRYGRWN
ASDAEVEEAA QTAQIDAFIK ASPKGYETEV GERGLKLSGG EKQRVAIART VLKSPPILVL
DEATSALDSH TEHEIQGALE RVSQNRTSLV IAHRLSTIVG ADEIIVLDQG RISERGTHAQ
LLEHGGLYAS MWNRQREAEE ARERLAMIGD QDSPVRSAII DDDLATSAAA E