Gene Information |
Locus tag | RPB_3368 |
Symbol | |
ID | 3911170 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris HaA2 |
Kingdom | Bacteria |
Replicon accession | NC_007778 |
Strand | - |
Start bp | 3850187 |
End bp | 3852154 |
Gene Length | 1968 bp |
Protein Length | 655 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 637885271 |
Product | ABC transporter related |
Protein accession | YP_486975 |
Protein GI | 86750479 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG5265] ABC-type transport system involved in Fe-S cluster assembly, permease and ATPase components |
TIGRFAM ID | |
|
|
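The length fields in the table above are consistent with one another, and the arithmetic can be checked with a short sketch. It assumes the start and end coordinates are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. The function names `gene_length` and `expected_protein_length` are hypothetical helpers introduced only for this illustration.

```python
# Values copied from the Gene Information table.
START_BP = 3850187
END_BP = 3852154


def gene_length(start: int, end: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(length_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one trailing stop codon."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                           # 1968
print(expected_protein_length(length_bp))  # 655
```

Both results match the reported Gene Length (1968 bp) and Protein Length (655 aa).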
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 4 |
Fosmid unclonability p-value | 0.00530466 |
Fosmid Hitchhiker | No |
Fosmid clonability | decreased coverage |
| |
Sequence |
Gene sequence | ATGGCTCATC CACATTCGCA TTCGCAAAGC GGCAGTCCTG CTGCGACTCC CGAGGACGCC GTCGCGCAGA AGGCGACGCT GGGCGGGACC CTGATGCATC TGTGGCCGTA TATCTGGCCG GGCGACCGGT TCGACCTGAA GATGCGGGTG GCCTGGTCGG TGGTGTTGCT GCTCGTCGCC AAGGCGGCGA CGCTGGTGGT CCCGTTCACG TTCAAATGGG CGACCGACGC GCTGACCGGC GCGGACACCG CACCGGTCGA ACCCTCGAAT TGGACGCTGT GGCTGCTGGC GTCGCCGCTG GCGCTGACGA TGAGCTACGG CCTGACGCGG GTGCTGATGG CGGTGCTGAC GCAGTGGCGC GACGGCATGT TCGCCCAGGT GGCGATGCAT GCGGTGCGCA AGCTCGCCTA TCGCACCTTC GTTCACATGC ACGAATTGTC GCTGCGGTTT CACCTCGAGC GCAAGACCGG CGGCCTGACC CGCGTGCTGG AGCGCGGCCG GCTCGGCATC GAAGTCATCG TGCGGATGGT GATCCTGCAA CTGGTGCCGA CGATCATCGA GCTGACGCTG GTGATGGCGG TGCTGCTGTG GCAGTTCGAC TGGCGCTATG TCGCGGTGAT CATGGCCACT GTCGTCGTCT ATATGTTCTA CACCTACAAG GCGACCGAAT GGCGGATCGA GATCCGCCGC CGGATGAACG ATTCCGACAG CGACGCCAAC CAGAAGGCGA TCGACTCGCT GCTCAACTAC GAGACCGTGA AGTATTTCGG CGCCGAGGAA CGCGAGGCGA AGCGCTACGA CAAGTCGATG GAGCGCTACG AAGGCGCCAG CGTCAGCACG TACACGTCGC TGGCGGTGCT CAATGCCGGG CAGGCGGTGA TCTTCACCTT CGGCCTGACC GCGACGATGC TGATGTGCGC CGTCGGAATC CGCAATGGCA CCAACACCGT CGGCGATTTC GTCATGATCA ACGCGATGAT GATTCAGTTC TATCAGCCGT TGAACTTCAT GGGCATGGTG TATCGCGAGA TCAAGCAGGC GATCATCGAC ATCGAGAAGA TGTTCGCGGT GCTGTCGCGC AATCCAGAGG TCAAGGACAA GCCGGGCGCC GAGCCGCTGG TCGTCACCAA CGGCACCGTG CATTTCGACG ACGTCCGTTT CGCCTACGAT CCGTCGCGGC CGATCCTGAA GGGCCTCAGC TTCGAGGTGC CGGCCGGCAA GACGGTCGCG ATCGTCGGTC CGTCGGGCGC CGGCAAGTCG ACGATCTCGC GGCTCCTGTT CCGGCTTTAT GACGTATCCG GCGGCCATAT CCGGATCGAT GGTCAGGACA TTCGTGACGT GACCCAGAAT TCGTTGCGGG CTGCGATCGG CATGGTGCCG CAGGATACCG TCCTGTTCAA CGACACCATC CGCTACAACA TCCGCTATGG CCGTTGGGAC GCCACCGACG AAGAGGTCGA GGAAGCGGCG AGGACGGCGC AGATCGATAC GTTCATCAAG GCGTCGCCGA AGGGCTACGA GACCGAGGTC GGCGAGCGCG GGCTGAAATT GTCCGGCGGC GAAAAGCAGC GTGTCGCGAT TGCGCGAACC GTTCTCAAGT CGCCGCCGAT CCTCGTGCTG GACGAAGCCA CTTCGGCGCT CGACAGCCAC ACCGAGCACG AAATCCAGGG CGCGCTGGAG CGTGTGTCAC AGAACCGCAC CTCGCTGGTG ATCGCGCACC GGCTGTCGAC CATTGTCGGC GCCGACGAGA TCATCGTGCT CGATCAGGGC CGGATCTCCG AGCGCGGCAC GCATGCTCAG 
CTGCTCGAAC ATGGCGGGCT CTATGCCAGC ATGTGGAACA GGCAGCGCGA GGCCGAAGAG GCGCGCGAGC GTCTGGCGAT GATCGGCGAC GACGTCGTCC CGAACGATTC ACCGGTTCGT TCGCCGGCGA TCGACGACGA TCTGGCAACT TCCGCGGCGG CGGAGTAA
|
Protein sequence | MAHPHSHSQS GSPAATPEDA VAQKATLGGT LMHLWPYIWP GDRFDLKMRV AWSVVLLLVA KAATLVVPFT FKWATDALTG ADTAPVEPSN WTLWLLASPL ALTMSYGLTR VLMAVLTQWR DGMFAQVAMH AVRKLAYRTF VHMHELSLRF HLERKTGGLT RVLERGRLGI EVIVRMVILQ LVPTIIELTL VMAVLLWQFD WRYVAVIMAT VVVYMFYTYK ATEWRIEIRR RMNDSDSDAN QKAIDSLLNY ETVKYFGAEE REAKRYDKSM ERYEGASVST YTSLAVLNAG QAVIFTFGLT ATMLMCAVGI RNGTNTVGDF VMINAMMIQF YQPLNFMGMV YREIKQAIID IEKMFAVLSR NPEVKDKPGA EPLVVTNGTV HFDDVRFAYD PSRPILKGLS FEVPAGKTVA IVGPSGAGKS TISRLLFRLY DVSGGHIRID GQDIRDVTQN SLRAAIGMVP QDTVLFNDTI RYNIRYGRWD ATDEEVEEAA RTAQIDTFIK ASPKGYETEV GERGLKLSGG EKQRVAIART VLKSPPILVL DEATSALDSH TEHEIQGALE RVSQNRTSLV IAHRLSTIVG ADEIIVLDQG RISERGTHAQ LLEHGGLYAS MWNRQREAEE ARERLAMIGD DVVPNDSPVR SPAIDDDLAT SAAAE
|
| |
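The relationship between the gene sequence and the protein sequence can be illustrated with a minimal translation sketch. It assumes the gene sequence is listed in coding orientation (it begins with ATG even though the gene lies on the minus strand) and that the amino-acid assignments of translation table 11 match the standard code, with the two tables differing only in which start codons are permitted. The helper names `clean`, `translate`, and `gc_percent` are hypothetical and not part of any database tool.

```python
# Codon table built from the compact NCBI-style layout (base order T, C, A, G).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group the sequence in blocks."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = clean(dna)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First three 10-base blocks of the gene sequence, as listed in the record.
first_blocks = "ATGGCTCATC CACATTCGCA TTCGCAAAGC"
print(translate(first_blocks))  # MAHPHSHSQS
```

The output matches the first block of the listed protein sequence. Passing the full gene sequence to `translate` and `gc_percent` would give values to compare against the listed protein sequence and the reported GC content.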