Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rpal_4370 |
Symbol | |
ID | 6412054 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris TIE-1 |
Kingdom | Bacteria |
Replicon accession | NC_011004 |
Strand | + |
Start bp | 4696164 |
End bp | 4698122 |
Gene Length | 1959 bp |
Protein Length | 652 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 642714252 |
Product | protein of unknown function UPF0118 |
Protein accession | YP_001993341 |
Protein GI | 192292736 |
COG category | [R] General function prediction only |
COG ID | [COG0628] Predicted permease |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAGCTGG GCAAAGCCCA ATCCCTGGAA GACGTCGCCG GTCTGGTCGG CGCCTGCGCG GTCACGATCC TGGCCGTCAT CATCATCTCG GCGCTGTATG TCGGCCGAGA GGTGTTCGTC CCGGTCGCCC TGGCGATCCT GCTGAGCTTC GTGCTGGCTC GCCCGGTCAA CTTCCTGCAG TCACTGCGGG TGCCGCGGGC AATCGCGGCG ATCACCACCG TCCTTTTCGC CTTCGCGGTG ATCTTCGCGC TCGGCAGCCT GATCGCGACG CAGCTGTCGC GGCTGGCCGA CGACCTGCCG CAGTACCAAT CGACGATCCA ATCCAAGATC ACCTCGCTGC GCGGCGTGAC CGGCGGCTCC ACGACGCTCG AACGCGCCGA GGGGATGCTG CAGAATCTCA GCAAGGAACT GAACAAACCG AAGAACGCGC CCGCGCCGTC GCTGAGCAAT CCGCCGACGA CGTCGTCGAG ACCGGTCACC CCCGTTCCGG TCGAAGTCCT GCAGCCCGAC CCGGGGACCT TGGCGAACCT GCGATCTCTG ATCGCGCCGC TGATCTCGCC GCTGGCGACG ACCGGCATCA TCGTGATCTT CGTCATCTTC ATCCTGTTGC AGCGGGAGGA CCTGCGCAAT CGGCTGATCC GGCTCGCCGG CACCCGCGAT CTGCAGCGCA CGACCGCGGC GCTGGATGAT GCCGCCAGCC GGCTGAGCCG CTTGTTCCTC AATCAGCTGC TGATCAACTC CGGCTTCGGC GTGCTGATCG GCACCGGGTT GTGGATCATC GGCGTGCCGA GCCCGGCGCT GTGGGGCATT CTCGCCGCGG TGCTGCGCTT CGTGCCGTAT ATCGGATCGA TCATCTCAGC GGCCTTCCCA CTGACCCTGG CGGTCGCGGT CGATCCCGGC TGGTCGATGC TGGTGTGGAC GGCGATCCTG TTCTTCGTGA TCGAACCGGC GATCGCCCAT GTCGTCGAGC CGATGGTGTA CGGCCGTAGT ACCGGGCTGT CGCCGGTCGC CGTGGTGATC TCGGCGACGT TCTGGACGGC GCTGTGGGGC CCGATCGGCC TCGTTCTCGC CACGCCGCTG ACGGTGTGTC TCGTCGTGCT CGGGCGGCAC GTCGAGCGGT TGGCGTTTCT CGACGTAATG TTCGGTGATC GGCCGGCGCT ATCGCCGCCG GAGATCTTCT ATCAGCGCAT GCTGGCCGGC GACCCGGCCG AAGCCGCCGA GAAGGCCGAG CAATTTCTCA AAGAACGGTC GCTGTCGTCG TATTACGACG ACGTCGCCCT GAAAGGCCTG CAACTAGCCC AGGCCGACCT CGATCGCGAC GCACTCGACG CCGTGCGCCT GACGCGGATC AAGGAGACGG TGCAGGAGTT CACCGAGGAC CTCACGGACG AAATCGATCA GGCGCCGGAC GGCGACGAAG CCACCACCGA CGCCGAGGCT GCTGCCGCCG TCGAAGTGAC GCCGGTCGAT CACGCCGACG ACGACATCGC AGTGCTGAAG CCCGCCGACC TCAAGCCTGG ATGGCAAGGC GCCGCACCGG TGATGTGCAT CGGCGGACGG TCGCAATTGG ACGAAGCCGC GGCGCTGATG CTGGCGCATT TGTGCCGCGT GCACGGCATC GGCGCCCGTG TCGAGCCATC GAGCGCGCTG TCCACCAAGA ACATCTTTGG CCTCGACGTC TCGAACGTCG CGCTGATCTG CCTGTCGTAT CTCGAGGCGT CGAACACGAC CCATATCCGC TACGCCGTCC GCCGCCTGCG TCGCAAGGCG CCGCACGCCA AGATCATCGT CGCTTTGTGG TCGGCGGAGA CTCCGCAACT GGCCGATACC AACGAATCCG CGCAGGCCGA CGCGACGGTG CTGACGCTGC GGGACGCCGT GAAATACTGC GTCGAAGAGG CGATCATTGA GCCGCCGCCA CAGACGATCG AAATGCCGGT GATCAGCGAG GCTGTGTAG
|
Protein sequence | MKLGKAQSLE DVAGLVGACA VTILAVIIIS ALYVGREVFV PVALAILLSF VLARPVNFLQ SLRVPRAIAA ITTVLFAFAV IFALGSLIAT QLSRLADDLP QYQSTIQSKI TSLRGVTGGS TTLERAEGML QNLSKELNKP KNAPAPSLSN PPTTSSRPVT PVPVEVLQPD PGTLANLRSL IAPLISPLAT TGIIVIFVIF ILLQREDLRN RLIRLAGTRD LQRTTAALDD AASRLSRLFL NQLLINSGFG VLIGTGLWII GVPSPALWGI LAAVLRFVPY IGSIISAAFP LTLAVAVDPG WSMLVWTAIL FFVIEPAIAH VVEPMVYGRS TGLSPVAVVI SATFWTALWG PIGLVLATPL TVCLVVLGRH VERLAFLDVM FGDRPALSPP EIFYQRMLAG DPAEAAEKAE QFLKERSLSS YYDDVALKGL QLAQADLDRD ALDAVRLTRI KETVQEFTED LTDEIDQAPD GDEATTDAEA AAAVEVTPVD HADDDIAVLK PADLKPGWQG AAPVMCIGGR SQLDEAAALM LAHLCRVHGI GARVEPSSAL STKNIFGLDV SNVALICLSY LEASNTTHIR YAVRRLRRKA PHAKIIVALW SAETPQLADT NESAQADATV LTLRDAVKYC VEEAIIEPPP QTIEMPVISE AV
|
| |