Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rpal_4017 |
Symbol | |
ID | 6411700 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris TIE-1 |
Kingdom | Bacteria |
Replicon accession | NC_011004 |
Strand | - |
Start bp | 4302706 |
End bp | 4306530 |
Gene Length | 3825 bp |
Protein Length | 1274 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | 642713899 |
Product | AsmA family protein |
Protein accession | YP_001992988 |
Protein GI | 192292383 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.43483 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGCAGACGA CGCTGCTCGG CCTGGCTATC GCGCTGATCC TGGCGCTGCT CGCCGCGCTG GTCGGGCCGT ATCTGATCGA CTGGAACCAG TTTCGCACTC CGTTCGAAGC GGAGGCGACG CGGCTGCTCG GCGCCCAGGT TCGGGTTGCC GGTGCGCTCG ATGCGCGGCT GCTGCCGACG CCGTCGCTGC GACTGCAGCG TGTGACCGTC GGCGGCGCCA ACGATGCGGG CCGGTTCGCC GCCGCCAAGC TCGATGTCGA ATTCAGCCTC GGCTCGCTGA TGCGCGGCGA ACTGCGCGCC GACGAACTGT CGATCAACGG GCTGTCGCTC GATCTCGGGC TCGATCCGCA GGGCCGGCTT GACTGGCCGG CGGTGGGCTT CTCGAACCTC GCCGCGCTCA CCGTCGACCG GCTCAATCTC ACCGGCCGGA TCGCGCTGCA CGATGCCGCC AGCCGCAGCA ATCTCGAACT CAGCGACATC GCGTTCTCCG GCGAGCTGCG CGCGCAAGCC GCGGCGCTGC GCGGCGACGG CAATTTCCTG CTCGGCGGCC AGCGCTATCC GTTCCGGCTG TCGACCAGCC GCGCGGCGGA TGGCGATGCG ATCCGGCTGC GCGCCGGGAT CAACCAGGGC GCCGGCGGGA TCGCGGCCGA TCTCGACGGC TTGGTGAGGT TCGGGCAAAG CCGGCCCCTG TTCGACGGCA CAGCTTCGCT GGCGGCGCTT GCGGCAGCGA CGGGCGAGAC GCCGCGGACG CCGTGGCGGC TCGCCGCCAA GGTCGAGGCG GATCCGGCGC TCGCCAAGCT CGATCAGGTC GAGGTCTCCT GGGGCGCTGA CGACCGCGCG CTGAAGCTGA CCGGCAGCGG CGAAGCGCGG TTCGGCGCTG CGCCGCGGCT CGACGCCAAG CTGTCGGCGC GGCAGCTCGA TGCCGACCGG CTGCTGGCGA AGCCCGGCGA TGTCGCCCAG CCCTTGCCCT GGCTGGTGCG GCTGCGCGAT CTGGCGGAGT CGCTGCCGCT GCCGTCGTTG CTGGCGCGGG TGTCGCTCGA CGCCGATCAG ATCATGATCG GCAACCGGCC GGTCACCGAT CTCGTCGCCG AACTGCGCGG CGACTCGCAG ACCTGGCAGC TCGAGCGGCT GGAGCTGCGT GCCCCCGGCG GCACCCGGCT GGCGCTGCGC GGCGGCGCTG GCGCCGCGGA TGCAGCGATC ACCGGCACGC TCGATCTGTC CGCGACCGAT CCGGATCTGC TCGCCGCTTG GCTGCAGGGC CGCGCGCCGA CCGCCGGCAG TCTCAGCAAG GCGCTGCGGA TCGCCGGTGG TGTCCGCGCC GGCTCCGACG CACTGGTGCT CGATCCGCTG ACGGTGCAAA GCGGCGGCGA CACGCTGCGC GGTCGCTTCG CCTATCGGGC GCGGCGTGAT GATCAGGGGA CGCAGATCGA CGCTGAGCTG AAGGGCGACA GCGCCGATCT CGACGCGGCG CTGCAATGGG CGCGCACGCT CGCCGGATCT GACGGCGGCT GGCCGGAGCA GGCCAGCCTC ACGCTCGACC TCGGCAAGCT CAGCGTCGGC GATCAGGTCT GGCAGCCCGC TGCACTGCGG CTGGCGTACG ATCCGCACCG GATCGCGCTC GACCGGCTGA AGCTCGGCAC CGCGGGCGGG CTGCTGTTGG AAGGCGACGG CGCCCTCGAC CGCGACAACG CCACCGGCAG GCTGGCGCTG TCGACGCGGG CGCCATCGCT CGGTCCGATC GCCGCGGCGA TCGATCCAAT CGCGCCGGCC ATCGCGGCAC GATTGAAGGC GCTGCCGACC GACCCCGGCG AAGTACGCGC CAAGCTGGCG TTGGCGGTGG AGAAGGCCGG TGCGGTGCGC GATCACGTCG AAGCCAGCGG CGCGCTCGAC CTTGCCGGCC CGCAACTAAG TGGTCGTTTG ACCGCAAAGG CGAGCCCGCC TGCTGCTGAT CTGCGCAAGC TCGATGTCGA GGCGCTGACG CGTCGTGCGC TCCAACTCGA CGGGGAGCTG TCGGCACCGC GTGGTGAGGC GCTGCTGTCG TTGCTCGGGC TCGACCGCGT CATCGCCGCC GGCGACGGCG CGGCGACGCT GCACGCGTCC GGCTCCGGCA CGTGGCGAGG CAAGCTGCAA GGCAAAGCCA AGCTGACCGC GGCGCGGCTC GACGCCGAGG CGAGCGGCGA GGTCGAGCCG TTCGCGGCCG AGCCGAAGGC GTCGCTGTCG CTCAATGCGC GCAAGCTCGA CCTGGCGCCG CTGTTCGATC TGCCGCCGTC CGTCGCAAAC GAAGCGCCGC TGTCGCTCTC GACCCAGCTG GCGGTCGCGG GCAGCAAATG GACCTTCAAA GACATCGATG CCGGCATCGC CGGCTCGCGG CTGCGCGGCC GGTTGGCGCT GACCCGGGGC GAGACCGCCG AACTCGACGG CGAAGCGGGG ATCGACACGC TGCCGCTGGG ACCGGCGCTG CAACTCGCGC TCGGAGCCGC CGGTCGTCCC GCCGACGAGC CGCTCGGGCA GGGCTGGCTG CGCGGTTGGC GCGGTAAGAT CGCGTTCCAG GCGGTGCGCG CCGAACTGCC GGGCGGCAGC GAACTGCAAC CGCTGCGCGG GGTGGTGCGC AGCGACGGGC GATCGCTGAC GCTCGACAGC AGTGGCAAGC TTGGCGGCGG CGACGCCAAG GCGGTGCTGA CCGCCAAACT GGTCGAGGCC GGTGTGGCGC TCGACGCGCA GCTCACCTTG AAGGATGCCG ACGCCGCAGC GCTGCGCGAC GGCACGCTTG CCCTGCCGCC GGGGAGGCTG TCGCTGCAAG GAACGCTGGC GAGCAGCGGT CGCAGCGCCT CGGCGCTCGC CGGCGCGCTG TCCGGCGGCG GCACAGTGAC GCTGTCGCAG GCGGCGATCT CAGGGCTCGA TCCGAGCGCC TTTGCCGTCG CGATCCGCGC CGCCGATCAA GGACAGCCGA TCGATGCCGA CCATCTGGCC AAGCTGATCG AGCCGGCGCT GAAGGCCGGG CCGCTGAAGG TCGACAGCGC GCAGTTTCCG ATCAGTGTCG GCGACGGCCG CCTGAAGCTG GCGCCGACCA CGCTGCAGGC CAAGGACGCG CGCGCGGTAG TGTCGGGGGG CTACGATGTG GCGGCGGGGC AGGCCGATGT GCGGGTGACG CTGATTTCGA CCGAGCCGGC CCAGGAGTTG CCACCGGAGA TCCGGGTGTT CGCGGCCGGA CCGCCGGACC GGATGGAGTG GTCTGTCGAT CTGTCAGGAT TGTCGTCGTG GCTGTCGATC CGCCGGATCG ATCGCGAGAC CCGCAAGCTG CAGATGCTGG AACAGGGGGG CAAACCGCCG GTCGGCCCTG ATCCGGCCGC CGCGACTCAG GCGCCAAGCA ATGCCGACAA GTCTCCGGCC CAGACCTCGA GCCTGCCGCC GCCAGCCGCA GCAGCCATGC CACAGTCCGC GCCCACGCCG ACTCGCGTGC CAGCGGGCAA TCATCAGACC GTGCCCAAAA GCGCGCCCGA GGCTGCGCCG CAGCCAAGTG CTCCCGCCAA CGGGCAGGAT GCCGCGATCA GTCCGACGCA GCCGTTTACG CCGCCGTTGC CGGACGCCGA TCCGCGTCGC GCTACACCAC CGAAGCCTCG CGTCCAGCCG CAACGACCGT CGCAACCTCA GCCGCAAACG CCACTTCAGC CACAAGCATC GACCGCCGCG TCTTCGACAT TGCGCGAAAA GCTCGCGCCA CTGCCGCCGC CTTTGGAGAT CAAACCTGCA CCAGGAGACA CGCGCCCGTC ACGGCCACGT CCGCCCTTGG TGTTGTCGCC CCCGAATGCG AGGGCGACCG ACTGA
|
Protein sequence | MQTTLLGLAI ALILALLAAL VGPYLIDWNQ FRTPFEAEAT RLLGAQVRVA GALDARLLPT PSLRLQRVTV GGANDAGRFA AAKLDVEFSL GSLMRGELRA DELSINGLSL DLGLDPQGRL DWPAVGFSNL AALTVDRLNL TGRIALHDAA SRSNLELSDI AFSGELRAQA AALRGDGNFL LGGQRYPFRL STSRAADGDA IRLRAGINQG AGGIAADLDG LVRFGQSRPL FDGTASLAAL AAATGETPRT PWRLAAKVEA DPALAKLDQV EVSWGADDRA LKLTGSGEAR FGAAPRLDAK LSARQLDADR LLAKPGDVAQ PLPWLVRLRD LAESLPLPSL LARVSLDADQ IMIGNRPVTD LVAELRGDSQ TWQLERLELR APGGTRLALR GGAGAADAAI TGTLDLSATD PDLLAAWLQG RAPTAGSLSK ALRIAGGVRA GSDALVLDPL TVQSGGDTLR GRFAYRARRD DQGTQIDAEL KGDSADLDAA LQWARTLAGS DGGWPEQASL TLDLGKLSVG DQVWQPAALR LAYDPHRIAL DRLKLGTAGG LLLEGDGALD RDNATGRLAL STRAPSLGPI AAAIDPIAPA IAARLKALPT DPGEVRAKLA LAVEKAGAVR DHVEASGALD LAGPQLSGRL TAKASPPAAD LRKLDVEALT RRALQLDGEL SAPRGEALLS LLGLDRVIAA GDGAATLHAS GSGTWRGKLQ GKAKLTAARL DAEASGEVEP FAAEPKASLS LNARKLDLAP LFDLPPSVAN EAPLSLSTQL AVAGSKWTFK DIDAGIAGSR LRGRLALTRG ETAELDGEAG IDTLPLGPAL QLALGAAGRP ADEPLGQGWL RGWRGKIAFQ AVRAELPGGS ELQPLRGVVR SDGRSLTLDS SGKLGGGDAK AVLTAKLVEA GVALDAQLTL KDADAAALRD GTLALPPGRL SLQGTLASSG RSASALAGAL SGGGTVTLSQ AAISGLDPSA FAVAIRAADQ GQPIDADHLA KLIEPALKAG PLKVDSAQFP ISVGDGRLKL APTTLQAKDA RAVVSGGYDV AAGQADVRVT LISTEPAQEL PPEIRVFAAG PPDRMEWSVD LSGLSSWLSI RRIDRETRKL QMLEQGGKPP VGPDPAAATQ APSNADKSPA QTSSLPPPAA AAMPQSAPTP TRVPAGNHQT VPKSAPEAAP QPSAPANGQD AAISPTQPFT PPLPDADPRR ATPPKPRVQP QRPSQPQPQT PLQPQASTAA SSTLREKLAP LPPPLEIKPA PGDTRPSRPR PPLVLSPPNA RATD
|
| |