Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | RPB_0468 |
Symbol | |
ID | 3909813 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris HaA2 |
Kingdom | Bacteria |
Replicon accession | NC_007778 |
Strand | - |
Start bp | 514959 |
End bp | 516500 |
Gene Length | 1542 bp |
Protein Length | 513 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 637882355 |
Product | hypothetical protein |
Protein accession | YP_484090 |
Protein GI | 86747594 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG1807] 4-amino-4-deoxy-L-arabinose transferase and related glycosyltransferases of PMT family |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.101632 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 0.77905 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACCTCGA TCGCCACGTC CCGCGCCCGC CTGTCGCTGA TCGCGCGCTG GATCGACGCC CTGCTCGATC CCAAACGGCA GGAGCGTACC GTCATCCTGT CGCTGGCGGT CTATGCGGCG ATCTGGACCG CGTACCGCAC CATCGCCACC TGGCCGCGCG ACCTGCACGC CGACGAAACC GAGCTGTACG CCTGGTCGCA GCATCTGGCG TTCGGCACTG ACAAGCATCC GCCGTTCTCG GCCTGGGTTG CACGCGCCTG GTTCAGTGTC GTGCCGGTGT CGGATCTGAC GTTCCATCTG CTGGCCACCG TCAACATCGC CGTCACGCTG TACATCGCCT GGCGGACGAT GCGGCGCTAT ATGACGGCCG AGAAGGCGCT GTTCGGGCTG GCGACACTGA CGCTGATCCC GTTCTTCAAT TTCATCGCGC TGAAATACAA CGCCAATGCG GTGCTGCTGC CGCTGTGGGC GCTGACCATC CATGGTTTTC TGCGCGCGTT CGAACAGCGC GGCTGGCTGT GGCCGACGCT GGCGGGGGTG TTCGCCGGCG CGTCGATGCT GGGCAAATAC TGGTCGATCG TGCTGGTCGG TTCGCTCGGG CTCGCCGCGC TGCTGGATCG GCGGCGGGCG CGGTTCTTCG CCTCCCCGGC GCCGTGGCTG ATGATCGTCG CGGGCGGGCT GGTGCTGGCG CCGCATGTCG CCTGGCTGGT CGAGCACCGC TTCCCGACCT TTGCCTATGC GGCGGCGCGG GAAGCCGACG GCCTCGGGCA CAATGCGCTC GACACGCTAC GCTATCTCGC CGGCTGTGTC GGCTATGCGG CGCTGGCGCT GATCGCCACC TGGCTGCTGC TGCGGCCGTC GCGCGCGGCA TTGATCGAGA GCGTCTGGCC GGCGGACCCG CAGCGCCGGC TGATCGTCAC GATCCAGGTG CTGATGATCG TGGCGCCGGC GCCGGTGGCG CTCGTGACCG GCATCCGCAT CGTGCCGCTG TGGACGATGC CGGCCTGGAC GCTGCTGCCG ATCGTGCTGC TGTCGTCGCC GCTGATCGCG GTCGGCCGCG ACGCGCTGCG GCGGATGCTG ATCGGGGCCG CGGCGCTGGC GCTGACGATC CTCGCCGCGG CGCCGGGCGT GGCGGTGGCG ATCCACAGCA GCAGCCCGCC GGAGCCGTTC GAATACGCTT CCTTGCTCGC CGACGACATC GCGCGGGTCT GGCAGCGCCA CACCGACAGG CCGATCGCAC TGGTGGCGGG CGAAACCGTG CTGGCGCAGA ACACCGCGTA TTATCTGCGC ACCGACAGCC GCGCCTTCGC GACCGCCGAT CTGGCGACGC TGAAAGCCGA CGCCGCCGCG CGCGGCGCGG CGCTGGTGTG CCCGGCGGCG GATCAGTCCT GCCTGTCGGT CGCCGAGCAG ATCGTCGCGG CGCAGCCGCA GATCCTGCGC AGCAAGGTCT GGCTCAGCCG GCCGCTGCTC GGGATCGCCG GCGGCACGGT GCAGGACGTG TTCTTCCTGG TGCTGCCGCC ATCAGCGACG GGGAAGACGT AG
|
Protein sequence | MTSIATSRAR LSLIARWIDA LLDPKRQERT VILSLAVYAA IWTAYRTIAT WPRDLHADET ELYAWSQHLA FGTDKHPPFS AWVARAWFSV VPVSDLTFHL LATVNIAVTL YIAWRTMRRY MTAEKALFGL ATLTLIPFFN FIALKYNANA VLLPLWALTI HGFLRAFEQR GWLWPTLAGV FAGASMLGKY WSIVLVGSLG LAALLDRRRA RFFASPAPWL MIVAGGLVLA PHVAWLVEHR FPTFAYAAAR EADGLGHNAL DTLRYLAGCV GYAALALIAT WLLLRPSRAA LIESVWPADP QRRLIVTIQV LMIVAPAPVA LVTGIRIVPL WTMPAWTLLP IVLLSSPLIA VGRDALRRML IGAAALALTI LAAAPGVAVA IHSSSPPEPF EYASLLADDI ARVWQRHTDR PIALVAGETV LAQNTAYYLR TDSRAFATAD LATLKADAAA RGAALVCPAA DQSCLSVAEQ IVAAQPQILR SKVWLSRPLL GIAGGTVQDV FFLVLPPSAT GKT
|
| |