Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | RPB_4142 |
Symbol | |
ID | 3911950 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris HaA2 |
Kingdom | Bacteria |
Replicon accession | NC_007778 |
Strand | + |
Start bp | 4716515 |
End bp | 4717708 |
Gene Length | 1194 bp |
Protein Length | 397 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 637886046 |
Product | carbamoyl phosphate synthase small subunit |
Protein accession | YP_487745 |
Protein GI | 86751249 |
COG category | [E] Amino acid transport and metabolism [F] Nucleotide transport and metabolism |
COG ID | [COG0505] Carbamoylphosphate synthase small subunit |
TIGRFAM ID | [TIGR01368] carbamoyl-phosphate synthase, small subunit |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.164526 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 16 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACCAATT CAGCATCCAA TCCCGCCTGG CCGGACCACA AACCGACCGC GCTGCTCGTG CTCGCCGATG GCACCGTGTT CGAGGGCTTC GGCCTCGGCG CGGAGGGCCA CGCCGTCGGC GAGGTCTGCT TCAACACCGC GATGACCGGC TATGAGGAGA TCCTCACCGA CCCGTCCTAT GCCGGCCAGC TCATCACCTT CACCTTCCCG CATATCGGCA ATGTCGGCAC CAACGACGAG GATATCGAGA CGGTGAACAT GGCGGCGACG CCGGGCGCGC GCGGCGTGAT CCTGCGCGAC GCCATCACCG ACCCGTCGAA CTATCGCTCG TCGCGGCATC TCGACGGCTG GCTGAAAGCG CGCGGCATCA TCGGCCTGTC GGGCATCGAC ACCCGCGCCC TGACCGCGCT GATCCGCGAC AAGGGCATGC CGAATGCGGT GATCGCCCAT GCGCCGGACG GCAAGTTCGA CTTGCACGCG CTGAAGGAAG AAGCCCGCGA ATGGCCCGGC CTCGAAGGCA TGGACCTGGT GCCGATGGTG ACCTCGGCGC AGCGCTTCAG CTGGGACGAG ACGCCGTGGG CCTGGGGCGA AGGCTTCGGC CGGCAGGACA ATCCGGAATT CCACGTCGTG GCGATCGACT ACGGCGTCAA GCGCAACATC CTGCGGCTGC TCGCCGGCGA AGGCTGCAAG GTCACGGTGG TGCCGGCGAC CACCTCGGCC GACGACATCC TGGCGCTGAA GCCGGACGGC GTGTTCCTGT CGAACGGCCC GGGCGACCCG GCGGCAACCG GCAAATACGC GGTGCCGGTG ATCCAGCAGG TGATCAGTTC CGGCGTGCCG ACCTTCGGCA TTTGTCTCGG CCACCAGATG CTCGGCCTCG CGCTCGGCGG CAAGACCGTG AAGATGCATC AGGGCCACCA CGGCGCCAAT CATCCGGTCA AGGATCTCAC CACCGGCAAG GTCGAGATCA CCTCGATGAA CCACGGCTTC GCGGTCGACA AGACCACGCT GCCGGCCAAT GTGCAGCAGA CCCACGTGTC GCTGTTCGAC GACAGCAATT GCGGCATCGC GCTGTCCGAC CGGCCGGTGT TCTCGGTGCA GTACCACCCG GAAGCCTCGC CGGGCCCGCG CGACTCGCAT TATCTGTTCC GCCGGTTCTC GGACCTGATG CGGGCGAAGA AGAGCGCGGC GTAA
|
Protein sequence | MTNSASNPAW PDHKPTALLV LADGTVFEGF GLGAEGHAVG EVCFNTAMTG YEEILTDPSY AGQLITFTFP HIGNVGTNDE DIETVNMAAT PGARGVILRD AITDPSNYRS SRHLDGWLKA RGIIGLSGID TRALTALIRD KGMPNAVIAH APDGKFDLHA LKEEAREWPG LEGMDLVPMV TSAQRFSWDE TPWAWGEGFG RQDNPEFHVV AIDYGVKRNI LRLLAGEGCK VTVVPATTSA DDILALKPDG VFLSNGPGDP AATGKYAVPV IQQVISSGVP TFGICLGHQM LGLALGGKTV KMHQGHHGAN HPVKDLTTGK VEITSMNHGF AVDKTTLPAN VQQTHVSLFD DSNCGIALSD RPVFSVQYHP EASPGPRDSH YLFRRFSDLM RAKKSAA
|
| |