Gene RPB_4142 details


Gene Information

Locus tag: RPB_4142
Symbol:
ID: 3911950
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodopseudomonas palustris HaA2
Kingdom: Bacteria
Replicon accession: NC_007778
Strand:
Start bp: 4716515
End bp: 4717708
Gene Length: 1194 bp
Protein Length: 397 aa
Translation table: 11
GC content: 67%
IMG OID: 637886046
Product: carbamoyl phosphate synthase small subunit
Protein accession: YP_487745
Protein GI: 86751249
COG category: [E] Amino acid transport and metabolism
              [F] Nucleotide transport and metabolism
COG ID: [COG0505] Carbamoylphosphate synthase small subunit
TIGRFAM ID: [TIGR01368] carbamoyl-phosphate synthase, small subunit
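
The coordinates and lengths listed above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive, and that the CDS ends with a single stop codon that is not counted in the protein length. The function names are illustrative and are not part of the source database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates (assumption)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, excluding one terminal stop codon (assumption)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(4716515, 4717708)
print(length_bp)                  # 1194
print(protein_length(length_bp))  # 397
```

Both results agree with the Gene Length and Protein Length entries for RPB_4142.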


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value: 0.164526
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 16
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACCAATT CAGCATCCAA TCCCGCCTGG CCGGACCACA AACCGACCGC GCTGCTCGTG 
CTCGCCGATG GCACCGTGTT CGAGGGCTTC GGCCTCGGCG CGGAGGGCCA CGCCGTCGGC
GAGGTCTGCT TCAACACCGC GATGACCGGC TATGAGGAGA TCCTCACCGA CCCGTCCTAT
GCCGGCCAGC TCATCACCTT CACCTTCCCG CATATCGGCA ATGTCGGCAC CAACGACGAG
GATATCGAGA CGGTGAACAT GGCGGCGACG CCGGGCGCGC GCGGCGTGAT CCTGCGCGAC
GCCATCACCG ACCCGTCGAA CTATCGCTCG TCGCGGCATC TCGACGGCTG GCTGAAAGCG
CGCGGCATCA TCGGCCTGTC GGGCATCGAC ACCCGCGCCC TGACCGCGCT GATCCGCGAC
AAGGGCATGC CGAATGCGGT GATCGCCCAT GCGCCGGACG GCAAGTTCGA CTTGCACGCG
CTGAAGGAAG AAGCCCGCGA ATGGCCCGGC CTCGAAGGCA TGGACCTGGT GCCGATGGTG
ACCTCGGCGC AGCGCTTCAG CTGGGACGAG ACGCCGTGGG CCTGGGGCGA AGGCTTCGGC
CGGCAGGACA ATCCGGAATT CCACGTCGTG GCGATCGACT ACGGCGTCAA GCGCAACATC
CTGCGGCTGC TCGCCGGCGA AGGCTGCAAG GTCACGGTGG TGCCGGCGAC CACCTCGGCC
GACGACATCC TGGCGCTGAA GCCGGACGGC GTGTTCCTGT CGAACGGCCC GGGCGACCCG
GCGGCAACCG GCAAATACGC GGTGCCGGTG ATCCAGCAGG TGATCAGTTC CGGCGTGCCG
ACCTTCGGCA TTTGTCTCGG CCACCAGATG CTCGGCCTCG CGCTCGGCGG CAAGACCGTG
AAGATGCATC AGGGCCACCA CGGCGCCAAT CATCCGGTCA AGGATCTCAC CACCGGCAAG
GTCGAGATCA CCTCGATGAA CCACGGCTTC GCGGTCGACA AGACCACGCT GCCGGCCAAT
GTGCAGCAGA CCCACGTGTC GCTGTTCGAC GACAGCAATT GCGGCATCGC GCTGTCCGAC
CGGCCGGTGT TCTCGGTGCA GTACCACCCG GAAGCCTCGC CGGGCCCGCG CGACTCGCAT
TATCTGTTCC GCCGGTTCTC GGACCTGATG CGGGCGAAGA AGAGCGCGGC GTAA
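
The GC content reported under Gene Information can presumably be recomputed from the gene sequence above. The following is a minimal sketch, assuming the sequence is pasted as plain text in the space-separated blocks of ten shown here. `gc_content` is a hypothetical helper name, and the short input is only the first block of the gene, used to keep the example small.

```python
def gc_content(dna: str) -> float:
    """Fraction of G and C bases in a DNA string; whitespace is ignored."""
    seq = "".join(dna.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# First ten bases of the gene sequence: 1 G and 2 C out of 10.
print(gc_content("ATGACCAATT"))  # 0.3
```

Passing the full 1194 bp sequence instead of the first block gives the value to compare against the GC content entry.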
 
Protein sequence
MTNSASNPAW PDHKPTALLV LADGTVFEGF GLGAEGHAVG EVCFNTAMTG YEEILTDPSY 
AGQLITFTFP HIGNVGTNDE DIETVNMAAT PGARGVILRD AITDPSNYRS SRHLDGWLKA
RGIIGLSGID TRALTALIRD KGMPNAVIAH APDGKFDLHA LKEEAREWPG LEGMDLVPMV
TSAQRFSWDE TPWAWGEGFG RQDNPEFHVV AIDYGVKRNI LRLLAGEGCK VTVVPATTSA
DDILALKPDG VFLSNGPGDP AATGKYAVPV IQQVISSGVP TFGICLGHQM LGLALGGKTV
KMHQGHHGAN HPVKDLTTGK VEITSMNHGF AVDKTTLPAN VQQTHVSLFD DSNCGIALSD
RPVFSVQYHP EASPGPRDSH YLFRRFSDLM RAKKSAA
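
The protein sequence can be checked against the gene sequence by translating the gene codon by codon. The following is a minimal sketch, not the database's own method. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code and differs only in which codons may act as start codons. The helper does not handle alternative start codons, which is acceptable here because this gene begins with ATG. `CODON_TABLE` and `translate` are illustrative names.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order (TTT, TTC, TTA, TTG, TCT, ...).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a CDS in frame, stopping at the first stop codon.

    Whitespace is ignored, so the blocks of ten can be pasted directly.
    Assumption: the first codon is ATG; alternative starts are not remapped to M.
    """
    seq = "".join(dna.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    residues = []
    for i in range(0, len(seq), 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First 30 bases of the gene sequence (first three blocks of ten).
print(translate("ATGACCAATT CAGCATCCAA TCCCGCCTGG"))  # MTNSASNPAW
```

The output matches the first ten residues of the protein sequence. Translating the full gene sequence and comparing the result with the protein sequence, after removing the spaces from both, extends the check to all 397 residues.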