Gene RPD_3986 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPD_3986 
Symbol 
ID4024503 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris BisB5 
KingdomBacteria 
Replicon accessionNC_007958 
Strand
Start bp4433598 
End bp4434791 
Gene Length1194 bp 
Protein Length397 aa 
Translation table11 
GC content65% 
IMG OID637964189 
Productcarbamoyl phosphate synthase small subunit 
Protein accessionYP_571106 
Protein GI91978447 
COG category[E] Amino acid transport and metabolism
[F] Nucleotide transport and metabolism 
COG ID[COG0505] Carbamoylphosphate synthase small subunit 
TIGRFAM ID[TIGR01368] carbamoyl-phosphate synthase, small subunit 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCAATT CAGCATCCAA TCCCGCTTGG CCGGACCACA AGCCGACCGC GCTGCTCGTG 
CTTGCCGATG GCACCGTGTT CGAGGGCTTC GGCCTCGGCG CGGAGGGCCA CGCCGTCGGC
GAGGTCTGCT TCAACACCGC GATGACCGGC TATGAGGAGA TCCTCACCGA TCCGTCCTAT
GCCGGACAGC TCATCACCTT CACCTTCCCG CATATCGGCA ATGTCGGCGC CAATGACGAG
GATATCGAGA CGGTGAACAT GGCGGCGACG CCGGGCGCGC GTGGCGTGAT CCTGCGCGGC
GCGATCACCG ACCCGTCGAA CTATCGCTCG TCGCGCCATC TCGACGCCTG GCTGAAGGCG
CGCGGCATCA TCGGCCTGTC GGGGATCGAC ACCCGGGCGC TGACCGCGCT GATCCGCAGC
AAGGGTATGC CCAATGCGGT GATCGCGCAT TCGCCCAGCG GGCAGTTCGA TCTGCACGCG
CTGAAGGAAG AAGCCCGCGA ATGGCCCGGC CTCGAAGGCA TGGACCTGGT GCCGATGGTC
ACCTCGGCGC AGCGCTTCAC CTGGGACGAG ACGCCGTGGG CCTGGGGCGA AGGCTTCGGC
CGGCAGGACA AGCCCGAATT CAACGTGGTC GCGATCGATT ACGGCATCAA GCGCAATATC
CTGCGGCTGC TCGCAGGCGA AGGCTGCAAG GTGACCGTGG TGCCGGCGAC CACCTCGGCG
GCCGACATTC TGGCGATGAA GCCGGACGGC GTGTTCCTGT CGAACGGCCC GGGCGATCCG
GCCGCGACCG GCAAATACGC GGTGCCGGTG ATCCGCGAGG TGATCTCGTC GGGCGTGCCG
ACCTTCGGCA TCTGCCTCGG TCACCAGATG CTCGGCCTCG CGCTCGGCGG CAAGACCGTG
AAGATGCATC AGGGTCACCA TGGCGCCAAT CATCCGGTCA AGGATCTCAC CACCGGCAAG
GTCGAGATCA CCTCGATGAA TCACGGCTTC GCCGTCGACA AATCGACCCT GCCGGACAAC
GTCACGCAGA CGCATATTTC GCTGTTCGAT GACAGCAATT GCGGCATCGC GCTCGCGGAC
AAGCCGGTGT TCTCGGTGCA GTATCACCCC GAGGCCTCGC CCGGCCCGCA AGACTCGCAC
TATCTGTTCC GCCGCTTCTC GGACCTGATG CGGGCGAACA AGAGCGCGGC GTAA
 
Protein sequence
MTNSASNPAW PDHKPTALLV LADGTVFEGF GLGAEGHAVG EVCFNTAMTG YEEILTDPSY 
AGQLITFTFP HIGNVGANDE DIETVNMAAT PGARGVILRG AITDPSNYRS SRHLDAWLKA
RGIIGLSGID TRALTALIRS KGMPNAVIAH SPSGQFDLHA LKEEAREWPG LEGMDLVPMV
TSAQRFTWDE TPWAWGEGFG RQDKPEFNVV AIDYGIKRNI LRLLAGEGCK VTVVPATTSA
ADILAMKPDG VFLSNGPGDP AATGKYAVPV IREVISSGVP TFGICLGHQM LGLALGGKTV
KMHQGHHGAN HPVKDLTTGK VEITSMNHGF AVDKSTLPDN VTQTHISLFD DSNCGIALAD
KPVFSVQYHP EASPGPQDSH YLFRRFSDLM RANKSAA