Gene RPD_1085 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPD_1085 
Symbol 
ID4021561 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris BisB5 
KingdomBacteria 
Replicon accessionNC_007958 
Strand
Start bp1235029 
End bp1236252 
Gene Length1224 bp 
Protein Length407 aa 
Translation table11 
GC content67% 
IMG OID637961277 
Productaminotransferase, class V 
Protein accessionYP_568224 
Protein GI91975565 
COG category[E] Amino acid transport and metabolism 
COG ID[COG1104] Cysteine sulfinate desulfinase/cysteine desulfurase and related enzymes 
TIGRFAM ID[TIGR03402] cysteine desulfurase NifS 


Plasmid Coverage information

Num covering plasmid clones27 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones22 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAACCAGA CCCGCGTCCC GCCGCAGCCG GCGAATATCT GCTATCTCGA CGCCAACGCC 
ACCACGCGCA CCGATCCGCG GGTGGTTGAT GCGATGCTAC CGTTCTTCTC CGGCTATTTC
GGCAATCCGT CGTCGAAGCA CGCGCTCGGC GCCCACGCCG CGATGGCGGT GAGGCGTGCG
CGCGAGCAAT TGCAGGCGCT GCTCGGCGCT GCGCATCCGC ACGAACTGAT CTTCACCTCG
GGCGGCACTG AAAGCGCCAA TGCGGCGATC CTGTCGGCGC TGGAAGTAGC ACCCCGGCGG
CGCGAGATCA TCACCACCGC GGTCGAACAC CCCGCCGTTC TGTCGCTGTG CGCCTGGCTC
GAGAAGACCA AGGGCATCCG CGTCCACGTC ATCCCGGTCG ATCGTCACGG CCACCTCGAC
ATCGCCGCCT ATCGCGAGGC GCTGTCGGAT CGCGTCGCGT TGGTCTCGAT GATGTGGGCG
AATAACGAGA CCGGCGTGAT CGCACCGGTC GCCGATCTCG CCGAGCTTGC GAAGGATGTC
GGAGCGCTGT TTCACACCGA TGCGGTGCAG GCGGTCGGCA AATGTCCGAT CGACCTGCAA
TCCACCGCGA TCGACATGCT GTCGCTGTCC GGACACAAGT TGCATGGGCC GAAGGGCATC
GGCGCGCTGT ATGTCCGCAC TGGCGTCGGC TTCAAGCCGC AGATCAAGGG TGGCCAGCAC
GAGCGCGGCC GCCGCGCCGG CACCGAGAAC GTACCGGGCA TCGTCGGCCT CGGCATGGCC
GCAGAACTCG CCGCCGAAGC AATGGCCGAT GAGGACATCC GGGTGCGAGG CTTGCGCGAC
CGGCTGGAGC GCGGGGTCCT CGCGCAGGTC GACCATTGCG TGGCGATCGG CGCCCGGGCC
GAGCGGCTGC CCAACACGTC GAACATCGCG TTCTCCTTCA TCGACAGCGA GGCCATCGTC
ACGCTGCTCG ACCGCGCCGG CATTGCCGCC TCGATGGGTT CGGCTTGTTC GACCGGCTCG
TTCGAGCCAT CGCATGTGCT GATGGCCATG AAAATCGCGG AGGACACCGT CCGCGGCGGC
GTGCGGTTCT CACTGTCGCG CGACAACACC GACGACGACG TCGATCGGGC GCTCGCCGTG
ATCCCGGGCG TGGTCGCGAA GCTGCGCGCA ATCTCGCCGT TCGATGCCGA TGCAGGACCA
TTGCTCGGGC ATTCCCATGC TTGA
 
Protein sequence
MNQTRVPPQP ANICYLDANA TTRTDPRVVD AMLPFFSGYF GNPSSKHALG AHAAMAVRRA 
REQLQALLGA AHPHELIFTS GGTESANAAI LSALEVAPRR REIITTAVEH PAVLSLCAWL
EKTKGIRVHV IPVDRHGHLD IAAYREALSD RVALVSMMWA NNETGVIAPV ADLAELAKDV
GALFHTDAVQ AVGKCPIDLQ STAIDMLSLS GHKLHGPKGI GALYVRTGVG FKPQIKGGQH
ERGRRAGTEN VPGIVGLGMA AELAAEAMAD EDIRVRGLRD RLERGVLAQV DHCVAIGARA
ERLPNTSNIA FSFIDSEAIV TLLDRAGIAA SMGSACSTGS FEPSHVLMAM KIAEDTVRGG
VRFSLSRDNT DDDVDRALAV IPGVVAKLRA ISPFDADAGP LLGHSHA