Gene RPD_3939 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPD_3939 
Symbol 
ID4024455 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris BisB5 
KingdomBacteria 
Replicon accessionNC_007958 
Strand
Start bp4378341 
End bp4379585 
Gene Length1245 bp 
Protein Length414 aa 
Translation table11 
GC content60% 
IMG OID637964143 
ProductMobA/MobL protein 
Protein accessionYP_571061 
Protein GI91978402 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCAATAT TTAGCCTGAA CCACAGCTTC ATTGGCCGGA CTACCGACCC AAAAGGCTCG 
GCAAGTCTTT TTGCGCGCTA CATTACTCGG CCCCAAGCCT GCACCGAAGT CGTCGGCGAG
CGCATGCCGC TTGATCGCGC AGCCATGATG CGCTGGCTCG ACGGGCAGGA ACAAAAAGAC
CGCCGAATTG CCCGGGTGAT CGACAAGGTC GTGGTCGCCC TCCCCATCGA ATTAACCCAT
GAGCAAAATG TGGAATTGCT CCAGGGCTTC TGCGAGCGAA TGACCCAGGG CAGTGCCTCT
TGGGCCGCTG CGGTCCATGA CGGCCCTGAC GACCTGGACA ACCCTCACGC CCATATCATC
TTCCGGGACC GAGACTGGCA CACCGGCAAG CGGGTGATGC TGACAACGGA GCAAGGCAGC
ACCCAGCGCT TCAGGGATGC GTGGGAGGAC GAGGTCAACC GCGCGCTCGA ACATGAAGGC
TTTGAGACAC GGATCGATAA GCGCAGCCTA AAGGAGGACC AAGGCGTCGA CCGCGAGCCG
CAACTCCATG TCGGCGCTGC CTCGAAGTAT CTTCACGGCA AGGAGCATGA ATTCCGCAGC
GAAGAGAAGC AAACTACCCG CATGATTGAC GGCGTGCCCG TGCAAGTCAT CGTCAACTAC
CCCGCGATCG ACGAGGGCAA GACCCGCTTC CAGGAGAACG AGGATCGCAA GACGCGGAAT
GCCGAGCAGG AGCGTGCGAT GGCAGGCATC TTTGCGGCAG AGCGCGAACT CCACAATATC
TATGTGAAGG CGAGCAAATC CGGCACGCCA CCCGACGATC CGAGCGATCC GCTCGCCACC
ATCGTCGCCT TCCACATGCG GGATGCCACC CGGACTGAGG AGCAGCGCGA GAAGTATGAG
CTGTGGACGT GGCGCCCGCT CCAAAACAAC ATCGGCAAGC CCTTCGAGCC GGCGAGCAAG
CTCAAAGTTC CAAGCGACAT GGTCGCCGGA GCCGGCCTCT CCATCGTCGG CAAGATCGCC
AAGTCACTGG AATCAATTTT TGATGGACCC CAACGGGACC CAGAGGACAC GGAGCAAAAC
ATGGCCGAAA GGCAAGTCAC ACCACAGCAG CAGCGTGTCG AAGCTAACCT GCGTGAGCAG
GCGCAGCGCA CGCATGAAGC CGACATCGCC AAGTGGCGGC AGAAGGAGTT GGATGCGTAT
CTAGACCAGC GGGATAAAGA ACGGCACATG GACCGAGGTA GATGA
 
Protein sequence
MAIFSLNHSF IGRTTDPKGS ASLFARYITR PQACTEVVGE RMPLDRAAMM RWLDGQEQKD 
RRIARVIDKV VVALPIELTH EQNVELLQGF CERMTQGSAS WAAAVHDGPD DLDNPHAHII
FRDRDWHTGK RVMLTTEQGS TQRFRDAWED EVNRALEHEG FETRIDKRSL KEDQGVDREP
QLHVGAASKY LHGKEHEFRS EEKQTTRMID GVPVQVIVNY PAIDEGKTRF QENEDRKTRN
AEQERAMAGI FAAERELHNI YVKASKSGTP PDDPSDPLAT IVAFHMRDAT RTEEQREKYE
LWTWRPLQNN IGKPFEPASK LKVPSDMVAG AGLSIVGKIA KSLESIFDGP QRDPEDTEQN
MAERQVTPQQ QRVEANLREQ AQRTHEADIA KWRQKELDAY LDQRDKERHM DRGR