Gene Rru_A2475 details


Gene Information

Locus tag: Rru_A2475
Symbol:
ID: 3835909
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodospirillum rubrum ATCC 11170
Kingdom: Bacteria
Replicon accession: NC_007643
Strand:
Start bp: 2875245
End bp: 2876444
Gene Length: 1200 bp
Protein Length: 399 aa
Translation table: 11
GC content: 64%
IMG OID: 637826583
Product: glycine betaine/L-proline transport ATP-binding subunit
Protein accession: YP_427562
Protein GI: 83593810
COG category: [E] Amino acid transport and metabolism
COG ID: [COG4175] ABC-type proline/glycine betaine transport system, ATPase component
TIGRFAM ID: [TIGR01186] glycine betaine/L-proline transport ATP binding subunit
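
The Start bp, End bp, Gene Length, and Protein Length fields above are related by simple arithmetic. The following is a minimal sketch of that relationship, assuming the coordinates are 1-based and inclusive and that the CDS length includes the stop codon (both assumptions are consistent with the values in this record, but are not stated by it). The function names are hypothetical and used only for illustration.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Number of residues encoded by a CDS whose length includes the stop codon (assumed)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1  # subtract the stop codon


# Values taken from the record above.
start_bp, end_bp = 2875245, 2876444

length_bp = gene_length(start_bp, end_bp)
print(length_bp)                  # 1200
print(protein_length(length_bp))  # 399
```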


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCCAAACA AGATTGCCGT CAAAGGCCTT TACAAGGTTT TTGGCGATAC CCCCGAGACG 
GCGATCGCCC TGTCGCGATC GGGAGCCGAC CGGAGTGCCA TTCAGCGCAA GACCGGTATG
ACGATGGGGG TTTGCGACGC CTCGTTCACC ATCCAAGAGG GTGAGATCTT CGTGATCATG
GGGCTCTCGG GATCGGGGAA ATCCACCCTG GTCCGCCTGC TCAACCGGTT GATCGAACCC
AGCGCCGGCA GCATCGTCAT CGACGGGATC GACATCGCCG GGCTTGCCGA AGCCGATCTC
AACGCCTTTC GCCACCGTCA CCTCAGCATG GTCTTCCAGT CCTTCGCGCT GATGCCGCAT
TTGACGGTTT TGGGAAACGC CGCCTTCGGT CTGGAGCTGT CGGGTGTCAA ACGCCCCGAA
CGCGAAGCCC GCGCCCGCGC CGCCCTCACC CAGGTCGGAT TGGCCGATTG GGCGAACCAG
TTTCCCACCG CCTTATCGGG GGGCATGCGC CAGCGCGTCG GCCTCGCCCG GGCCCTGGCC
ACCGATCCCG ATGTTTTGCT GATGGACGAG GCTTTCAGCG CCCTTGACCC GCTGAACCGC
GCCGAAATGC AAGACCAGTT GCTCAGCCTG CAAGACGAGC AGCGGCGCAC CATCATTTTC
ATCACCCACG ACCTCGACGA GGCCATGCGC ATCGGCGATC GCATCGCCAT CATGGAAGCC
GGGCGCATCG TGCAGATCGG CACCCCCGAC GAGATCTTGA ACAATCCGGC CGATGACTAT
GTCCGGTCGT TTTTCCGCGG GGTCAATCTG GGGGCGGTCT TGTCGGCCGG CTCCATCGCC
CGCAAGCGGC AGGTCACGAT GATCGACCGC GAAGGCGGCC TGCGCGTCGC CCTTCAGCGT
CTGGCCGAGG CCGATCGCGA TTACGCCTAT GTGATCGACA AGGCGCAACG CTACCACGGC
GTGGTCTCGG TCGCCTCGCT CGAAGCCCTC CGCCACAAGC CCGGCGCCGC CCTGCACGAC
GCCTTCCTTG GTGATGTCTC GCCGGTGCCG GCCGCGACGA TCCTGTCAGA GATCATCACC
CAGGTCGCCC AGGCGCCTTG CGGTCTTCCC GTCATCGACG ACGCCGGTCG CTACCTTGGC
GTCATCTCGC GGGCCCTTCT TCTTGAAACC CTCGACCGGG AGACCCTGAC CAATGGCTGA
 
Protein sequence
MPNKIAVKGL YKVFGDTPET AIALSRSGAD RSAIQRKTGM TMGVCDASFT IQEGEIFVIM 
GLSGSGKSTL VRLLNRLIEP SAGSIVIDGI DIAGLAEADL NAFRHRHLSM VFQSFALMPH
LTVLGNAAFG LELSGVKRPE REARARAALT QVGLADWANQ FPTALSGGMR QRVGLARALA
TDPDVLLMDE AFSALDPLNR AEMQDQLLSL QDEQRRTIIF ITHDLDEAMR IGDRIAIMEA
GRIVQIGTPD EILNNPADDY VRSFFRGVNL GAVLSAGSIA RKRQVTMIDR EGGLRVALQR
LAEADRDYAY VIDKAQRYHG VVSVASLEAL RHKPGAALHD AFLGDVSPVP AATILSEIIT
QVAQAPCGLP VIDDAGRYLG VISRALLLET LDRETLTNG
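
The gene and protein sequences above are printed in space-separated blocks of ten characters. The following is a rough sketch of how such a block could be cleaned, its GC percentage computed, and the DNA translated. It assumes the codon assignments of translation table 11, which are the same as those of the standard code (table 11 differs in its permitted start codons, which this sketch does not handle). The helper names `clean`, `gc_percent`, and `translate` are hypothetical, and only the first row of the gene sequence is used as input.

```python
from itertools import product

# Codon table in the conventional T, C, A, G ordering.
# Table 11 uses the same codon-to-amino-acid assignments as the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq_block: str) -> str:
    """Remove spaces and line breaks from a blocked sequence and upper-case it."""
    return "".join(seq_block.split()).upper()


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in a cleaned DNA string."""
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


def translate(dna: str) -> str:
    """Translate a cleaned DNA string in frame 1, stopping at the first stop codon."""
    residues = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First 30 bases of the gene sequence shown above.
first_row = "ATGCCAAACA AGATTGCCGT CAAAGGCCTT"
dna = clean(first_row)

print(translate(dna))   # MPNKIAVKGL
print(gc_percent(dna))  # GC percentage of this 30-base fragment only
```

Running the same functions on the full gene sequence should allow the result to be compared against the Protein sequence and GC content fields of this record.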