Gene Rru_A2014 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRru_A2014 
Symbol 
ID3835439 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodospirillum rubrum ATCC 11170 
KingdomBacteria 
Replicon accessionNC_007643 
Strand
Start bp2328282 
End bp2329886 
Gene Length1605 bp 
Protein Length534 aa 
Translation table11 
GC content64% 
IMG OID637826114 
Producttwin-arginine translocation pathway signal 
Protein accessionYP_427101 
Protein GI83593349 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0747] ABC-type dipeptide transport system, periplasmic component 
TIGRFAM ID[TIGR01409] Tat (twin-arginine translocation) pathway signal sequence 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.533924 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTCCCTGT CCCGCAGAAG TTTTCTTGCC TCAACCGCCC TTCTGGCGGG AGCGGCGGCC 
CTGCCGCGTT TCTCCTGGGC GCAGGGCGCC CCGGCGCCGG TCGCCGGCGG CGTGTTGACC
GCCCATCTCA GTTCCGAGCA GCGCATCCTC AATCCGGCGC TGCGCGCCTC GACGGGGGTC
TATGTCATCA CCAGCAAGAT CATCGAATCG CTGGTTGATC TTGGCCCCGA TGGCGCGCCG
ACGCCGGTTC TCGCCACGCG TTGGGAGGCC GCCGCCGATG GCAAATCGGT GACCTTCACC
CTGCGCGAGG GGGTGAAGTG GCACGACGGC AAGCCCTTCA CCTCGGCCGA CGTTCAGTAT
TCGGCGATGG AGCTGTGGAA GAAGCACCTG AATTACGGCA CCCAGCTTCA GCGCTATCTG
GAAGCCGTCG ACACCCCCGA CGCCACCACG GCGATCTTCC GCTATAGCCG GACCATGCCG
CTGCCCCTGT TGCTGCGCGC CCTGGCCGAT CTGGGCTATG TCGTGCCGCG CCATCTGTTC
GAGGGCACCA ACGTTCTGGA AAACCCGGCG AATACGGCGC CGATCGGTAC CGGTCCGTTC
AAATTCGTCG AATACCAGCG CGGCCAGTAT ATCGTCGCCG AGCGCAATCC CGAGTATTGG
CGGAAGGGCG AACCCTATCT CGACCGCGTC GTCTGGCGCT TCATCACCGA TAAATCGGCC
GCCAGCGCCG CCCTGGAAAC CGGGCAGGTG CAGATCAGCG CCTATACCCA GCTCGCCCTG
TCGGACATCG AACGCCTGGC CAAGGATTCG CGCTTCGAGG TGTCGTCGCG CGGCAACGAG
GCCAATTCGT TCAACAATAC GGTCGAGTTC AACCATCGCC GCAAGGAACT GGCCGATGTC
CGCGTCCGCC GCGCCATCGC CCATGCCGTC GATGTCGATT TCTTCGTCGA GAACTTCCTC
TATGGGCGCG GCAAGCGGGC GACCGGCTTC ATCCCGTCGA TTTCCCAGGC CTTTTATCCG
GGCGGCGCCT TCCCCTATCC GTTTGACACC AAGAAGGCCG AGGCCCTGCT CGACGAGGCC
GGCTATCCGC GCCAAAAGGG CGGCGAGCGC TTTTCGCTGC GCCTGTTGCC GATCCTGAAC
GGCGAGGATG TGCCCCAGTT CGCCACCTTC CTTCAGCAGT CGCTGGCCGA GGTCGGCATC
AAGGTCGAGA TCGTCCAGCT TGATGTCGCT GGCGCCCTGT CGGCGATCTA CAAGGATTGG
AACTTCGATC TGGCGACCGG CTGGCACCAG TATCGGGGCG ATCCGGCGGT GTCGACCACC
GTGTGGTTCC GCTCGGGTAG CCCCAAGGGC GCGCCGTGGA CCAACCAGTT CGGTTGGGAA
TCGGCCGAGG TCGATACGCT GATCGACGAT GCCGCCGCCG AGATCGATCC GGTCAAGCGC
AAGGCCCTTT ACGCCCAATT GGTCGATGTG ATCAACGAAG AGTTGCCGGT GTGGTTCGCC
ACCGAGCGGC AATTCGTGTC GGTCACCAAT AAAGTCGTCC AAAACCACCA TAATATCCCG
CGCTGGCCGT CCAGTGACTG GCATGATACC TGGATTGCCA AGTAG
 
Protein sequence
MSLSRRSFLA STALLAGAAA LPRFSWAQGA PAPVAGGVLT AHLSSEQRIL NPALRASTGV 
YVITSKIIES LVDLGPDGAP TPVLATRWEA AADGKSVTFT LREGVKWHDG KPFTSADVQY
SAMELWKKHL NYGTQLQRYL EAVDTPDATT AIFRYSRTMP LPLLLRALAD LGYVVPRHLF
EGTNVLENPA NTAPIGTGPF KFVEYQRGQY IVAERNPEYW RKGEPYLDRV VWRFITDKSA
ASAALETGQV QISAYTQLAL SDIERLAKDS RFEVSSRGNE ANSFNNTVEF NHRRKELADV
RVRRAIAHAV DVDFFVENFL YGRGKRATGF IPSISQAFYP GGAFPYPFDT KKAEALLDEA
GYPRQKGGER FSLRLLPILN GEDVPQFATF LQQSLAEVGI KVEIVQLDVA GALSAIYKDW
NFDLATGWHQ YRGDPAVSTT VWFRSGSPKG APWTNQFGWE SAEVDTLIDD AAAEIDPVKR
KALYAQLVDV INEELPVWFA TERQFVSVTN KVVQNHHNIP RWPSSDWHDT WIAK