Gene Rru_A3040 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRru_A3040 
Symbol 
ID3836486 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodospirillum rubrum ATCC 11170 
KingdomBacteria 
Replicon accessionNC_007643 
Strand
Start bp3503624 
End bp3504769 
Gene Length1146 bp 
Protein Length381 aa 
Translation table11 
GC content68% 
IMG OID637827155 
Productaminotransferase, class V 
Protein accessionYP_428122 
Protein GI83594370 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0520] Selenocysteine lyase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.130075 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCTTTCGT GCCAGCGTGA CCTCTTTGAG ATCCCCCAGG ATGTGGCTTA CCTGAACGCC 
GCCTTCATGG GCCCCTTGAT GACCGAGGTG GTCGCCGCCG GCCATGCCGG GGTGGCGGCC
AAGGCCCGGC CCTGGGAAGT GGCGATCGAC GCCTTCTTCG ATCCGGTCGA AAAGGCGCGC
GGCCTGTATG CCGGGCTGAC CGGCGCCGAT GTCGAGGGCA TCGCCGTCGT TCCCTCAGTG
TCCTATGGCA TCGCCGTGGC GGCGGCCAAT CTGCCGCTGG CGGCGGGGAA GCGGGTGCTG
GTTCTGGAAG AGCAGTTCCC TTCCAATCTT TATTCGTGGC GTCGTCTGGC GACCGAGAAC
AACGCCGTCG TCCAGGTTGT CGCCCGCCCG GCCAACGGCA ATTGGACCGA GGCCCTGCTC
GGCGCCATCA AGCCGGGGGT CGCCATCGTC GCCTGTCCCC AGGCCCATTG GTCGGATGGC
TGCAAGATCG ATCTGGTCGC CATTGGTGCC GCCTGCCGTG CGGTCGGGGC GGCCCTGGTC
ATCGACGGCA CCCAGTCCTT TGGCGCCATG CCCTTCGACA CGGCGGCGGT CGATCCGGAT
TTCGCCGTCG CCGCCACCTA TAAGTGGCTG CTTGGCCCCT ATTCGCTGGG GTTCCTCTAT
GTGGCGCCGC GCCATCGCAA CGGTCAGCCG CTGGAAGAGG GCTGGATCTG CCGCGAAGGT
AGCCGGGATT TTTCGCGGCT GGTCGATTAC ACCGAGAGCA TGGACGCCGG GGCGCGGCGT
TTCGATGTGG GCGAACGCTC GAACTTCGCC CTGATGCCGA TGGCGATCGC CGCCATGGAG
CGCCTGACCG CCTGGACGCC CGCCGCCGTA TCGGCCTATG CCGGGCGGCT GACCGACCGG
GTGGTCGCCG AAACGGCGGC CTGGGGCTGC ACCGCCGCCC CCGCTAGCGC CCGCTCGCCC
CATTTGCTGG GGTTGGGTTT GCCGGAAGGG GTTGACGCCA AGGCCTTGGC GACCCGGCTG
GCCGCCGCCC AGGTCAGCGT CAGCGTGCGC GGCAGCCGCC TGCGCATCTC GCCCCACGTC
TATAACACCG ACGCCGATGT CGACCGCCTG CTTGGCGTTC TTGAAGACGC GCTGGCAAAG
GCCTGA
 
Protein sequence
MLSCQRDLFE IPQDVAYLNA AFMGPLMTEV VAAGHAGVAA KARPWEVAID AFFDPVEKAR 
GLYAGLTGAD VEGIAVVPSV SYGIAVAAAN LPLAAGKRVL VLEEQFPSNL YSWRRLATEN
NAVVQVVARP ANGNWTEALL GAIKPGVAIV ACPQAHWSDG CKIDLVAIGA ACRAVGAALV
IDGTQSFGAM PFDTAAVDPD FAVAATYKWL LGPYSLGFLY VAPRHRNGQP LEEGWICREG
SRDFSRLVDY TESMDAGARR FDVGERSNFA LMPMAIAAME RLTAWTPAAV SAYAGRLTDR
VVAETAAWGC TAAPASARSP HLLGLGLPEG VDAKALATRL AAAQVSVSVR GSRLRISPHV
YNTDADVDRL LGVLEDALAK A