Gene Rru_A1201 details

Gene Information

Locus tag: Rru_A1201
Symbol:
ID: 3833699
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodospirillum rubrum ATCC 11170
Kingdom: Bacteria
Replicon accession: NC_007643
Strand:
Start bp: 1418730
End bp: 1419791
Gene Length: 1062 bp
Protein Length: 353 aa
Translation table: 11
GC content: 65%
IMG OID: 637825290
Product: MaoC-like dehydratase
Protein accession: YP_426289
Protein GI: 83592537
COG category: [I] Lipid transport and metabolism
COG ID: [COG2030] Acyl dehydratase
TIGRFAM ID:
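
The coordinate and length fields above are consistent with one another, and the arithmetic can be sketched as follows. This is a minimal illustration, not part of the database record: it assumes the start and end positions are 1-based and inclusive, and that the CDS ends with a single stop codon that is not counted in the protein length. The function names are hypothetical.

```python
# Values copied from the Gene Information fields above.
START_BP = 1418730
END_BP = 1419791


def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residues encoded by a CDS, assuming its final codon is a stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 1062
print(protein_length(length_bp))  # 353
```

Both results match the Gene Length and Protein Length fields reported in the record.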


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGGCAAGA CCAATCCGGG TAATTTCTTC GAGGACTTCG CTCCCGGCCA GCAGCTTGTC 
CATGCCACGC CGCGCACGCT GACCGAGGGC GACGCCGCCC TTTACACAGC CCTTTACGGG
TCACGCTTCG CCGTGCAGTC CTCGGCCTCC TTCGCCATGG CGATCGGCTA TCCGGAAGCG
CCTTTGGACG ATCTGCTGGT CTTCCATGTG GTCTTTGGCA AGACCGTGCC CGACATCAGC
CTGAACGCCG TGGCCAATCT GGGCTATGCC CGGGGGCGGA TGGGGGTGCC GGTCTACCCG
GGCGATACCC TGCGCGCCCT CAGCCGGGTG ATCGGCGTCA AGGAGAACTC CAACGGCAAG
ACCGGGGTGG TCTATGTCAA TTCCGTCGGT CTGAATCAGA ACGACGAGGT GGTGGTCGAT
TTCATTCGCT GGGTTATGGT GCAAAAGCGC GATCCGGCCC ACCCGGCCCC CGAACCCGAG
ATCCCCGATC TGCCCGACCG CGTCGCGCCC GAGGATCTTT ACCTGCCCGA GGGCCTTGAT
CCGCGCGGCT ATGACCCGGA ACTGGCCGGT TCGGCCCATT TCTGGGAGGA TTACGCGGTC
GGCGAGCGCA TCGACCACGG CGATGGCATG ACCATCGAGG AAGCCGAGCA TATGATGGCG
ACGCGGCTGT GGCAGAATAC CGCCAAGGTC CATTTCAACC AATATGAGCA GGCCAAGGGG
CGGTTCGGCC GTCGGCTGGT CTATGGCGGC CATGTCATCA GCCTGGCCCG GGCGCTCAGT
TTCAACGGCC TGGGCAACGC CTTCCGCGTC GCGGCGATCA ATGCCGGCAG CCACTGCAAT
CCGACCTTCG CCGGCGATAC CATCCACGCT TGGTCGGAGG TGCTCGAGCG CGCCGATCTG
CCCGCCGACG AGGGGTTTGG CGCCCTGCGC CTGCGCACCA TCGCCACCAA GGACCGCGCC
TGCGCCGATT TTCCCTATCG CGACGAGCAG GGCCACATCC GCCCCGAGGT CGTGCTTGAT
CTCGACTATT GGGTGGTGAT GCCCAAGGCG GAAGGGAAAT GA
 
Protein sequence
MGKTNPGNFF EDFAPGQQLV HATPRTLTEG DAALYTALYG SRFAVQSSAS FAMAIGYPEA 
PLDDLLVFHV VFGKTVPDIS LNAVANLGYA RGRMGVPVYP GDTLRALSRV IGVKENSNGK
TGVVYVNSVG LNQNDEVVVD FIRWVMVQKR DPAHPAPEPE IPDLPDRVAP EDLYLPEGLD
PRGYDPELAG SAHFWEDYAV GERIDHGDGM TIEEAEHMMA TRLWQNTAKV HFNQYEQAKG
RFGRRLVYGG HVISLARALS FNGLGNAFRV AAINAGSHCN PTFAGDTIHA WSEVLERADL
PADEGFGALR LRTIATKDRA CADFPYRDEQ GHIRPEVVLD LDYWVVMPKA EGK
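
The sequences above are displayed in space-separated blocks of ten residues, so they need to be cleaned before any length or composition check. The sketch below shows one way to do that with the Python standard library only. The helper names `clean_sequence` and `gc_fraction` are hypothetical, and the example input is just the first displayed line of the gene sequence, not the full gene.

```python
def clean_sequence(text: str) -> str:
    """Remove the spaces and line breaks used to group residues into blocks."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First displayed line of the gene sequence, used as example input.
first_line = "ATGGGCAAGA CCAATCCGGG TAATTTCTTC GAGGACTTCG CTCCCGGCCA GCAGCTTGTC"
seq = clean_sequence(first_line)

print(len(seq))                # 60
print(seq.startswith("ATG"))   # True
print(round(gc_fraction(seq), 2))
```

Applying the same two helpers to the full gene sequence gives values that can be compared against the Gene Length and GC content fields of the record.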