Gene Rru_A3001 details

Gene Information

Locus tag: Rru_A3001
Symbol:
ID: 3836446
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodospirillum rubrum ATCC 11170
Kingdom: Bacteria
Replicon accession: NC_007643
Strand:
Start bp: 3456663
End bp: 3458474
Gene Length: 1812 bp
Protein Length: 603 aa
Translation table: 11
GC content: 64%
IMG OID: 637827115
Product: carbon starvation protein CstA
Protein accession: YP_428083
Protein GI: 83594331
COG category: [T] Signal transduction mechanisms
COG ID: [COG1966] Carbon starvation protein, predicted membrane protein
TIGRFAM ID:
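
The coordinate and length fields in the table above can be cross-checked with a little arithmetic. The sketch below is illustrative only: the variable names are hypothetical, and it assumes that the start and end coordinates are inclusive and that the CDS ends in a single stop codon that is not counted in the protein length.

```python
# Values copied from the Gene Information table.
start_bp = 3456663
end_bp = 3458474
gene_length_bp = 1812
protein_length_aa = 603

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span_bp = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and contributes no residue.
codon_count, leftover_bases = divmod(span_bp, 3)
coded_residues = codon_count - 1

print(f"span: {span_bp} bp, codons: {codon_count}, residues: {coded_residues}")
print("matches gene length:", span_bp == gene_length_bp)
print("matches protein length:", coded_residues == protein_length_aa)
```

Under these assumptions the span is 1812 bp, which divides into 604 codons, or 603 residues plus the stop codon, in agreement with the listed gene and protein lengths.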


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGACAACG CCCTAACGTT CGTGATCTCG ACACTGTGTA TCCTCGCGCT TTGTTACCGC 
TTCTATGGCG TGTTCTTTGT GCGGAAGGTA CTGCGTGCCG ATGATTCAGT CGTAACCCCC
TCGCATACGT TCGAGGACGG CAAGAATTAC GTACCCACGA AAAAGTGGGT CAACGCCGGT
CAGCATTTCG CCGCCATCGC CGCCGCCGGT CCGCTGGTCG GTCCGGTTCT GGCCGCCCAG
TTCGGCTACC TTCCCGGTTT CCTGTGGTTG CTCGTCGGCT GCGTCGTCGG CGGCGCCGTT
CACGACACGG TCGTGCTCTT CGCCTCGATG AAGCACAACG GCAAGTCGCT CTCCAATGTC
GCCAAGGCCG AATTGGGCCC GGTGGCGGGG TGGTGCACCG GTCTGGCGAT GCTGTTCATC
ATCACCATCA CCATGGCCGG CCTGTCGATG GTGGTCGTTC ACGCGCTGGA ACGGAACCCC
TGGGGAACCT TCGCCGTGTT CATGACCATT CCCATCGCCA TCGCCGTGGG TCTGTACGAA
CGCTTCACCG GCAACCACAA GGGCGCCACC TGGGTTGGCA TCATCGCCAT CATGGTCGCG
GTTCTGGCCG GTCCCTATAT CCAGGGCACG CTGCTTGGCG ATTGGCTGAC GCTGCGCGTC
GACTCCGTCG CCCTCGCCCT GCCGATCTAC GCCTTCTTCG CCAGCGCCCT GCCGGTGTGG
CTGCTGCTGA CGCCGCGCGG CTATCTGTCG AGCTTCATGA AGATCGGCGT CTTCGGCGCC
CTGGTCGTCG GCGTCGTCAT CATCAACCCG ACCATCCAGT TCCCCGCCCT GACCGATTTC
ATCCACGGCG GTGGTCCGGT CCTGGCCGGC CCGGTGTGGC CCTTCATCTC GATCACCATC
GCCTGCGGCG CCATCTCGGG GTTCCACGCC TTCATCGGCT CGGGCACCAC GCCCAAGCTG
GTCGATAAGT GGAGCGACAT CCGCCCCGTC GCCTTTGGCG CCATGCTCGC CGAATGCATG
GTCGCCGTGC TGGCCCTGGT CGCCGCCACC GCCCTGCACC CCGCCGATTA CTTCGCCATC
AACGCCAGCC CCGCCGCCTT CGCCAGCCTG GGCATGGAGG TCGTCGACCT GCCGCACCTC
AGCCAGGAGA TCGGCATGGA TCTGGCCGGC CGCACCGGCG GCGCCGTCAC CCTGGCCGTC
GGCATGACCT TCATCTTCAC CAAGCTGCCC TGGTTCGCCA CGCTGTCGTC CTACTTCTTC
CAGTTCGTCA TCATGTTCGA GGCGGTGTTC ATCCTGACCG CCGTCGACTC GGGTACCCGG
GTCGCCCGCT ACCTGATCCA GGACCTGGGT GGCGATCTTT ACGCCCCGCT CAAGCGCCTG
GACTGGATCC CCGGTTCGAT CGGCGCCAGC GTCGCCGCCT GCGCGCTGTG GGGCTACCTG
CTGACCTCGG GCGATATCAA CTCGGTCTGG GCGCTGTTTG GAGTGTCGAA CCAGCTGATG
GCCTCGATCG GCCTGATCAT CGGCGCCACG ATCATCCTGC GGGTCGCCGC CAAGCGCATC
TACATGCTGA CCTGCCTGAT CCCGCTGGCC TACCTGTTCG TCACGGTGAA TTACGCCGGC
TACTGGATGA TCGCCAACGT TTATCTGAAC ACCCAGGCCC GCGGCTACAA CCCGATCAAC
GCCGGCATCT CGGCCATCAT GATGGTGCTC GGCCTGATCA TCCTGATCAC CGCTTTTGGC
AAATGGAAAA CGATGCTCGC CCTGCCCAAG AGCCTGCGCT CGGGTGAGGT GCCGGCCCCG
TCGCTGCCCT GA
 
Protein sequence
MDNALTFVIS TLCILALCYR FYGVFFVRKV LRADDSVVTP SHTFEDGKNY VPTKKWVNAG 
QHFAAIAAAG PLVGPVLAAQ FGYLPGFLWL LVGCVVGGAV HDTVVLFASM KHNGKSLSNV
AKAELGPVAG WCTGLAMLFI ITITMAGLSM VVVHALERNP WGTFAVFMTI PIAIAVGLYE
RFTGNHKGAT WVGIIAIMVA VLAGPYIQGT LLGDWLTLRV DSVALALPIY AFFASALPVW
LLLTPRGYLS SFMKIGVFGA LVVGVVIINP TIQFPALTDF IHGGGPVLAG PVWPFISITI
ACGAISGFHA FIGSGTTPKL VDKWSDIRPV AFGAMLAECM VAVLALVAAT ALHPADYFAI
NASPAAFASL GMEVVDLPHL SQEIGMDLAG RTGGAVTLAV GMTFIFTKLP WFATLSSYFF
QFVIMFEAVF ILTAVDSGTR VARYLIQDLG GDLYAPLKRL DWIPGSIGAS VAACALWGYL
LTSGDINSVW ALFGVSNQLM ASIGLIIGAT IILRVAAKRI YMLTCLIPLA YLFVTVNYAG
YWMIANVYLN TQARGYNPIN AGISAIMMVL GLIILITAFG KWKTMLALPK SLRSGEVPAP
SLP
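
The gene and protein sequences above can be compared by translating the DNA codon by codon. The following is a minimal sketch, not the database's own method. The helper names (`clean`, `translate`, `gc_percent`) are hypothetical. It uses the standard codon assignments, which translation table 11 shares for internal codons, and it assumes the sequence is already in frame and contains only A, C, G, and T. Only the first and last rows of the gene sequence are included to keep the example short.

```python
from itertools import product

# Codon assignments in NCBI order (bases T, C, A, G; third position varies fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used in the listing and uppercase."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate in frame from position 0, stopping at the first stop codon.

    Raises KeyError if a codon contains a character other than A, C, G, or T.
    """
    dna = clean(dna)
    residues = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = clean(dna)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First and last rows of the gene sequence, copied from the listing.
first_row = "ATGGACAACG CCCTAACGTT CGTGATCTCG ACACTGTGTA TCCTCGCGCT TTGTTACCGC"
last_row = "TCGCTGCCCT GA"

print(translate(first_row))  # MDNALTFVISTLCILALCYR
print(translate(last_row))   # SLP
```

Each full row of the listing holds 60 bases (20 codons), so every row starts in frame and can be translated separately. Passing the complete gene sequence to `translate` and `gc_percent` would let a reader compare the results against the listed protein sequence and GC content.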