Gene Rru_A2008 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRru_A2008 
Symbol 
ID3835433 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodospirillum rubrum ATCC 11170 
KingdomBacteria 
Replicon accessionNC_007643 
Strand
Start bp2320063 
End bp2321928 
Gene Length1866 bp 
Protein Length621 aa 
Translation table11 
GC content67% 
IMG OID637826108 
Productthiamine biosynthesis protein ThiC 
Protein accessionYP_427095 
Protein GI83593343 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG0422] Thiamine biosynthesis protein ThiC 
TIGRFAM ID[TIGR00190] thiamine biosynthesis protein ThiC 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.578808 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACTGCGC CGTTTTTATC CTCCCTATCC CCCACCAGTC CGCTGGCCAG CGCCACGGCG 
CCCTTTCCCG GCTCGCGCAA GGTTTATGCC CGCCCGGCCG ACGCCCCCCA TTTGCGCGTG
CCCTTTCGCG AGATCATCCT GTCCGACCCG GGCGAAGCCC CGGTGCGGGT GGCCGACCCT
TCGGGCCCCT ATAGCGATCC CGAGGCGACG ATCGATCTGC GCCAGGGATT GGCCCGCCAT
CGGGCGTCTT GGGCCAGCGC CCGGGGCAAT TCCACGGTCA CGGCCGGCCG CCCCGCCCCG
AGTGAAGGCG ATTTCGAGGC CTTTCCCCTG ACCTACGCCC CCTTGCGCCG CCGCGATGAG
ACCCCCTTCA CCCAACTGGA ATACGCCCGC GCCGGAGTGA TCACCGACGA GATGATCTAT
GTGGCGACGC GGGAGAACCT GGGCCGGGAC AGCGCCGTGG CCGGGGCCTG CGCGCGGTTG
GCGGGCGGCG AGGCCTTTGG CGCCGCCCTG CCCGCCCATG TCACGCCCGA GTTCGTCCGC
GCCGAGATCG CCGCCGGGCG GGCGATCATC CCGGCCAACA TCAACCATCC CGAGCTTGAA
CCGACGATCA TCGGCCGCAA CTTCCTGGTC AAGGTCAACG CCAATATCGG CAATTCGGCC
CTGGGATCGT CGATCGAGGA CGAGGTGGCA AAGCTGGTCT GGGCCATCCG CTGGGGGGCC
GACACGGTGA TGGATCTGTC GACGGGCAAG GCCATCCACG CCACCCGCGA ATGGATCTTG
CGCAACAGTC CGGTGCCCAT CGGCACCGTT CCCCTGTATC AGGCTTTGGA AAAGGTGGGC
GGCGACGCCA CCCGCCTTGA CTGGGCGGTG TTCGAAGACA CCCTGATCGA ACAATGCGAA
CAGGGCGTTG ATTATTTCAC CATCCATGCC GGGGTGCGGC TGGCCCATAT TCCGCTGACC
GCGTCGCGCA CCACCGGCAT CGTCAGTCGC GGCGGTTCGA TCCTGGCCAA ATGGTGCTTG
TCCCACCACC GCGAGAATTT CCTTTATGAG CGCTTCGCCG ATATCTGCGC CATCCTGCGC
CGCTATGACG TGGCCTTTTC GCTAGGCGAC GGCCTGCGCC CGGGATCGGT GGCCGATGCC
AATGACGCCG CCCAATTCGC CGAACTCGAC ACTTTGGGCG CCTTGACGGC GGTGGCCTGG
GAGCACGGCT GTCAGGTGAT GGTCGAAGGC CCGGGCCATG TGCCCATGCA CAAGATCAAG
GCCAATATGG ACCGCCAACT CGCCACCTGC GGCGAGGCGC CGTTCTATAC CCTGGGGCCC
TTGACCACCG ATATCGCGCC CGGCCACGAC CACATCACCT CGGCGATCGG CGCGGCGATG
ATCGGCTGGT TCGGCACCGC CATGCTGTGC TACGTCACCC CCAAGGAACA CCTGGGGCTG
CCCGATCGCG CCGACGTCAA GGCCGGGGTG ATCGCCTATA AGCTGGCCGC CCATGCCGCC
GATATCGCCA AGGGGCATCC CGCCGCCCAG CTGCGCGACG ACGCCATCAG CAGGGCGCGC
TTCGATTTCC GCTGGAGCGA CCAGTTCAAC CTCGGCCTCG ATCCCGAAGG CGCGCGGGCC
TTCCACGACG AAACCCTGCC CCATGCCGCC CATAAGACGG CGCATTTCTG TTCGATGTGC
GGGCCGAAGT TCTGTTCGAT GAAGATCAGC CATGATATCC GCGACGGCGC CCTCGAGGGG
GCGGACGCCC TGACCCAGGC CGGACTGGAC CAGATGAGTG CGACCTTCCG CGCCAGCGGC
GGCGAGGTCC ACCTTGATGC GCAAGCTCTC GACGCCCTCG CCTGGGAGGG GAAACCCGCG
CGATAA
 
Protein sequence
MTAPFLSSLS PTSPLASATA PFPGSRKVYA RPADAPHLRV PFREIILSDP GEAPVRVADP 
SGPYSDPEAT IDLRQGLARH RASWASARGN STVTAGRPAP SEGDFEAFPL TYAPLRRRDE
TPFTQLEYAR AGVITDEMIY VATRENLGRD SAVAGACARL AGGEAFGAAL PAHVTPEFVR
AEIAAGRAII PANINHPELE PTIIGRNFLV KVNANIGNSA LGSSIEDEVA KLVWAIRWGA
DTVMDLSTGK AIHATREWIL RNSPVPIGTV PLYQALEKVG GDATRLDWAV FEDTLIEQCE
QGVDYFTIHA GVRLAHIPLT ASRTTGIVSR GGSILAKWCL SHHRENFLYE RFADICAILR
RYDVAFSLGD GLRPGSVADA NDAAQFAELD TLGALTAVAW EHGCQVMVEG PGHVPMHKIK
ANMDRQLATC GEAPFYTLGP LTTDIAPGHD HITSAIGAAM IGWFGTAMLC YVTPKEHLGL
PDRADVKAGV IAYKLAAHAA DIAKGHPAAQ LRDDAISRAR FDFRWSDQFN LGLDPEGARA
FHDETLPHAA HKTAHFCSMC GPKFCSMKIS HDIRDGALEG ADALTQAGLD QMSATFRASG
GEVHLDAQAL DALAWEGKPA R