Gene Rleg2_5020 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRleg2_5020 
Symbol 
ID6978114 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhizobium leguminosarum bv. trifolii WSM2304 
KingdomBacteria 
Replicon accessionNC_011368 
Strand
Start bp665939 
End bp667510 
Gene Length1572 bp 
Protein Length523 aa 
Translation table11 
GC content60% 
IMG OID643394165 
Productcytochrome c oxidase accessory protein CcoG 
Protein accessionYP_002278983 
Protein GI209547065 
COG category[C] Energy production and conversion 
COG ID[COG0348] Polyferredoxin 
TIGRFAM ID[TIGR02745] cytochrome c oxidase accessory protein FixG 


Plasmid Coverage information

Num covering plasmid clones25 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones33 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAATATCT ACACAGCACC AGATCCCAGC CCGGTCGAAC GGATAGACGC GGAACCCGTG 
AACGCCCGCG GCAAACGTGA ACCACTTTAC GCTGCCCGAA AGAAGGTATT TCCCAAGCGC
GCCGAAGGGC GCTTTCGGCG TTTCAAGTGG ATCGTGATGA TGATCACACT TGGGATCTAC
TACCTGGCTC CTTGGATTCG TTGGGATCGG GGACCCTACG CGCCGGATCA GGCTATCCTG
GTGGACCTGG CGTCACGGCG CTTTTTCTTC TTCTTCATCG AGATCTGGCC GCAGGAATTC
TATTACGTAG CGGGCCTGCT TGTCATGGCA GGCTTCGGTC TCTTTCTCGT CACATCTGCC
GTCGGGCGGG CGTGGTGCGG ATATGCTTGT CCGCAAACCG TCTGGGTCGA TCTGTTCCTT
GTCGTCGAGC GGGCGATCGA GGGAGACCGC AACGCCCGCA TGAAGCTTGA TGCCGGGCCG
TGGACTTTCG ACAAGGCAAG AAAAAGGGTC GTCAAGCACG CAATCTGGCT GATGATCGGC
GTCGCGACAG GGGGGGCTTG GATCTTCTAT TTCGCCGACG CGCCGACGCT TGTCGTCTCG
CTTTTCACAG CGAAGGCACC TGTCGTGGCC TACGCGACCG TCGCTACTCT CACTGCGACC
ACCTACGTCC TGGGCGGGCT GATGCGCGAA CAGGTCTGTA CGTACATGTG CCCGTGGCCG
CGAATCCAGG GCGCGATGCT CGATGAAAAT TCTCTTGTCG TTACGTACAA CGATTGGCGC
GGCGAGGCCC GCTCACGCCA CGCCAAGAAG ATCTTGGCTG CCGGTCAATC GGTCGGCGAT
TGTGTCGACT GCAACGCCTG CGTGGCCGTC TGCCCCATGG GTATCGACAT CCGCGACGGA
CAGCAGATGG AGTGTATTAC CTGCGCCTTG TGCATCGACG CCTGCGATGG CGTCATGGAC
AAGGTGGGAA AGCCGCGTGG CCTCATAGCG TATGCGACGC TCAGCGAATA CGCGGCCAAC
ATGGCCATCG CGACGGACGA CGGAAGAACG CCGGTCCAGC CCACGAAGGT GCGGAATGCC
GACGGTTCGT TCGTGGAGGC CATCCGGCAT TTCGACTGGC GCATCATATT CCGTCCACGC
GTACTGTTTT ATGCAGTCAC CTGGCTTTTG ATCGGCGTCG CCATGGTGGT CCATCTCGCA
ATGCGCGAGC GACTGGAGCT CAACGTGGTT CACGATCGAA ACCCGCAATA CGTCTTGGAG
AGCGACGGCT CCATCCGCAA CGGCTACACG CTGCGAATAC TGAATATGGT TCCCGCTCCT
CGCAGGATCG AGCTCAGCCT TTTAGGCCTC GACGACGGCG CCAGCATGCG CATTCCTGAA
CTGACCAAGC AGGACGCGCG GACCTTTATC ATCGAAGCTG CGCCGGACGT CGCGACAACA
GTCAAGGTCT TCGTGACGAG CAAACAGTCG ACCGCCGCGA TCAGCGAGTT CCTTTTTGCC
ATCGAGGACT CCGGACATTC GGATAGGGCG ACCTATCGCG CCGCATTCAA CACACCGGGA
GATGCCAAAT GA
 
Protein sequence
MNIYTAPDPS PVERIDAEPV NARGKREPLY AARKKVFPKR AEGRFRRFKW IVMMITLGIY 
YLAPWIRWDR GPYAPDQAIL VDLASRRFFF FFIEIWPQEF YYVAGLLVMA GFGLFLVTSA
VGRAWCGYAC PQTVWVDLFL VVERAIEGDR NARMKLDAGP WTFDKARKRV VKHAIWLMIG
VATGGAWIFY FADAPTLVVS LFTAKAPVVA YATVATLTAT TYVLGGLMRE QVCTYMCPWP
RIQGAMLDEN SLVVTYNDWR GEARSRHAKK ILAAGQSVGD CVDCNACVAV CPMGIDIRDG
QQMECITCAL CIDACDGVMD KVGKPRGLIA YATLSEYAAN MAIATDDGRT PVQPTKVRNA
DGSFVEAIRH FDWRIIFRPR VLFYAVTWLL IGVAMVVHLA MRERLELNVV HDRNPQYVLE
SDGSIRNGYT LRILNMVPAP RRIELSLLGL DDGASMRIPE LTKQDARTFI IEAAPDVATT
VKVFVTSKQS TAAISEFLFA IEDSGHSDRA TYRAAFNTPG DAK