Gene Rleg_1069 details

Gene Information

Locus tag: Rleg_1069
Symbol:
ID: 8015517
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhizobium leguminosarum bv. trifolii WSM1325
Kingdom: Bacteria
Replicon accession: NC_012850
Strand:
Start bp: 1042404
End bp: 1043612
Gene Length: 1209 bp
Protein Length: 402 aa
Translation table: 11
GC content: 66%
IMG OID: 644823652
Product: cytochrome c-type biogenesis protein CcmI
Protein accession: YP_002974903
Protein GI: 241203807
COG category: [O] Posttranslational modification, protein turnover, chaperones
COG ID: [COG4235] Cytochrome c biogenesis factor
TIGRFAM ID: [TIGR03142] cytochrome c-type biogenesis protein CcmI
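
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends in a single stop codon; the function names are illustrative and not part of any database API.

```python
# Values copied from the Gene Information fields.
START_BP = 1042404
END_BP = 1043612


def gene_length(start, end):
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(gene_len):
    """Residues encoded by an unspliced CDS, not counting the stop codon."""
    codons, remainder = divmod(gene_len, 3)
    if remainder:
        raise ValueError("CDS length is not a multiple of 3")
    return codons - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 1209
print(protein_length(length_bp))  # 402
```

Both results agree with the listed Gene Length (1209 bp) and Protein Length (402 aa).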


Plasmid Coverage information

Num covering plasmid clones: 29
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 27
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAGGCGC GCTTCACGTA TTTGGGCGAC ATGCTGTTCT GGATTCTCGT TGCCGCTTTG 
ACGGCAGCCC TCGCCGTCAT CCTGCTCTAC CCCCTTCTGC GCGGAGCGAA GGCGGCGGAT
AATATCCGTG CGGGCGAGGC GGCGGTCTAT CGCGACCAGT TGCGCGAACT CGACCGCGAT
CTCGATGGCG GGCTGATCAC CCCGGAGGAG GCTGATTATG CGAGGGCGGA AATCGGCCGG
CGGCTGATCG CCGTCTCTGC CGACGAGCCG GCTGAGACGC CGAAACCCGC ACGGCATCAC
CGTTTCACCG AAGCCTTCGT TCTCCTGGTC CTGCCGGTTC TCGGGCTCTG TCTTTATTTG
ACGACGGGCA GGCCGGACCT GCCCTCGCAG CCGCTGGAGG CGCGGCTGGA AAACCCTGGC
AACGACGTGG CGGTGCTGAT CGCCAAGGCG GAACGGCACC TGGCTGAGAA GCCCGATGAC
GGCAAGGGCT GGGACGTGCT GGCGCCGATC TATTTCCGCA CGATGCGCGT CAACGATGCG
CAACTGGCCT ATCGCAATGC CATCCGGCTT CTCGGCCCGA GCCCGGTCCG GCTCGATGGC
CTTGCCGAGA CGCTGATGGC GGTCTCCGAC GGCGTGGTGA CGGAGGAGGC GCGGCAGGTG
CTGGAACAAT CGCTGACGCT CGAACCTGAC AATCCGCGTG CCCGCTTCTA CATCGCCCTC
AGCATGGAGC AGGCGGGACG GCCGAACGAG GCGCGCCAGG CCTTCGAGGC GCTGGCAAAA
CAATCGCCAT CAGATGCGCC CTGGCTGCCG CTGGTCAACC AGCATATCGC CATGAACGGC
GGCGCGCCGG CCGGGACCAA TCCGGCTGCC CCAGGCGCCG ATCCGGCTGC CCCGGGCGCC
CGTCCGGCTG CCCCCGGCAA TCCCACGCAG CAAGATGTGG CGGCGGCCGA GACTATGAGC
GCGGGCAACC AGCAGCAGAT GATCCGCGGC ATGGTCGAGA GCCTCGACGC CAAGCTCAGC
GAGGATCCGA ACAATTTCGA GGGATGGATG CGGCTCGTCC GCTCTTACGC CGTATTAAAC
GACAAGGATC GCGCCGCCGG CGCCTTGAAG CGTGGGCTTG CGGCCTTTCC GCCGCCCGGC
GAACAGGGCA GGCAATTGCT GACGCTTGCC AGGGAACTCG GCATAGCCAC GGAGGGAGCG
ACGCAATGA
 
Protein sequence
MKARFTYLGD MLFWILVAAL TAALAVILLY PLLRGAKAAD NIRAGEAAVY RDQLRELDRD 
LDGGLITPEE ADYARAEIGR RLIAVSADEP AETPKPARHH RFTEAFVLLV LPVLGLCLYL
TTGRPDLPSQ PLEARLENPG NDVAVLIAKA ERHLAEKPDD GKGWDVLAPI YFRTMRVNDA
QLAYRNAIRL LGPSPVRLDG LAETLMAVSD GVVTEEARQV LEQSLTLEPD NPRARFYIAL
SMEQAGRPNE ARQAFEALAK QSPSDAPWLP LVNQHIAMNG GAPAGTNPAA PGADPAAPGA
RPAAPGNPTQ QDVAAAETMS AGNQQQMIRG MVESLDAKLS EDPNNFEGWM RLVRSYAVLN
DKDRAAGALK RGLAAFPPPG EQGRQLLTLA RELGIATEGA TQ
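
The relationship between the gene sequence, the GC content field, and the protein sequence can be reproduced with a short script. This is a rough sketch, not the tool used to build this record: the helper names (`clean`, `gc_fraction`, `translate`) are hypothetical, and the codon table is written out by hand. Translation table 11 assigns the same amino acids as the standard code and differs only in which codons may serve as start codons; since this gene starts with ATG, no special handling of the first codon is needed here.

```python
from itertools import product

# Amino-acid assignments shared by the standard code and table 11,
# listed in TCAG order for each codon position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq):
    """Remove the spaces and line breaks used to lay out the sequence."""
    return "".join(seq.split()).upper()


def gc_fraction(seq):
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq):
    """Translate codon by codon from the first base, stopping at a stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First line of the gene sequence, copied with its 10-base spacing.
first_line = "ATGAAGGCGC GCTTCACGTA TTTGGGCGAC ATGCTGTTCT GGATTCTCGT TGCCGCTTTG"
print(translate(first_line))  # MKARFTYLGDMLFWILVAAL
```

The printed residues match the first two 10-residue groups of the protein sequence. Passing the full gene sequence to `gc_fraction` and `translate` would allow the same comparison against the GC content field and the complete protein sequence.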