Gene Rleg2_3654 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRleg2_3654 
Symbol 
ID6982416 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhizobium leguminosarum bv. trifolii WSM2304 
KingdomBacteria 
Replicon accessionNC_011369 
Strand
Start bp3782171 
End bp3783700 
Gene Length1530 bp 
Protein Length509 aa 
Translation table11 
GC content63% 
IMG OID643398376 
ProductF0F1 ATP synthase subunit alpha 
Protein accessionYP_002283143 
Protein GI209551226 
COG category[C] Energy production and conversion 
COG ID[COG0056] F0F1-type ATP synthase, alpha subunit 
TIGRFAM ID[TIGR00962] proton translocating ATP synthase, F1 alpha subunit 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones27 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGATATCC GCGCCGCGGA AATTTCCGCA ATTCTCAAAG ACCAGATCAA AAATTTCGGC 
AAAGAGGCAG AAGTCTCGGA AGTCGGCCAG GTTCTCTCCG TCGGTGACGG TATCGCCCGC
GTCTACGGTC TGGACAATGT CCAGGCCGGT GAAATGGTCG AGTTCCCCGG CGGCATCCGC
GGCATGGCGC TGAACCTTGA ATCCGACAAT GTCGGCGTCG TTATTTTCGG CTCCGACCGC
GACATCAAGG AAGGCGACAC CGTCAAGCGG ACAGGCGCCA TCGTCGACGT CCCGGTCGGT
CCGGAATTGC TCGGCCGCGT CGTCGACGCG CTCGGCAACC CGATCGACGG CAAGGGCCCG
ATCAACGCGA CGCGCCGCTC GCGCGTCGAC GTCAAGGCGC CTGGCATCAT TCCGCGCAAG
TCGGTTCATG AGCCGATGTC GACCGGCCTC AAGGCCATCG ACGCGCTGAT CCCGGTCGGC
CGCGGCCAGC GCGAGCTGGT CATCGGCGAC CGCCAGACCG GCAAGACCGC CATTCTTCTC
GATGCCTTCC TCAACCAGAA GGCCATTCAC GACAACGGCC CGGAAGGCGA AAAGCTTTAC
TGCGTCTACG TCGCCGTCGG CCAGAAGCGC TCGACCGTTG CCCAGTTCGT CAAGGTGCTC
GAAGAGCGCG GCGCACTGAA GTATTCGATC ATCGTTGCCG CCACCGCTTC CGACCCGGCC
CCGATGCAGT TCCTGGCTCC GTTTGCCGGC TGCGCCATGG GCGAATATTT CCGCGACAAC
GGCATGCATG CGCTGATCGG TTATGACGAC CTGTCGAAGC AGGCCGTGTC CTACCGCCAG
ATGTCGCTGC TGCTGCGCCG CCCGCCGGGC CGCGAAGCCT ATCCGGGCGA TGTTTTCTAC
CTGCACTCGC GTCTGCTCGA GCGCGCTGCG AAGATGAACG ACGACAAGGG CGCCGGTTCG
CTGACCGCTC TGCCGGTTAT CGAAACGCAG GGCAACGACG TGTCGGCCTT CATTCCGACC
AACGTGATCT CGATCACAGA CGGCCAGATC TTCCTTGAAA CCGACCTGTT CTACCAGGGT
ATCCGCCCGG CCGTGAACGT CGGTCTGTCG GTTTCCCGCG TCGGCTCGTC GGCACAGATC
AAGGCGATGA AGCAGGTTGC CGGCTCGATC AAGGGCGAAC TCGCCCAGTA TCGCGAAATG
GCCGCCTTCG CCCAGTTCGG TTCGGACCTC GACGCTGCGA CGCAGCGGCT GCTGAACCGC
GGCGCACGCC TGACCGAACT CCTGAAGCAG CCGCAGTTCT CGCCGCTGAA GACGGAAGAG
CAGGTCGCGG TGATCTTCGC CGGCGTCAAC GGCTATCTCG ACAAGCTGCC GGTTGCTTCG
GTCGGCAAGT TCGAGCAGGG CTTCCTCTCC TATTTGCGTT CGGAAGGCTC TGCCATCCTC
GACGCGATCC GCACGGAAAA GGCAATCAGC GACGATACCA AGGGCAAGCT CACCGCTGCT
CTCGATAGCT TCGCAAAGTC TTTCTCGTAA
 
Protein sequence
MDIRAAEISA ILKDQIKNFG KEAEVSEVGQ VLSVGDGIAR VYGLDNVQAG EMVEFPGGIR 
GMALNLESDN VGVVIFGSDR DIKEGDTVKR TGAIVDVPVG PELLGRVVDA LGNPIDGKGP
INATRRSRVD VKAPGIIPRK SVHEPMSTGL KAIDALIPVG RGQRELVIGD RQTGKTAILL
DAFLNQKAIH DNGPEGEKLY CVYVAVGQKR STVAQFVKVL EERGALKYSI IVAATASDPA
PMQFLAPFAG CAMGEYFRDN GMHALIGYDD LSKQAVSYRQ MSLLLRRPPG REAYPGDVFY
LHSRLLERAA KMNDDKGAGS LTALPVIETQ GNDVSAFIPT NVISITDGQI FLETDLFYQG
IRPAVNVGLS VSRVGSSAQI KAMKQVAGSI KGELAQYREM AAFAQFGSDL DAATQRLLNR
GARLTELLKQ PQFSPLKTEE QVAVIFAGVN GYLDKLPVAS VGKFEQGFLS YLRSEGSAIL
DAIRTEKAIS DDTKGKLTAA LDSFAKSFS