Gene Rleg_3943 details


Gene Information

Locus tag: Rleg_3943
Symbol:
ID: 8014759
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhizobium leguminosarum bv. trifolii WSM1325
Kingdom: Bacteria
Replicon accession: NC_012850
Strand:
Start bp: 4016364
End bp: 4017893
Gene Length: 1530 bp
Protein Length: 509 aa
Translation table: 11
GC content: 62%
IMG OID: 644826512
Product: F0F1 ATP synthase subunit alpha
Protein accession: YP_002977723
Protein GI: 241206627
COG category: [C] Energy production and conversion
COG ID: [COG0056] F0F1-type ATP synthase, alpha subunit
TIGRFAM ID: [TIGR00962] proton translocating ATP synthase, F1 alpha subunit
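
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below is illustrative only: the variable names are made up for this example, and it assumes that the start and end coordinates are 1-based and inclusive and that the gene length includes a single stop codon that is not counted in the protein length.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 4016364
end_bp = 4017893
gene_length_bp = 1530
protein_length_aa = 509

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span_bp = end_bp - start_bp + 1

# Assumption: the CDS ends in one stop codon that does not appear in the protein.
coding_codons = gene_length_bp // 3 - 1

print(span_bp == gene_length_bp)           # True
print(gene_length_bp % 3 == 0)             # True
print(coding_codons == protein_length_aa)  # True
```

Under those assumptions the three fields agree: 1530 bp is 510 codons, one of which is the stop codon, leaving 509 amino acids.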


Plasmid Coverage information

Num covering plasmid clones: 24
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 34
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGATATCC GCGCCGCGGA AATTTCCGCA ATTCTCAAAG ACCAGATTAA AAATTTCGGC 
AAAGAGGCAG AAGTCTCGGA AGTCGGCCAG GTTCTCTCCG TCGGTGACGG TATCGCTCGT
GTCTACGGTC TGGACAATGT CCAGGCCGGT GAAATGGTCG AGTTCCCGGG CGGCATCCGC
GGCATGGCCC TGAACCTCGA ATCCGACAAT GTCGGCGTCG TCATCTTCGG CTCCGACCGT
GACATCAAGG AAGGCGACAC CGTCAAGCGG ACCGGCGCCA TCGTTGACGT TCCAGTTGGT
CCGGAACTGC TCGGCCGCGT CGTCGACGCG CTCGGCAATC CGATCGACGG CAAGGGCCCG
ATCAACGCGA CGCGCCGTTC GCGCGTCGAC GTCAAGGCTC CCGGCATCAT TCCGCGCAAG
TCGGTTCATG AGCCGATGTC GACCGGCCTC AAGGCCATCG ACGCGCTGAT CCCGGTCGGC
CGCGGCCAGC GCGAGCTGGT CATCGGCGAC CGCCAGACCG GCAAGACTGC GATCCTTCTC
GATGCCTTCC TCAACCAGAA GGCCATTCAC GACAACGGTC CGGAAGGCGA AAAGCTTTAC
TGCGTCTACG TCGCCGTCGG CCAGAAGCGT TCGACCGTCG CCCAGTTCGT CAAGGTGCTC
GAAGAGCGCG GCGCACTGAA GTATTCGATC ATCGTTGCCG CCACCGCTTC CGATCCGGCG
CCGATGCAGT TTCTGGCGCC GTTTGCCGGC TGCGCCATGG GCGAATATTT CCGTGACAAC
GGCATGCATG CGCTGATCGG CTACGACGAC CTGTCCAAGC AGGCCGTGTC CTACCGCCAG
ATGTCGCTGC TGCTGCGCCG CCCGCCGGGC CGCGAAGCCT ATCCGGGCGA CGTTTTCTAT
CTGCACTCGC GCCTGCTCGA GCGCGCTGCG AAGATGAACG ACGACAAGGG CGCCGGTTCG
CTCACCGCTC TGCCGGTCAT CGAAACGCAG GGCAACGACG TGTCGGCCTT CATTCCGACC
AACGTGATCT CGATCACCGA CGGCCAGATC TTCCTTGAAA CCGACCTGTT CTACCAGGGT
ATCCGCCCGG CTGTGAACGT CGGTCTGTCG GTTTCCCGCG TCGGTTCGTC GGCGCAGATC
AAGGCGATGA AGCAGGTTGC CGGCTCGATC AAGGGCGAAC TCGCCCAGTA TCGCGAAATG
GCCGCCTTCG CCCAGTTCGG TTCGGACCTC GACGCTGCGA CGCAGCGCCT GCTGAACCGC
GGTGCACGCC TGACCGAACT CCTGAAGCAG CCGCAGTTCT CGCCGCTGAA GACGGAAGAG
CAGGTCGCCG TGATCTTTGC TGGCGTCAAC GGCTATCTCG ATAAGCTGCC GGTCGCTTCG
GTCGGCAAGT TCGAGCAGGG CTTCCTCTCA TATCTGCGTT CGGAAGGCTC CGCCATCCTC
GACGCGATCC GCACGGAAAA GGCAATCAGC GACGATACCA AGGGCAAGCT CAATGCTGCT
CTGGATAGCT TCGCAAAGTC TTTCTCGTAA
 
Protein sequence
MDIRAAEISA ILKDQIKNFG KEAEVSEVGQ VLSVGDGIAR VYGLDNVQAG EMVEFPGGIR 
GMALNLESDN VGVVIFGSDR DIKEGDTVKR TGAIVDVPVG PELLGRVVDA LGNPIDGKGP
INATRRSRVD VKAPGIIPRK SVHEPMSTGL KAIDALIPVG RGQRELVIGD RQTGKTAILL
DAFLNQKAIH DNGPEGEKLY CVYVAVGQKR STVAQFVKVL EERGALKYSI IVAATASDPA
PMQFLAPFAG CAMGEYFRDN GMHALIGYDD LSKQAVSYRQ MSLLLRRPPG REAYPGDVFY
LHSRLLERAA KMNDDKGAGS LTALPVIETQ GNDVSAFIPT NVISITDGQI FLETDLFYQG
IRPAVNVGLS VSRVGSSAQI KAMKQVAGSI KGELAQYREM AAFAQFGSDL DAATQRLLNR
GARLTELLKQ PQFSPLKTEE QVAVIFAGVN GYLDKLPVAS VGKFEQGFLS YLRSEGSAIL
DAIRTEKAIS DDTKGKLNAA LDSFAKSFS
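
The sequences above are printed in blocks of ten characters separated by spaces, so they need to be joined before any analysis. The following is a minimal sketch of how that might be done, together with a simple GC-content calculation. The helper names `clean_sequence` and `gc_percent` are hypothetical and not part of any tool associated with this record.

```python
def clean_sequence(blocked: str) -> str:
    """Remove spaces and line breaks from a sequence printed in blocks."""
    return "".join(blocked.split()).upper()


def gc_percent(dna: str) -> float:
    """Return the percentage of G and C bases in a DNA string (0.0 if empty)."""
    if not dna:
        return 0.0
    gc_count = sum(1 for base in dna if base in "GC")
    return 100.0 * gc_count / len(dna)


# First and last ten-base blocks of the gene sequence, as printed in this record.
first_block = clean_sequence("ATGGATATCC")
last_block = clean_sequence("TTTCTCGTAA")

print(first_block[:3])   # ATG (start codon)
print(last_block[-3:])   # TAA (stop codon)
```

Passing the full gene sequence through `clean_sequence` and then `gc_percent` gives a value that can be compared with the GC content field listed in the Gene Information section.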