Gene Rleg2_3863 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRleg2_3863 
Symbol 
ID6982626 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhizobium leguminosarum bv. trifolii WSM2304 
KingdomBacteria 
Replicon accessionNC_011369 
Strand
Start bp4005793 
End bp4007223 
Gene Length1431 bp 
Protein Length476 aa 
Translation table11 
GC content60% 
IMG OID643398585 
Producthypothetical protein 
Protein accessionYP_002283351 
Protein GI209551434 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.00584301 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value0.146165 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCAGCAA AGACAACTCT CAACGCGAAA AACCTGGAAA CCCTCGGCGC GCAACGTCTC 
GCCGAACTGC TGATCGAGAT CAGCACGGGC AGCGCTGCCC ACAAACGGCG GCTTCGCATG
GAGCTGGCCG GCGATCAAGG CAGTGCGGAA GTGGCTCGTG AGATCCGCAA ACGTCTCGCC
AGCATCGCGA GGGCACGAAC CGTCATCGAA TGGCACAAGG TGAAGAAGAT CAACGCGGAC
CTCGAAACGC AGCGGTCCGC GATCGTTACC GTCGTGGCCG CCGATGATCC GAAGGAAGCG
TTCGATCTCA TTTGGCAATT CCTTGCCGTT GCCGATTCGA TATTCCACCG ATCCAACAGC
AACGACATCG CCTTCACCGA GACGTTCCAT CAGGCATGCG CCGATGCCGC CGTCATCGCC
AGCTCCGTGG GGATCGATAT CGATGTCCTT GCCTACAAGG TTTTCGCTGC GTTGCAGGAT
AACGACCACG GCCAATATGA CCCGCTGATC GCCGAGATGG TTCCGGTGCT TGGTAAGGAC
GGCCTGGAAC GTTTGAAGGG TCTCCTGGTG CAATGGTTGA ACGAAGAGGA GAAGGAGTCT
GCCGACGCCG ACCGGGAAAC TCTCGACTGG GGCGGCGGCG GAACGACCTA CCTGGACGAG
ATCTATGCCA GGCATCGGCA ACAAAGGGCG CGCATCGCTC TGCAGGACAT TGCCGATGCC
CAGGATGATG CTGATGCCTT CATCGCCCAG CAGCCGGAAG AGACCTTAAG GATGCCGATG
GTCGCGATCG CGATTTCCGA CCGCTTGCTC CTGGCTGGAC GCGCCGAGGA AGCATTGAGG
ATTTTGGACG GCGTTGATCA CCGGTTCGAA ATGCCTTTTG AATGGCAGGA AGCCCGCGTC
GAAGTGCTCG AAGCGCTCGG ACGGGGGGAG GAAGCGCAGG CCTATCAATG GCAATGTTTC
GAGCAGTCGC TGAACGGGGA GCACCTTCGC GCTTTCCTGC GAAAACTCGC CGACTTCGAC
GACATCGAAG CCGAGGAAAA AGCATTCGCC TTCGCGCACG GCTTTCCGGA TGTTCATCGA
GCCCTTGCCT TTTTCCTTGC CTGGCCCACA CCGGCCGAGG CAGCGAAACT CATCTCCAAG
CGCCAGGCGG AACTGGACGG CAACCTGTAC GAGCTGATGA CGCCGGCCGC CGAGATGCTA
CAGGAAAAAC AGCCGCTCGC GGCAACCATT CTGCTGCGAG CGATGATCGG CTTCGCCCTC
GATTACGGCC GATCCAGCCG TTACAGGCAT GCCGCGCGCC ACCTGGCGGA ATGCGCCTCG
CTCGCACCGC ATATCGACGA TTTCGGCAAC GCCCGACCGC ACGACGCCTA TGTGGTCGAG
CTGAAGCGCC AACATGGAAA CAAGCACGGC TTCTGGAGCC TGCTGCGTTA A
 
Protein sequence
MAAKTTLNAK NLETLGAQRL AELLIEISTG SAAHKRRLRM ELAGDQGSAE VAREIRKRLA 
SIARARTVIE WHKVKKINAD LETQRSAIVT VVAADDPKEA FDLIWQFLAV ADSIFHRSNS
NDIAFTETFH QACADAAVIA SSVGIDIDVL AYKVFAALQD NDHGQYDPLI AEMVPVLGKD
GLERLKGLLV QWLNEEEKES ADADRETLDW GGGGTTYLDE IYARHRQQRA RIALQDIADA
QDDADAFIAQ QPEETLRMPM VAIAISDRLL LAGRAEEALR ILDGVDHRFE MPFEWQEARV
EVLEALGRGE EAQAYQWQCF EQSLNGEHLR AFLRKLADFD DIEAEEKAFA FAHGFPDVHR
ALAFFLAWPT PAEAAKLISK RQAELDGNLY ELMTPAAEML QEKQPLAATI LLRAMIGFAL
DYGRSSRYRH AARHLAECAS LAPHIDDFGN ARPHDAYVVE LKRQHGNKHG FWSLLR