Gene Rleg_1903 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRleg_1903 
Symbol 
ID8012951 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhizobium leguminosarum bv. trifolii WSM1325 
KingdomBacteria 
Replicon accessionNC_012850 
Strand
Start bp1888817 
End bp1889893 
Gene Length1077 bp 
Protein Length358 aa 
Translation table11 
GC content62% 
IMG OID644824492 
Productbasic membrane lipoprotein 
Protein accessionYP_002975724 
Protein GI241204628 
COG category[R] General function prediction only 
COG ID[COG1744] Uncharacterized ABC-type transport system, periplasmic component/surface lipoprotein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value0.94528 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones23 
Fosmid unclonability p-value0.373123 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAGAAGT TAGCATTCAC ACTTGCCGCC TCGGCTGCCG CCGTGATCGG CATCTCTTCC 
GCCGCGCAAG CCGCGGACAA GACGAAGGTC TGCTTCGTCT ATGTCGGTTC GCATACCGAC
GGCGGCTATT CACAGGCCCA CGACCTCGGC CGCCAGCAGA TCCAGGCCGA GTTCGGCGAC
AAGATCGACA CGCCTTATCT CGAAAACGTG CCGGAAGGCC CGGATGCCGA GCGCGCCATC
GAGCGCCTTG CCCGTTCCGG CTGCAAGCTG ATCTTCACGA CGTCCTTCGG CTTCATGGAC
GCGACCGTCA AGGTCGCCGC CAAGTTCCCG GACGTGAAGT TCGAGCATGG CACCGGCTAC
AAGGCCGGCC CGAACCTTGC GACCTACAAT TCGCGCTTCT ATGAAGGCCG CTACATCCTC
GGCCAGATCG CCGCCAAGAC CTCGAAGAAT CACGGCGCGG CCTACATCGC CTCCTTCCCG
ATTCCCGAAG TCGTGATGGG CATCAACTCG TTCGAACAGG GCGCCAAGTC GGTCGATCCG
AGCTTCAAGC TGAAGGTCAT CTGGGTCAAC ACCTGGTTCG ACCCCGGCAA GGAAGCCGAT
GCCGCCAAGG CGATGGTCGA CCAGGGCGTC GACGTCTTGA CGCAGCACAC CGACACGACT
GCGCCGATGC AGGTCGCCGA AGAACGCGGC ATCCACGCCT TCGGCCAGGC CTCCGACATG
ATCGCAGCAG GCCCGAAGGC TCAGCTGACG GCAATCGTTG ACACTTGGGG GACCTACTAC
TCCAAGCGCG TTCACGCTCT TCTGGACGGC ACCTGGAAGT CCGAGCAGAG CTGGGACGGC
CTGAAGGACG GCATCCTGAA GATGGCGCCC TATACCAACA TGCCCGACGA CGTGAAGAAG
ATGGCCGAGG AAACCGAAGC CAAGATCAAG TCAGGCGAAC TGCATCCCTT CACCGGCCCG
ATCAACAAGC AGGACGGAAC GCCCTGGCTG AAGGCTGGCG AGAAGGCCGA TGACGGCACG
CTGCTCGGCA TGAACTTCTA TGTCGAAGGC GTCGACGATA AGCTGCCGGG TAAATAG
 
Protein sequence
MKKLAFTLAA SAAAVIGISS AAQAADKTKV CFVYVGSHTD GGYSQAHDLG RQQIQAEFGD 
KIDTPYLENV PEGPDAERAI ERLARSGCKL IFTTSFGFMD ATVKVAAKFP DVKFEHGTGY
KAGPNLATYN SRFYEGRYIL GQIAAKTSKN HGAAYIASFP IPEVVMGINS FEQGAKSVDP
SFKLKVIWVN TWFDPGKEAD AAKAMVDQGV DVLTQHTDTT APMQVAEERG IHAFGQASDM
IAAGPKAQLT AIVDTWGTYY SKRVHALLDG TWKSEQSWDG LKDGILKMAP YTNMPDDVKK
MAEETEAKIK SGELHPFTGP INKQDGTPWL KAGEKADDGT LLGMNFYVEG VDDKLPGK