Gene Rleg_3961 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRleg_3961 
Symbol 
ID8014775 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhizobium leguminosarum bv. trifolii WSM1325 
KingdomBacteria 
Replicon accessionNC_012850 
Strand
Start bp4037608 
End bp4038684 
Gene Length1077 bp 
Protein Length358 aa 
Translation table11 
GC content62% 
IMG OID644826530 
ProductGumN family protein 
Protein accessionYP_002977741 
Protein GI241206645 
COG category[S] Function unknown 
COG ID[COG3735] Uncharacterized protein conserved in bacteria 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.0233081 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones35 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACGACAT CAATCGGCCG CCGGACGTCT TACCTCGCAG TGCCGGCCAA TCTGTTGCTC 
TGGCTGATCG CAGCCTTTCA CATGCTGCTT CTTGCCGCCC TGTTTGCCGC CTTCCTGACG
GCAAGGCCGG CAGCGGCCGA AGACGTCGCC TGCACCGGCC GCAATCTGAT GGTCGAGCTG
CAGCAGAACG ACCCTGCCCG CTACGCAGAG GCGCTGAAGG AAGCCGACGC CACGCCGAAC
GGCAAGGGTA TCTTCTGGAA GATCGAGAAG CCGGGATTGG CACCCTCGTG GCTGCTCGGC
AGCATGCATG TCACCGATCC GCGCGTGCTG GCCCTGCCGC CGCGCGCCCA GGCAGCCCAC
GATGCCGCCG ACACGATCAT CATCGAATCC GACGAGATCC TCGATGAGCG GAAGGCGACC
GCCGCCCTGC TTGCAAAGCC GGAACTGACG ATGTTCACCG ACGGCACGAC GATCGACAAG
CTGCTTTCTC CCGAGGACTA CAAGCGTCTC GAAACCGGCC TCAAGCAGCG CGGTATCCCG
ATCAGTACCG TTTCCCGGAT GCGGCCCTGG ATGATTTCCA GCGCCGTCGC CCTGCCGGCC
TGCGAAATCG CCCGCAAGGC AAAAGGCGCG CAGTTCCTCG ACCAGAAGAT CGCCACCGAT
GCCATTGCTC AGGGCAAACA GGTCAAGGGG CTGGAAACCC TTGCCGAGCA GATCCAGGCC
ATGGCCGATC TGCCGGTCGA ATTCCATCTG AAATCGCTGA TCGAGACGCT GGAACTCGGC
GACAAGATGA GCGATGTCGT CGAGACGATG ACCGACCTCT ACCTCTCGGG TGATATCGGC
ATGACCATGC CGATGCTGAA AACCGTGACA CCGGAGGAGG AAGGTGAAAA CAGCGATTAT
GCCGCCTTCG AGCAGCGCGT CATCCTTGAC CGCAACAAGG TGATGGCCGA GCGCGCAGCG
CCCATCCTCG ACAGCGGCAA CGTCTTCATG GCCGTCGGTG CCCTGCATCT GCCCGGCAAG
GACGGCGTCA TCGAACTGCT GCGCCAGCAG GGCTTCACCG TAACAGATGT AAATTAA
 
Protein sequence
MTTSIGRRTS YLAVPANLLL WLIAAFHMLL LAALFAAFLT ARPAAAEDVA CTGRNLMVEL 
QQNDPARYAE ALKEADATPN GKGIFWKIEK PGLAPSWLLG SMHVTDPRVL ALPPRAQAAH
DAADTIIIES DEILDERKAT AALLAKPELT MFTDGTTIDK LLSPEDYKRL ETGLKQRGIP
ISTVSRMRPW MISSAVALPA CEIARKAKGA QFLDQKIATD AIAQGKQVKG LETLAEQIQA
MADLPVEFHL KSLIETLELG DKMSDVVETM TDLYLSGDIG MTMPMLKTVT PEEEGENSDY
AAFEQRVILD RNKVMAERAA PILDSGNVFM AVGALHLPGK DGVIELLRQQ GFTVTDVN