Gene Rleg_3648 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRleg_3648 
Symbol 
ID8014496 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhizobium leguminosarum bv. trifolii WSM1325 
KingdomBacteria 
Replicon accessionNC_012850 
Strand
Start bp3688632 
End bp3689834 
Gene Length1203 bp 
Protein Length400 aa 
Translation table11 
GC content57% 
IMG OID644826211 
Producthypothetical protein 
Protein accessionYP_002977430 
Protein GI241206334 
COG category[U] Intracellular trafficking, secretion, and vesicular transport 
COG ID[COG4961] Flp pilus assembly protein TadG 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones32 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATGAGCA CCTCCTTTTT GCATCCCTGC CTGCGTCGCA TGCTTGGCGA CCGTGGCGGT 
AATTTCGGTA TAATGACGGC GATCGTATTG CCCGTCCTCT TTGGTGCTGC CGGCATGGCG
ATCCAGGTGG GCGACCTATT GCTCTCGAAA CAGCAATTGC AGGAAGCCGC CGATTCTGCC
GCGCTGGCAA CCGCGACCGC CCTGGCAAAC GGCACGATCC AAACCTCGCA GGCCGAGGCC
TTCGCGCGGG ATTTCGTCGC CGGACAAATG GCGAATTACC TGCAGAGCGG CATCGATATC
AAGAGCACGA CGGGCGTCGA TGTCCGGACA ACGACATCGG GCAAGTCGAC ATCCTATCAA
GTGACGGTTT CGCCCGACTA CAACATCGCG GTCAATCCGC TGATGCAGAC GATCGGGTTC
ACGACGCAGA ACATTTCAAC GTCGAGCACG ACGACCAGCG GCAATTCGCA AACCCAAGGC
TCCGTCTCGA TGTTCCTCGT CCTCGACAGA TCCGGCTCAA TGGGCGAGGA TACCGCGACG
GTCAACGCGT CAGATCCTAC GGAAGAATAC AATTACGACT GCAGCGAAAA AGACAGATAC
GGCAACGTAA CCAAGAAGAA GACCTGCACC GATACGCGTC CTCACTACTA CACCAAAATC
GAAGCCCTAA AGCTTGCCGT CGGCACGCTG ACGGGCGAAC TCGACGCAGT CGATCCCGAA
AAGGAATATG TCCGCACGGG TGCCGTTTCC TACAATATCG AGATGCAGAA AGCGAAAGCG
CTGGATTGGG GAACGGCCCA CGTCACCAAA TATGTCAACA AGCTGACGGC GACCGACGGT
ACGGACTCTG GCGAGGCATT CAAGACGGCC TATAATAAGC TTGCCGACGC GGCCGAAGAC
AAGGCGCACG TGGACAAGAC GGGACAGGTT CCGACAAAAT ACATCGTCTT CATGACGGAC
GGAGACAACA ACTATACCTC CGCCGACACC GAAACGAAGA CGTGGTGCGA CAAGGCGCGC
GACGCCAAGA TGCAGGTCTA TACGATTGCC TTTATGGCGC CGGCGCGTGG CCAGGCTCTG
CTGAGCTATT GCGCGACAGC GCCGGGCAAC TATTTTCCCG CTGGCGACAT GACGGCGCTC
CTGAAGGCAT TCAAGGAAAT CGGCATGAAA GCATCCAATC AGGTCACGCG TCTGACGAAC
TGA
 
Protein sequence
MMSTSFLHPC LRRMLGDRGG NFGIMTAIVL PVLFGAAGMA IQVGDLLLSK QQLQEAADSA 
ALATATALAN GTIQTSQAEA FARDFVAGQM ANYLQSGIDI KSTTGVDVRT TTSGKSTSYQ
VTVSPDYNIA VNPLMQTIGF TTQNISTSST TTSGNSQTQG SVSMFLVLDR SGSMGEDTAT
VNASDPTEEY NYDCSEKDRY GNVTKKKTCT DTRPHYYTKI EALKLAVGTL TGELDAVDPE
KEYVRTGAVS YNIEMQKAKA LDWGTAHVTK YVNKLTATDG TDSGEAFKTA YNKLADAAED
KAHVDKTGQV PTKYIVFMTD GDNNYTSADT ETKTWCDKAR DAKMQVYTIA FMAPARGQAL
LSYCATAPGN YFPAGDMTAL LKAFKEIGMK ASNQVTRLTN