Gene Rleg2_2008 details


Gene Information

Locus tag: Rleg2_2008
Symbol:
ID: 6980747
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhizobium leguminosarum bv. trifolii WSM2304
Kingdom: Bacteria
Replicon accession: NC_011369
Strand:
Start bp: 2068243
End bp: 2069532
Gene Length: 1290 bp
Protein Length: 429 aa
Translation table: 11
GC content: 59%
IMG OID: 643396730
Product: hypothetical protein
Protein accession: YP_002281518
Protein GI: 209549601
COG category: [U] Intracellular trafficking, secretion, and vesicular transport
COG ID: [COG4961] Flp pilus assembly protein TadG
TIGRFAM ID:
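
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming that Start bp and End bp are 1-based inclusive coordinates and that the CDS ends in a single stop codon that is not counted as a residue. The variable names are illustrative and do not come from the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 2068243
end_bp = 2069532
reported_gene_length = 1290    # bp
reported_protein_length = 429  # aa

# Assumption: Start bp and End bp are 1-based, inclusive coordinates.
gene_length = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and adds no residue.
codons, remainder = divmod(gene_length, 3)
protein_length = codons - 1

print(gene_length, remainder, protein_length)  # prints: 1290 0 429
```

Under these assumptions the computed values agree with the reported Gene Length and Protein Length, and the zero remainder shows the span is a whole number of codons.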


Plasmid Coverage information

Num covering plasmid clones: 23
Plasmid unclonability p-value: 0.574447
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 24
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCGATCC TTCCGCCGGG CTTCATATCG GATCGTTCGG GCAATTTCGG CATCATGACG 
GCATTGCTGA TGGTGCCGCT CCTCGGCACG GCAGGCATGG CGGTGGATTT CGCTCACGCC
ATGAGCCTGC GCACGCAGCT CTTTGCCGCC GCCGATGCTG CCGCCGTCGG CTCGATCGCC
GAAAAATCCG GCGCGGTCGC CGCCGCCATG ACCATGACCG GCAACGGCAC GATCTCTCTC
GGCAAGACCG ACGCCCGCAG CATCTTCCTG TCGCAGGTGT CGGGCGAGCT GGCGGATGTT
AATGTCGATC TCGGCATCGA TGTTACCAAG ACGGCCAACA AACTGAACTC GCAGGTTTCG
TTCACGGCGG TCGTGCCGAC CACCTTCATG CGCGTTCTCG GCAAGGATTC GATCACCATC
TCCGGCACGG CAACGGCCGA ATATCTGACC GCGTCGTTCA TGGATTTCTA CATCCTGCTC
GACAACACCC CCTCCATGGG CGTCGGCGCT ACCGCGAAAG ACGTCGCGAC GATGGAAAAG
AACACCAGCG ATAGCTGCGC TTTCGCCTGC CACGAAACGG AAAACAAAAA TAACTACTAC
AATCTTGCCA AGACGCTCGG CGTCAGCATG CGCATCGACG TCGTGCGCCA GGCAACAAAG
GAGCTGACGC TCACCGCAAA GTCGACGCGC GTTTCCACCA ACCAGTTCCG CATGGGCGTC
TATACTTTCG GCACCAAAGC CGAAGACGCC AATCTGACCA CCATATCGGA CCCGACGGAC
GACCTCGACA AGGTACGCAC CTACACTGAC GCGGTCGATT TGATGACTAT CCCGAAACAG
GGCTACAACA ACGACCAGCA GACGAGCTTC GACAACGCGC TGACGCAGAT GAAGGACATC
ATCACCACCC CCGGCGACGG CAGCACCGCC ACGACACCGC AGAAGATCCT GTTCTTCGTC
TCGGACGGCG TCGGCGACAG CGAAAAGCCG AAAGGCTGCA CCAAGAAACT GACCGGCAAC
CGCTGCCAGG AGCCGATCGA CACTTCCTTC TGCAAACCGC TGAAGGACAA GGGCATCAGG
ATTGCGGTGC TCTACACCAC TTATCTACCT TTGCCGAAGA ACAGCTGGTA CAATACGTGG
ATCAGCCCCT TCCAGAGCCA GATCCCGACG AAGATGCAGG AATGCGCCTC GCCTGGCCTC
TATTTCGAGG TGACGCCGAC CGAAGGTATC GCCGATGCGA TGAAGGCTCT CTTCCTCAAA
GCCATCCGCG CGCCGCGCAT TACCAGCTAA
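
The GC content field can be recomputed from the gene sequence. This is a minimal sketch, assuming GC content means the percentage of G and C among the A/C/G/T characters. The function name `gc_content` is hypothetical. Spaces and line breaks between the 10-base blocks are ignored.

```python
def gc_content(seq: str) -> float:
    """Return the percentage of G and C among the A/C/G/T characters in seq."""
    # Keep only nucleotide letters; block spacing and newlines are dropped.
    bases = [b for b in seq.upper() if b in "ACGT"]
    if not bases:
        raise ValueError("no nucleotides found")
    gc = sum(1 for b in bases if b in "GC")
    return 100.0 * gc / len(bases)


# Tiny illustrative input (not taken from the record): 4 of 8 bases are G or C.
print(gc_content("GGCC AATT"))  # prints: 50.0
```

Passing the full gene sequence above as one string would give a value to compare against the reported GC content, which appears to be rounded to a whole percent.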
 
Protein sequence
MAILPPGFIS DRSGNFGIMT ALLMVPLLGT AGMAVDFAHA MSLRTQLFAA ADAAAVGSIA 
EKSGAVAAAM TMTGNGTISL GKTDARSIFL SQVSGELADV NVDLGIDVTK TANKLNSQVS
FTAVVPTTFM RVLGKDSITI SGTATAEYLT ASFMDFYILL DNTPSMGVGA TAKDVATMEK
NTSDSCAFAC HETENKNNYY NLAKTLGVSM RIDVVRQATK ELTLTAKSTR VSTNQFRMGV
YTFGTKAEDA NLTTISDPTD DLDKVRTYTD AVDLMTIPKQ GYNNDQQTSF DNALTQMKDI
ITTPGDGSTA TTPQKILFFV SDGVGDSEKP KGCTKKLTGN RCQEPIDTSF CKPLKDKGIR
IAVLYTTYLP LPKNSWYNTW ISPFQSQIPT KMQECASPGL YFEVTPTEGI ADAMKALFLK
AIRAPRITS
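
The protein sequence can be checked against the gene sequence by translating it codon by codon. This is a minimal sketch, not the database's own method. It relies on translation table 11 using the same codon-to-amino-acid assignments as the standard code. The tables differ in which codons may act as start codons, which this sketch does not handle; that does not matter here because the gene begins with ATG. The names `CODON_TABLE` and `translate` are hypothetical.

```python
from itertools import product

# Codon assignments in TCAG order (TTT, TTC, TTA, TTG, TCT, ...).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(seq: str) -> str:
    """Translate a DNA string in frame 1, stopping at the first stop codon."""
    # Remove the spaces and newlines that separate the 10-base blocks.
    dna = "".join(seq.split()).upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":  # stop codon: end of the protein
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases and last 30 bases of the gene sequence above.
print(translate("ATGGCGATCC TTCCGCCGGG CTTCATATCG"))  # prints: MAILPPGFIS
print(translate("GCCATCCGCG CGCCGCGCAT TACCAGCTAA"))  # prints: AIRAPRITS
```

The two fragments reproduce the first ten and last nine residues of the protein sequence, and the final codon TAA is a stop codon.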