Gene Rleg_2227 details


Gene Information

Locus tag: Rleg_2227
Symbol:
ID: 8013234
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhizobium leguminosarum bv. trifolii WSM1325
Kingdom: Bacteria
Replicon accession: NC_012850
Strand:
Start bp: 2232123
End bp: 2233412
Gene Length: 1290 bp
Protein Length: 429 aa
Translation table: 11
GC content: 59%
IMG OID: 644824813
Product: hypothetical protein
Protein accession: YP_002976043
Protein GI: 241204947
COG category: [U] Intracellular trafficking, secretion, and vesicular transport
COG ID: [COG4961] Flp pilus assembly protein TadG
TIGRFAM ID:
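
The length fields above can be cross-checked against the coordinates. The following is a minimal sketch, assuming that Start bp and End bp are 1-based inclusive positions and that the coding sequence ends in a single stop codon that is not counted in the protein length. The function names are illustrative and are not part of any database API.

```python
# Cross-check the record's Gene Length and Protein Length fields.
# Assumption: Start bp and End bp are 1-based, inclusive coordinates.
# Assumption: the CDS ends with one stop codon, which is not translated.

def gene_length(start_bp: int, end_bp: int) -> int:
    """Number of bases covered by an inclusive coordinate range."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Number of amino acids encoded by a CDS of the given length."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length_bp = gene_length(2232123, 2233412)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)
```

Under these assumptions the computed values agree with the Gene Length (1290 bp) and Protein Length (429 aa) fields of the record.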


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value: 0.410036
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 27
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCGATCC TTCCGCCCGG TTTCATATCA GACCGCTCGG GCAATTTCGG CATCATGACG 
GCACTGCTGG TGGTGCCGCT CTTCGGTGCG GCCGGCATGG CGGTGGATTT CGCCCACGCG
CTCAGCCTGA GGACGCAGCT CTACGCCGCT GCCGATGCTG CCGCCGTCGG TTCGATCGCC
GAAAAATCCG GCGCCGTCGC AGCCGCCATG ACCATGAGCG GCAACGGCAC GATCTCGCTC
GGCAAGGACG ACGCCCGCAG CATCTTCATG TCTCAAATAT CCGGGGAGCT GACCGACGTT
CAGGTCGATC TCGGAATCGA TGTCACCAAG ACCGCCAACA AGCTGAATTC GCAGGTTTCC
TTCAGTGCGA CTGTGCCTAC CACCTTCATG CGCGTTCTTG GCCGGGATTC GATCACGATC
TCTGGTACAG CGACGGCCGA ATACCAGACC GCGTCTTTTA TGGATTTCTA CATTCTCCTC
GACAACACCC CTTCGATGGG CGTTGGCGCC ACCGCGACAG ACGTCTCGAC GATGGAAAAA
AACACCAGCG ATACCTGCGC TTTCGCGTGC CATGAAACGC AGAACAACAA CAATTATTAC
AATCTCGCCA AGAAGCTCGG CGTCAGCATG CGCATCGACG TCGTGCGCCA GGCGACCAAG
GAACTGACGG TGACCGCCAA GTCCACGCGT GTTTCCAGCA ATCAGTTCCG CATGGGCGTC
TATACGTTCG GCACCAAGGC CGAGGATGCA AAGCTGACCA CCATATCCGA CCCGACGGAC
GATCTCGACA AGGTGCGCAG CTATACCGAC GCCGTCGATC TCATGACCAT TCCGTTTCAG
GGCTATAACA ACGACCAGCA GACGAGCTTC GACAGCGCGC TGACGCAGAT GAAAACCATT
ATCACCACCC CCGGCGACGG CAGCACCGCC ACGACACCGC AGAAGATTCT TTTCTTCGTC
TCGGACGGCG TCGGCGACAG CGAAAAACCG AAAGGCTGCA CCAAGAAACT CACCGGCAAC
CGTTGCCAGG AGCCGATCGA CACGTCCTTC TGTCAGCCAC TGAAGGACAA GAGTATCAGG
ATCGCGGTGC TCTACACCAC CTATCTGCCG CTGCCGAAAA ACAGCTGGTA CAATACGTGG
ATCAAGCCTT TCCAGGGCGA GATCCCGACG AAGATGCAGG CATGCGCCTC GCCCGGCCTC
TATTTCGAAG TGACGCCGAC CGAAGGCATC GCCGATGCGA TGAAGGCGCT TTTCCTCAAG
GTCATCCGGG CACCGCGCAT CACCAGCTAG
 
Protein sequence
MAILPPGFIS DRSGNFGIMT ALLVVPLFGA AGMAVDFAHA LSLRTQLYAA ADAAAVGSIA 
EKSGAVAAAM TMSGNGTISL GKDDARSIFM SQISGELTDV QVDLGIDVTK TANKLNSQVS
FSATVPTTFM RVLGRDSITI SGTATAEYQT ASFMDFYILL DNTPSMGVGA TATDVSTMEK
NTSDTCAFAC HETQNNNNYY NLAKKLGVSM RIDVVRQATK ELTVTAKSTR VSSNQFRMGV
YTFGTKAEDA KLTTISDPTD DLDKVRSYTD AVDLMTIPFQ GYNNDQQTSF DSALTQMKTI
ITTPGDGSTA TTPQKILFFV SDGVGDSEKP KGCTKKLTGN RCQEPIDTSF CQPLKDKSIR
IAVLYTTYLP LPKNSWYNTW IKPFQGEIPT KMQACASPGL YFEVTPTEGI ADAMKALFLK
VIRAPRITS
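
The sequences above are printed in space-separated blocks of 10 characters, so they need to be joined before any computation. The following is a minimal sketch of how the blocks might be cleaned and a GC percentage computed. The helper names `clean_sequence` and `gc_percent` are hypothetical, and only the first line of the gene sequence is used as sample input, so the printed value is for that line alone and not for the whole gene.

```python
# Join a block-formatted sequence and compute its GC percentage.
# Helper names are illustrative; they do not come from the source record.

def clean_sequence(raw: str) -> str:
    """Remove all whitespace and upper-case the sequence."""
    return "".join(raw.split()).upper()


def gc_percent(raw: str) -> float:
    """Percentage of G and C bases in a (possibly block-formatted) sequence."""
    seq = clean_sequence(raw)
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First line of the gene sequence, copied from the record.
first_line = "ATGGCGATCC TTCCGCCCGG TTTCATATCA GACCGCTCGG GCAATTTCGG CATCATGACG"
print(len(clean_sequence(first_line)))
print(round(gc_percent(first_line), 1))
```

Passing the full gene sequence to `gc_percent` would give a value to compare with the GC content field of the record.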