Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rleg2_5621 |
Symbol | |
ID | 6977012 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM2304 |
Kingdom | Bacteria |
Replicon accession | NC_011366 |
Strand | - |
Start bp | 7839 |
End bp | 9737 |
Gene Length | 1899 bp |
Protein Length | 632 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 643393078 |
Product | type II secretion system protein E |
Protein accession | YP_002277896 |
Protein GI | 209546006 |
COG category | [U] Intracellular trafficking, secretion, and vesicular transport |
COG ID | [COG4962] Flp pilus assembly protein, ATPase CpaF |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 28 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCTGTTGA AGATTTCCTA TGAGGACGGC AGCGGCCGGG AGGTGATCCC GCTCTCGGCA AACGAGACCT ATTTCGTCGG CGAAAGCAGC ACGCTGACGC TGCCGGCCGG CGCCGGCGTG GTGCGGCTGC GCGGCAGCCA CGTCTCCTCG CCGCAATTCG TGCTGCGCAA ATCCGGCCAG GGCTGGTCGG TGCAGCATCA CGGCCGCAAC CCGACCCGCG TCGACGATCA GCCGCTTCGA GCCGGCACGC CGGTTGCGGT TTCCGCCGGC ATGTCGATCT GGGTGCCGAA CGTCACGATC GAACTCGTCG AACCGGCCGC GGCCGCCGAA GCGGTGACGC AGTTTCCCGA TCAGGAACGC GTACTCGCCT TGCAGATGGA GATCCACGAG CGACTGCTGA AGGACACGCA ATACGACCGG CTGGTGAAAT CCTCTGATTT CGGCCGGGAG GATACGCGAA ACCGCATCCG CGAACGGCTC GACATGTTCA TCAAGGAAGC GCTCGACGGC GCGCCGCAGG ATCTCGTCAT CCTCGTCATC AAGAACGCCG TCTATCGGTG GCTGGCAAAA CGCATCGCCC GCACCGGGCG GCGCGACGCC TCGTCAACTG CCGCCAGCTT GTCGCGCGAG GAGCAGGACA ATCGCCGGCT GTTCGACGTC GGCAAGGCGC TGATCTCGGC GCTGCAGCTG AAGCTGAATT TCGAATCCAC CCGCGCCGAC TTCGCCCAGC TCGACACCCG CTTCAGCGCC GCCTTCCAGT CCCGCCAGGC GCTGTTCAAT GCCGGCGACC GCTACGAGAT CGCCCATATG CATCTACGTT CGAATATCGA AGAGCTGATG TACCGCTGGG GCACGATCTC GGAACTGATG GATCTCGATG TCATCTCGGA AATCATGGTG ACGCGTTATG ACGAGATCTA TGTCGAAAAA TTCGGCCTGC TGGAACGTTA TCCCTTTGCC TTCGCTAATG AGCGGCAGCT GATGAAGGTG ATCGAGCGCA TCGCCGTCGA TTCCAACCGC TCGATCAACG AGAGCGAGGC GATGGCCGAC TTCCGCATGC CGGATGGATC GCGCGTCAAC GCCGTCATTC CGCCGCTGGC GGTCAAGGGC GCTTGCCTCA CCATCCGCAA GTTCGGCGGC AAGTCGCGGC TCGACATCTC CAAGCTGGTC ACGGCCGGCG CACTCAGCGA GCCGATGCGC GCCTTCCTCG AGGCGGCGGT CCGCGCCCGC AAGAACATCG TCGTCTCCGG CGGCACCGGC TCCGGCAAGA CGACGCTCTT GAACAGCCTG TCGCAGTTCA TCCCGATCGG CGAGCGCGTC GTTGCCGTCG AGGACACGTC GGAACTGCAG CTCGACGGCA TTCATGTCGT CTATCTGCAA TCGCGGCCGA AGACGGCGGA GTCCGAAACC AGCGTCACCA TCCGCGATCT CGTGCGCAAC GCGCTGCGCA TGCGCCCCGA CCGCATCATC GTCGGCGAGT GCCGCGGTGC CGAGGCGATC GATATGCTGC AGGCGATGAA CACCGGCCAT GCCGGCTCGA TGACGACGGC GCATGCCAAT ACGCCGCAGG ACATGATGAC CCGCCTGGAG GTGATGGTGC TGCAGGGGCA GAGCTCGCTG CCGGTCATGG CGATCCGCCA GCAGATCGTT GCCGCCGTCG AGCTCGTCGT GCAGCTGAAC CGCCTGGAAA GCGGGCGGCG CGCGGTGACC GAAATATCCG AGGTGATCGG CATCGATCCG GATACCGGCC TCGTCATCGT CGAGCCGATC TTCCATCTGG TCGGGCGCGC CGGCGGCCAG GCGGTGCATG CCTTCACCGG CTATCTGCCG AGCTTCGTCG CCGAGCTCGT CGAGTTCGGC GAAGACGGCG AAATCGAAAA ACTCGATATG TTCGTTTAG
|
Protein sequence | MLLKISYEDG SGREVIPLSA NETYFVGESS TLTLPAGAGV VRLRGSHVSS PQFVLRKSGQ GWSVQHHGRN PTRVDDQPLR AGTPVAVSAG MSIWVPNVTI ELVEPAAAAE AVTQFPDQER VLALQMEIHE RLLKDTQYDR LVKSSDFGRE DTRNRIRERL DMFIKEALDG APQDLVILVI KNAVYRWLAK RIARTGRRDA SSTAASLSRE EQDNRRLFDV GKALISALQL KLNFESTRAD FAQLDTRFSA AFQSRQALFN AGDRYEIAHM HLRSNIEELM YRWGTISELM DLDVISEIMV TRYDEIYVEK FGLLERYPFA FANERQLMKV IERIAVDSNR SINESEAMAD FRMPDGSRVN AVIPPLAVKG ACLTIRKFGG KSRLDISKLV TAGALSEPMR AFLEAAVRAR KNIVVSGGTG SGKTTLLNSL SQFIPIGERV VAVEDTSELQ LDGIHVVYLQ SRPKTAESET SVTIRDLVRN ALRMRPDRII VGECRGAEAI DMLQAMNTGH AGSMTTAHAN TPQDMMTRLE VMVLQGQSSL PVMAIRQQIV AAVELVVQLN RLESGRRAVT EISEVIGIDP DTGLVIVEPI FHLVGRAGGQ AVHAFTGYLP SFVAELVEFG EDGEIEKLDM FV
|
| |