Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rleg_7220 |
Symbol | |
ID | 8022926 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM1325 |
Kingdom | Bacteria |
Replicon accession | NC_012858 |
Strand | - |
Start bp | 645295 |
End bp | 647193 |
Gene Length | 1899 bp |
Protein Length | 632 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 644834052 |
Product | type II secretion system protein E |
Protein accession | YP_002985186 |
Protein GI | 241667102 |
COG category | [U] Intracellular trafficking, secretion, and vesicular transport |
COG ID | [COG4962] Flp pilus assembly protein, ATPase CpaF |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.177916 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 24 |
Fosmid unclonability p-value | 0.420932 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCTGTTGA AGATTTCCTA TGAGGACGGC AGCGGTCGAG AGGTCATCCC GCTCTCGGCG AACGAAACCT ATTTCGTCGG CGAGAGCAGC ACGCTGACGC TGCCCGCCGG CGCGGGGGTC GTGCGGTTGC GTGGCAGCCA CGTCTCTTCG CCGCAATTCG TGCTGCGGAA GTCCGGCCAG GGATGGTCGG TGCAGCATCA CGGACGCAAC CCGACGCGGG TCGACGATCA GCCGCTTCGG GCCGGCACCC CGGTTGCAGT CTCGGCCGGC ATGTCGATCT GGGTGCCGAA TGTCACGATC GAGCTCGTCG AGCCGGCCGC GGCCGCCGAG GTGGTGACGC AATTTCCCGA TCAGGAACGC GTGCTCGCCC TGCAGATGGA AATCCACGAA CGTCTGCTGA AGGACACGCA ATATGACCGG CTCGTCAAAT CCGCCGATTT CGGCCGCGAG GACACGCGAA ACCGCATCCG CGAACGCCTC GACATGTTCA TCAAGGAGGC GCTCGACGCA GCACAGCAGG ATCTCGTCAT CCTTGTCATC AAGAACGCCG TCTATCGGTG GCTGGCCAAA CGCATTGCCC GAACGGGGCG GCGCGACGCA TCGTCGAATG CCGCAAGCCT GTCGCGCGAG GAGCAGGACA ATCGCCGCCT CTTCGACGTC GGCAAGGCGC TGATTTCGGC GCTGCAGCTC AAGCTCAACT TCGAATCCAC CCGCGCGGAC TTTGCCCAGC TCGACACCCG TTTCAGCGCA GCTTTCCAGT CCAGGCAAGC GCTTTTTAAC GCCGGTGACC GCTATGAGAT CGCCCATATG CACCTGCGCT CCAGCATCGA GGAGCTGATG TATCGCTGGG GAACGATCTC CGAGCTGATG GATCTCGACG TGATCTCGGA AATCATGGTG ACGCGCTACG ACGAGATCTA CGTCGAAAAA TTCGGCCTGC TGGAGCGCTA TCCCTTCGCC TTCGCCAATG AGCGGCAGCT GATGAAGGTG ATCGAGCGCA TCGCCGTCGA TTCCAACCGC TCGATCAACG AGAGCGAGGC GATGGCCGAC TTCCGCATGC CGGATGGCTC TCGCGTCAAC GCCGTCATTC CGCCGCTGGC GGTCAAGGGC GCCTGCCTCA CCATCCGCAA GTTCGGCGGC AAGTCGCGGC TCGATATCAG CAAACTGGTG ACCGCCGGCG CGCTCAGCGA GCCGATGCGC GCCTTCCTCG AGGCGGCCGT CCGCTCCCGC AAGAACATCG TCGTCTCAGG CGGCACCGGC TCCGGCAAGA CGACGCTCTT GAACAGCCTG TCGCAGTTCA TTCCGGTGGG CGAGCGCGTC GTTGCCGTCG AAGACACGTC GGAACTGCAG CTCGACGGCA TTCATGTCGT CTATCTTCAA TCGCGGCCGA AGACGGCGGA GTCGGAGACC AGCGTCACCA TCCGCGACCT CGTGCGCAAC GCGCTGCGCA TGCGTCCCGA CCGCATCATC GTCGGCGAGT GCCGCGGCGC CGAGGCGATC GACATGCTGC AGGCGATGAA CACCGGCCAT GCCGGCTCGA TGACGACGGC GCATGCCAAT ACGCCGCAGG ACATGATGAC CCGCCTGGAG GTGATGGTGC TGCAGGGGCA GAGCTCGCTG CCTGTCATGG CGATCCGCCA GCAGATCGTT GCCGCGGTCG AACTTGTCGT GCAGCTGAAC CGCCTGGCAA ACGGTCGGCG CGCCGTCACC GAAATATCGG AGGTGATCGG TATCGATCCG GATACCGGCC TCATCATCGT CGAGCCGATC TTCAATCTCG TCGGCCGTGC CGGCGGCCAG GCCGTGCATG CCTTCACCGG CTACCTGCCG AGCTTCGTCG CCGAGCTCGT CGAGTTCAAC GACGACGGCG AGATCGAAAA ACTGGACATG TTCGTCTAG
|
Protein sequence | MLLKISYEDG SGREVIPLSA NETYFVGESS TLTLPAGAGV VRLRGSHVSS PQFVLRKSGQ GWSVQHHGRN PTRVDDQPLR AGTPVAVSAG MSIWVPNVTI ELVEPAAAAE VVTQFPDQER VLALQMEIHE RLLKDTQYDR LVKSADFGRE DTRNRIRERL DMFIKEALDA AQQDLVILVI KNAVYRWLAK RIARTGRRDA SSNAASLSRE EQDNRRLFDV GKALISALQL KLNFESTRAD FAQLDTRFSA AFQSRQALFN AGDRYEIAHM HLRSSIEELM YRWGTISELM DLDVISEIMV TRYDEIYVEK FGLLERYPFA FANERQLMKV IERIAVDSNR SINESEAMAD FRMPDGSRVN AVIPPLAVKG ACLTIRKFGG KSRLDISKLV TAGALSEPMR AFLEAAVRSR KNIVVSGGTG SGKTTLLNSL SQFIPVGERV VAVEDTSELQ LDGIHVVYLQ SRPKTAESET SVTIRDLVRN ALRMRPDRII VGECRGAEAI DMLQAMNTGH AGSMTTAHAN TPQDMMTRLE VMVLQGQSSL PVMAIRQQIV AAVELVVQLN RLANGRRAVT EISEVIGIDP DTGLIIVEPI FNLVGRAGGQ AVHAFTGYLP SFVAELVEFN DDGEIEKLDM FV
|
| |