Gene Information |
Locus tag | Ent638_4239 |
Symbol | |
ID | 5110444 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Enterobacter sp. 638 |
Kingdom | Bacteria |
Replicon accession | NC_009425 |
Strand | - |
Start bp | 53795 |
End bp | 56797 |
Gene Length | 3003 bp |
Protein Length | 1000 aa |
Translation table | 11 |
GC content | 59% |
IMG OID | 640480856 |
Product | virulence protein SrfB |
Protein accession | YP_001165518 |
Protein GI | 146284565 |
COG category | [S] Function unknown |
COG ID | [COG4457] Uncharacterized protein conserved in bacteria, putative virulence factor |
TIGRFAM ID | |
|
|
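The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive coordinates and that the gene length includes the stop codon; both assumptions fit the listed values but are not stated in the record. The function names are hypothetical and chosen only for illustration.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming the final codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the Gene Information fields above.
length = gene_length(53795, 56797)
print(length)                  # 3003
print(protein_length(length))  # 1000
```

Both results match the Gene Length (3003 bp) and Protein Length (1000 aa) fields.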
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.104797 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 17 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCTCGCTG AACTTACTGA TTTTAAAAAG CAGGTCAAAA TTATTCGTGA CAGTGGCATC CAGTTCCTGG ATTTCGCGTT TACGCTGCCT GCCCGTAAAG AGTACGGTGA TTTTTTGCGC CAGAACGGCG ATGGCGCCAT CCTGCGCCTG GTCTATAACG AACGCGAAGA TAAACTGCAA ATCCCGGCGG CAGATAATGC CGCGCCACAA TTTGCTGAGT CGGATTACTC TCTGACCACG GAAGAGTCAC TGCACCTCTA TCAGGGGCTG TGGCTCCCGC TGCCCTTCTT CCGCTTTAAC CCGCCGCGTG CGTTTGCACA CGGGCCAACC AACTGGGCGC GCGTGCAGTT CCACGAGCTG GCAGAACCCG ACGAGAAGGG CAACACCTGG CGCGCGATCC TGATCTTCGA TACCAAAATT TTCCCGGATC GCGATAACAC GCAATATCTG GCTCCCAGCG AAGATGACGT GCGCTCTGGC GCCGGTTTTG CGCTGGCCCT GCACCCGCAC GAGATGGGCG ATTTCCTGAC CCTCCCGTGG GTAGACGAGT GGTTGCGTGA AGTGTTCAGC ACCCAGGCGC GCGACGTGCT GCGCCAGCAT GCGGAAGATA TTGATGAAAA ACTGGGGCAA AAAGAGCACC AGGCGCACTA CCTCAACCTG CTGAATATCC TTGATGCCAC CGTGGCCATC CCGGAAGTGC AGGTCAATGA CGTGAAGATC CGCGACTCGG CCATTCCGGT TGATTTGGTG CTGGATATCG GCAACTCCCG CAGTTGCGGA ATCCTGATTG AAGAGCACCG CGACGACAAC AAAGGTCTGT CCCAGCTTTA CCAGCTGCAA CTTCGCGACT TAAGCCAGCC GCAAAACGTC TACAACGAAC CCTTCGACAG CCGCCTGGAG TTTGCGCAGG CCGAGTTTGG CAAGCAGGAC TTTTCGCTGA AAAGCGGACG CAGCGACGCC TTCACCTGGC CGACCATTGG CCGCGTGGGC GATGAGGCCT TCCGCATGGC AGCACAGCGC CTGGGAACAG AAGGCTCAAC CGGGATCTCC AGCCCGAAAC GCTATCTGTG GGACGACCAG CCTTACTCCC CCGGCTGGCG TTTCAGCCAG GCGTTCGTCA AATCCGATCG CGAGCCGCTC GCCACCGCCG CCCCGCTGCT GTACATGCTT AACGACCAGG GCAAGCTGCT GATTCGTCTG CCTGAAGATG AGAGGATGCC GGTCTTCTCC CCGGTTTACA GCCGCAGCTC GTTAATGACC ATGATGCTCT CTGAAGTCCT GAGCCAGGCG CTGATGCAGA TCAACAGCCC GGCGCAGCGT CTGAAGATGA ACCACGCCAG CACGCCGCGC CGCTTGCGCA ATGTGATCAT GACCGTCCCG CCCGCGATGC CAAAACCTGA ACGCGCCATC TTTGAGCAAT GCATGCTGGA TGCCATCCGT CTGGTGTGGA AAGCGCTGGG CTGGGAGGAG ATGGACGACG AAAGTGAAGA CAACGAACAG CTGAAACATC CGCGACCTGG CGTTCACGTG AAGTGGGACG AAGCCACCTG CGGACAGCTG GTCTATCTGT ACAACGAAAC CCAGACCTAC TTTGGTGGGC GTACCGATGA GTTCTTTGCG GCGACCCGTC GCCCGGACAA CGCCCCTGCC GCAAACGCGC CGCGCAGCCT GAAGGTGGCC TCCATTGATA TTGGCGGCGG CACCACTGAC CTGGTGATCT CGCGCTATAC GCTGGATGAC GGCGAAGGCA TCAACGTCCG TATCACGCCA AAACAGCTGT TCCGCGAAGG CTTCAAAGTG 
GCGGGAGACG ACATCCTGCT GGATGTGATC CGCCTGTACC TGCAACCGGC TGTGAAAGCC GCGATCGTGA AGGTTGGCCA TAGCGATATG ACCGCAGAAT CGATGATGTC GCAGCTGTTT GGCAGCGAAT CCATCGAGGC GGGCAAGCAG GTTCTGCGTC AGCAGCTGAC CCTGCAGATC TTCGCCCCAC TGGCGCTGGC GATCCTGCAT CGCTATGAAG AGTATTCGCC GGAGAGCGGC CGCGAGCAGC TCAGCTTCAC GTTCCGCGAT CTGCTGACCG AAAACATGCC GACGCAAAAA GTGCAGGACT ACGTGAATGA CGTGGTGCGC ATGGGTCAAC TGTCCAGCGA AGCGCTCTTC TCTATTCTCG ATGTGCCGCT GGAGATCGAT CTGGCCAACC TGCACAACGA ATTCATTAAC CCACGCAGTG GCCGCATGAA CATCTGCCAC AGCCTGCGCG CCCTGTGCGA AGTGCTCTGG CACTATAACT GCGACGTGCT GCTGCTGACG GGTCGTCCGT CGCGTCTCCC GGGGATTCAG GCGCTGATCC GCCAGCTTCA GCCGGTGCCG CCATCGCGCG TGCTGCCTCT GCATGGCTAT GAGACCGGTG GCTGGTATCC GTTCAACAAG AAAGGCTGTA TCGACGACCC GAAATCTACC GCTTCGGTTG GGGCGATGCT GTACCTGTTG GCAGAGAACT CGCGGTTGAG CAGCTTCTTC TTCCGCACCC AGAACTTTGT GCCTTATTCC ACCATCCGCT ATCTCGGCAT GCTGGACGGC AACAACCTGA TCAAAGACAG CAACGTCTTT TATCGCGATA TCGATCTCGA CGCGCCGGCG TTCCAGCTTC CGCAGGGGCA AAGCTTTGAC GCGCGTGGCG AAGTGCGCAT TGGTTTCCGC CAGCTTGATA ACGAGCGCTG GCCTGCCTCT GCACTTTACA CGCTGAAAAT CGCCAACCCT AACCTCGCGA GCGAGCTGGC CGGGGATGCC ATGATGCGCA TCGAACTCAC CGCCGAGCAG GGACGCCCAC GCAACGGCAT CGAGGCGGTC AGTCCCGAGA AGTTCCGCAT TGAATCCCTC GAGACGGATC ATGCCCGTCG CAACTACAAC CGCAAAGACG TGGCCTTCCA GTTGAACACT ATGGTCGGTA ACGGTCTCAG CGAGACGCAC TACTGGCTGG ATAGCGGGAG TATCAAAAGC TAA
|
Protein sequence | MLAELTDFKK QVKIIRDSGI QFLDFAFTLP ARKEYGDFLR QNGDGAILRL VYNEREDKLQ IPAADNAAPQ FAESDYSLTT EESLHLYQGL WLPLPFFRFN PPRAFAHGPT NWARVQFHEL AEPDEKGNTW RAILIFDTKI FPDRDNTQYL APSEDDVRSG AGFALALHPH EMGDFLTLPW VDEWLREVFS TQARDVLRQH AEDIDEKLGQ KEHQAHYLNL LNILDATVAI PEVQVNDVKI RDSAIPVDLV LDIGNSRSCG ILIEEHRDDN KGLSQLYQLQ LRDLSQPQNV YNEPFDSRLE FAQAEFGKQD FSLKSGRSDA FTWPTIGRVG DEAFRMAAQR LGTEGSTGIS SPKRYLWDDQ PYSPGWRFSQ AFVKSDREPL ATAAPLLYML NDQGKLLIRL PEDERMPVFS PVYSRSSLMT MMLSEVLSQA LMQINSPAQR LKMNHASTPR RLRNVIMTVP PAMPKPERAI FEQCMLDAIR LVWKALGWEE MDDESEDNEQ LKHPRPGVHV KWDEATCGQL VYLYNETQTY FGGRTDEFFA ATRRPDNAPA ANAPRSLKVA SIDIGGGTTD LVISRYTLDD GEGINVRITP KQLFREGFKV AGDDILLDVI RLYLQPAVKA AIVKVGHSDM TAESMMSQLF GSESIEAGKQ VLRQQLTLQI FAPLALAILH RYEEYSPESG REQLSFTFRD LLTENMPTQK VQDYVNDVVR MGQLSSEALF SILDVPLEID LANLHNEFIN PRSGRMNICH SLRALCEVLW HYNCDVLLLT GRPSRLPGIQ ALIRQLQPVP PSRVLPLHGY ETGGWYPFNK KGCIDDPKST ASVGAMLYLL AENSRLSSFF FRTQNFVPYS TIRYLGMLDG NNLIKDSNVF YRDIDLDAPA FQLPQGQSFD ARGEVRIGFR QLDNERWPAS ALYTLKIANP NLASELAGDA MMRIELTAEQ GRPRNGIEAV SPEKFRIESL ETDHARRNYN RKDVAFQLNT MVGNGLSETH YWLDSGSIKS
|
| |
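The gene and protein sequences above can be cross-checked by translating the DNA. The following is a minimal sketch, not the database's own method: it builds the codon table by hand and relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code (the two differ only in permitted start codons). The helper names `clean`, `translate`, and `gc_fraction` are hypothetical. The sequence is stored on the coding strand as shown, so no reverse complement is applied, and whitespace is stripped so that the 10-base blocks and line breaks in the record do not matter.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order (standard code; table 11 uses the same assignments).
AMINO = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def clean(seq: str) -> str:
    """Remove spaces and line breaks and upper-case the sequence."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate codon by codon from the first base, stopping at the first stop codon."""
    dna = clean(seq)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    dna = clean(seq)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First three 10-base blocks of the gene sequence above.
prefix = "ATGCTCGCTG AACTTACTGA TTTTAAAAAG"
print(translate(prefix))  # MLAELTDFKK
```

The translated prefix matches the first block of the protein sequence. Passing the full gene sequence to `translate` and `gc_fraction` would allow the same comparison against the complete protein and the GC content field.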